| Consistency Trajectory Models: Learning Probability Flow ODE Trajectory of Diffusion D Kim, CH Lai, WH Liao, N Murata, Y Takida, T Uesaka, Y He, Y Mitsufuji, ... International Conference on Learning Representations (ICLR), 2023 | 325 | 2023 |
| Manifold Preserving Guided Diffusion Y He, N Murata, CH Lai, Y Takida, T Uesaka, D Kim, WH Liao, Y Mitsufuji, ... International Conference on Learning Representations (ICLR), 2023 | 125 | 2023 |
| SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization Y Takida, T Shibuya, WH Liao, CH Lai, J Ohmura, T Uesaka, N Murata, ... International Conference on Machine Learning (ICML), 20987-21012, 2022 | 100 | 2022 |
| GibbsDDRM: A Partially Collapsed Gibbs Sampler for Solving Blind Inverse Problems with Denoising Diffusion Restoration N Murata, K Saito, CH Lai, Y Takida, T Uesaka, Y Mitsufuji, S Ermon International Conference on Machine Learning (ICML), 25501-25522, 2023 | 82 | 2023 |
| FP-Diffusion: Improving Score-based Diffusion Models by Enforcing the Underlying Score Fokker-Planck Equation CH Lai, Y Takida, N Murata, T Uesaka, Y Mitsufuji, S Ermon International Conference on Machine Learning (ICML), 18365-18398, 2023 | 56* | 2023 |
| Automatic Piano Transcription with Hierarchical Frequency-Time Transformer K Toyama, T Akama, Y Ikemiya, Y Takida, WH Liao, Y Mitsufuji 24th International Society for Music Information Retrieval Conference (ISMIR), 2023 | 47 | 2023 |
| Preventing oversmoothing in VAE via generalized variance parameterization Y Takida, WH Liao, CH Lai, T Uesaka, S Takahashi, Y Mitsufuji Neurocomputing 509, 137-156, 2022 | 45* | 2022 |
| Unsupervised vocal dereverberation with diffusion-based generative models K Saito, N Murata, T Uesaka, CH Lai, Y Takida, T Fukui, Y Mitsufuji IEEE International Conference on Acoustics, Speech and Signal Processing …, 2023 | 34 | 2023 |
| SAN: Inducing Metrizability of GAN with Discriminative Normalized Linear Layer Y Takida, M Imaizumi, T Shibuya, CH Lai, T Uesaka, N Murata, Y Mitsufuji International Conference on Learning Representations (ICLR), 2023 | 30 | 2023 |
| Diffiner: A Versatile Diffusion-based Generative Refiner for Speech Enhancement R Sawata, N Murata, Y Takida, T Uesaka, T Shibuya, S Takahashi, ... Interspeech, 2023 | 27 | 2023 |
| HQ-VAE: Hierarchical Discrete Representation Learning with Variational Bayes Y Takida, Y Ikemiya, T Shibuya, K Shimada, W Choi, CH Lai, N Murata, ... Transactions on Machine Learning Research (TMLR), 2023 | 25 | 2023 |
| PaGoDA: Progressive growing of a one-step generator from a low-resolution diffusion teacher D Kim, CH Lai, WH Liao, Y Takida, N Murata, T Uesaka, Y Mitsufuji, ... Annual Conference on Neural Information Processing Systems (NeurIPS), 2024 | 22 | 2024 |
| BigVSAN: Enhancing GAN-based Neural Vocoders with Slicing Adversarial Network T Shibuya, Y Takida, Y Mitsufuji ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2023 | 22 | 2023 |
| Jump your steps: Optimizing sampling schedule of discrete diffusion models YH Park, CH Lai, S Hayakawa, Y Takida, Y Mitsufuji International Conference on Learning Representations (ICLR), 2024 | 21 | 2024 |
| Exterior and interior sound field separation using convex optimization: Comparison of signal models Y Takida, S Koyama, H Saruwatari 2018 26th European Signal Processing Conference (EUSIPCO), 2549-2553, 2018 | 18 | 2018 |
| SoundCTM: Uniting score-based and consistency models for text-to-sound generation K Saito, D Kim, T Shibuya, CH Lai, Z Zhong, Y Takida, Y Mitsufuji International Conference on Learning Representations (ICLR), 2024 | 16 | 2024 |
| Distillation of Discrete Diffusion through Dimensional Correlations S Hayakawa, Y Takida, M Imaizumi, H Wakaki, Y Mitsufuji International Conference on Machine Learning (ICML), 2024 | 11 | 2024 |
| On the equivalence of consistency-type models: Consistency models, consistent diffusion models, and Fokker-Planck regularization CH Lai, Y Takida, T Uesaka, N Murata, Y Mitsufuji, S Ermon International Conference on Machine Learning 2023 Workshop SPIGM, 2023 | 11 | 2023 |
| TraSCE: Trajectory steering for concept erasure A Jain, Y Kobayashi, T Shibuya, Y Takida, N Memon, J Togelius, ... arXiv preprint arXiv:2412.07658, 2024 | 9 | 2024 |
| Variable Bitrate Residual Vector Quantization for Audio Coding Y Chae, W Choi, Y Takida, J Koo, Y Ikemiya, Z Zhong, KW Cheuk, ... ICASSP 2025-2025 IEEE International Conference on Acoustics, Speech and …, 2025 | 8 | 2025 |