Yuhta Takida
Sony AI
Verified email at sony.com
Title
Cited by
Year
Consistency Trajectory Models: Learning Probability Flow ODE Trajectory of Diffusion
D Kim, CH Lai, WH Liao, N Murata, Y Takida, T Uesaka, Y He, Y Mitsufuji, ...
International Conference on Learning Representations (ICLR), 2023
Cited by 325 (2023)
Manifold Preserving Guided Diffusion
Y He, N Murata, CH Lai, Y Takida, T Uesaka, D Kim, WH Liao, Y Mitsufuji, ...
International Conference on Learning Representations (ICLR), 2023
Cited by 125 (2023)
SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization
Y Takida, T Shibuya, WH Liao, CH Lai, J Ohmura, T Uesaka, N Murata, ...
International Conference on Machine Learning (ICML), 20987-21012, 2022
Cited by 100 (2022)
GibbsDDRM: A Partially Collapsed Gibbs Sampler for Solving Blind Inverse Problems with Denoising Diffusion Restoration
N Murata, K Saito, CH Lai, Y Takida, T Uesaka, Y Mitsufuji, S Ermon
International Conference on Machine Learning (ICML), 25501-25522, 2023
Cited by 82 (2023)
FP-Diffusion: Improving Score-based Diffusion Models by Enforcing the Underlying Score Fokker-Planck Equation
CH Lai, Y Takida, N Murata, T Uesaka, Y Mitsufuji, S Ermon
International Conference on Machine Learning (ICML), 18365-18398, 2023
Cited by 56* (2023)
Automatic Piano Transcription with Hierarchical Frequency-Time Transformer
K Toyama, T Akama, Y Ikemiya, Y Takida, WH Liao, Y Mitsufuji
24th International Society for Music Information Retrieval Conference (ISMIR), 2023
Cited by 47 (2023)
Preventing oversmoothing in VAE via generalized variance parameterization
Y Takida, WH Liao, CH Lai, T Uesaka, S Takahashi, Y Mitsufuji
Neurocomputing 509, 137-156, 2022
Cited by 45* (2022)
Unsupervised vocal dereverberation with diffusion-based generative models
K Saito, N Murata, T Uesaka, CH Lai, Y Takida, T Fukui, Y Mitsufuji
IEEE International Conference on Acoustics, Speech and Signal Processing …, 2023
Cited by 34 (2023)
SAN: Inducing Metrizability of GAN with Discriminative Normalized Linear Layer
Y Takida, M Imaizumi, T Shibuya, CH Lai, T Uesaka, N Murata, Y Mitsufuji
International Conference on Learning Representations (ICLR), 2023
Cited by 30 (2023)
Diffiner: A Versatile Diffusion-based Generative Refiner for Speech Enhancement
R Sawata, N Murata, Y Takida, T Uesaka, T Shibuya, S Takahashi, ...
Interspeech, 2023
Cited by 27 (2023)
HQ-VAE: Hierarchical Discrete Representation Learning with Variational Bayes
Y Takida, Y Ikemiya, T Shibuya, K Shimada, W Choi, CH Lai, N Murata, ...
Transactions on Machine Learning Research (TMLR), 2023
Cited by 25 (2023)
PaGoDA: Progressive growing of a one-step generator from a low-resolution diffusion teacher
D Kim, CH Lai, WH Liao, Y Takida, N Murata, T Uesaka, Y Mitsufuji, ...
Annual Conference on Neural Information Processing Systems (NeurIPS), 2024
Cited by 22 (2024)
BigVSAN: Enhancing GAN-based Neural Vocoders with Slicing Adversarial Network
T Shibuya, Y Takida, Y Mitsufuji
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by 22 (2023)
Jump your steps: Optimizing sampling schedule of discrete diffusion models
YH Park, CH Lai, S Hayakawa, Y Takida, Y Mitsufuji
International Conference on Learning Representations (ICLR), 2024
Cited by 21 (2024)
Exterior and interior sound field separation using convex optimization: Comparison of signal models
Y Takida, S Koyama, H Saruwatari
2018 26th European Signal Processing Conference (EUSIPCO), 2549-2553, 2018
Cited by 18 (2018)
SoundCTM: Uniting score-based and consistency models for text-to-sound generation
K Saito, D Kim, T Shibuya, CH Lai, Z Zhong, Y Takida, Y Mitsufuji
International Conference on Learning Representations (ICLR), 2024
Cited by 16 (2024)
Distillation of Discrete Diffusion through Dimensional Correlations
S Hayakawa, Y Takida, M Imaizumi, H Wakaki, Y Mitsufuji
International Conference on Machine Learning (ICML), 2024
Cited by 11 (2024)
On the equivalence of consistency-type models: Consistency models, consistent diffusion models, and Fokker-Planck regularization
CH Lai, Y Takida, T Uesaka, N Murata, Y Mitsufuji, S Ermon
International Conference on Machine Learning 2023 Workshop SPIGM, 2023
Cited by 11 (2023)
TraSCE: Trajectory steering for concept erasure
A Jain, Y Kobayashi, T Shibuya, Y Takida, N Memon, J Togelius, ...
arXiv preprint arXiv:2412.07658, 2024
Cited by 9 (2024)
Variable Bitrate Residual Vector Quantization for Audio Coding
Y Chae, W Choi, Y Takida, J Koo, Y Ikemiya, Z Zhong, KW Cheuk, ...
ICASSP 2025-2025 IEEE International Conference on Acoustics, Speech and …, 2025
Cited by 8 (2025)