| Diffsound: Discrete diffusion model for text-to-sound generation D Yang, J Yu, H Wang, W Wang, C Weng, Y Zou, D Yu IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 1720-1733, 2023 | 441 | 2023 |
| Music source separation with band-split RNN Y Luo, J Yu IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 1893-1901, 2023 | 209 | 2023 |
| Speech emotion recognition using capsule networks X Wu, S Liu, Y Cao, X Li, J Yu, D Dai, X Ma, S Hu, Z Wu, X Liu, H Meng ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 152 | 2019 |
| Adversarial attacks on GMM i-vector based speaker verification systems X Li, J Zhong, X Wu, J Yu, X Liu, H Meng ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 123 | 2020 |
| Audio-visual recognition of overlapped speech for the lrs2 dataset J Yu, SX Zhang, J Wu, S Ghorbani, B Wu, S Kang, S Liu, X Liu, H Meng, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 117 | 2020 |
| Kimi-audio technical report D Ding, Z Ju, Y Leng, S Liu, T Liu, Z Shang, K Shen, W Song, X Tan, ... arXiv preprint arXiv:2504.18425, 2025 | 113 | 2025 |
| Recent progress in the CUHK dysarthric speech recognition system S Liu, M Geng, S Hu, X Xie, M Cui, J Yu, X Liu, H Meng IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 2267-2281, 2021 | 107 | 2021 |
| Investigation of data augmentation techniques for disordered speech recognition M Geng, X Xie, S Liu, J Yu, S Hu, X Liu, H Meng arXiv preprint arXiv:2201.05562, 2022 | 105 | 2022 |
| Development of the CUHK Dysarthric Speech Recognition System for the UA Speech Corpus. J Yu, X Xie, S Liu, S Hu, MWY Lam, X Wu, KH Wong, X Liu, H Meng Interspeech, 2938-2942, 2018 | 73 | 2018 |
| Dirichlet graph variational autoencoder J Li, J Yu, J Li, H Zhang, K Zhao, Y Rong, H Cheng, J Huang Advances in Neural Information Processing Systems 33, 5274-5283, 2020 | 71 | 2020 |
| Secap: Speech emotion captioning with large language model Y Xu, H Chen, J Yu, Q Huang, Z Wu, SX Zhang, G Li, Y Luo, R Gu Proceedings of the AAAI Conference on Artificial Intelligence 38 (17), 19323 …, 2024 | 68 | 2024 |
| The Sound Demixing Challenge 2023$\unicode {x2013} $ Music Demixing Track G Fabbro, S Uhlich, CH Lai, W Choi, M Martínez-Ramírez, W Liao, ... arXiv preprint arXiv:2308.06979, 2023 | 65* | 2023 |
| High fidelity speech enhancement with band-split rnn J Yu, Y Luo, H Chen, R Gu, C Weng arXiv preprint arXiv:2212.00406, 2022 | 59 | 2022 |
| Improved end-to-end dysarthric speech recognition via meta-learning based model re-initialization D Wang, J Yu, X Wu, L Sun, X Liu, H Meng 2021 12th International Symposium on Chinese Spoken Language Processing …, 2021 | 59 | 2021 |
| A comparative study of acoustic and linguistic features classification for Alzheimer's disease detection J Li, J Yu, Z Ye, S Wong, M Mak, B Mak, X Liu, H Meng ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 58 | 2021 |
| End-to-end voice conversion via cross-modal knowledge distillation for dysarthric speech reconstruction D Wang, J Yu, X Wu, S Liu, L Sun, X Liu, H Meng ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 58 | 2020 |
| Development of the cuhk elderly speech recognition system for neurocognitive disorder detection using the dementiabank corpus Z Ye, S Hu, J Li, X Xie, M Geng, J Yu, J Xu, B Xue, S Liu, X Liu, H Meng ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 53 | 2021 |
| End-to-end code-switched tts with mix of monolingual recordings Y Cao, X Wu, S Liu, J Yu, X Li, Z Wu, X Liu, H Meng ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 53 | 2019 |
| Muq: Self-supervised music representation learning with mel residual vector quantization H Zhu, Y Zhou, H Chen, J Yu, Z Ma, R Gu, Y Luo, W Tan, X Chen arXiv preprint arXiv:2501.01108, 2025 | 45 | 2025 |
| Gaussian process lstm recurrent neural network language models for speech recognition MWY Lam, X Chen, S Hu, J Yu, X Liu, H Meng ICASSP 2019-2019 IEEE international conference on acoustics, speech and …, 2019 | 42 | 2019 |