| Uniaudio: An audio foundation model toward universal audio generation D Yang, J Tian, X Tan, R Huang, S Liu, X Chang, J Shi, S Zhao, J Bian, ... arXiv preprint arXiv:2310.00704, 2023 | 198 | 2023 |
| Speech emotion recognition using capsule networks X Wu, S Liu, Y Cao, X Li, J Yu, D Dai, X Ma, S Hu, Z Wu, X Liu, H Meng ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 153 | 2019 |
| Any-to-many voice conversion with location-relative sequence-to-sequence modeling S Liu, Y Cao, D Wang, X Wu, X Liu, H Meng IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 1717-1728, 2021 | 144 | 2021 |
| Adversarial attacks on GMM i-vector based speaker verification systems X Li, J Zhong, X Wu, J Yu, X Liu, H Meng ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 124 | 2020 |
| Sail: Search-augmented instruction learning H Luo, YS Chuang, Y Gong, T Zhang, Y Kim, X Wu, D Fox, H Meng, ... arXiv preprint arXiv:2305.15225, 2023 | 112 | 2023 |
| Channel-wise gated res2net: Towards robust detection of synthetic speech attacks X Li, X Wu, H Lu, X Liu, H Meng arXiv preprint arXiv:2107.08803, 2021 | 101 | 2021 |
| Autoregressive speech synthesis without vector quantization L Meng, L Zhou, S Liu, S Chen, B Han, S Hu, Y Liu, J Li, S Zhao, X Wu, ... Proceedings of the 63rd Annual Meeting of the Association for Computational …, 2025 | 84 | 2025 |
| Learning discriminative features from spectrograms using center loss for speech emotion recognition D Dai, Z Wu, R Li, X Wu, J Jia, H Meng ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 73 | 2019 |
| Development of the CUHK Dysarthric Speech Recognition System for the UA Speech Corpus. J Yu, X Xie, S Liu, S Hu, MWY Lam, X Wu, KH Wong, X Liu, H Meng Interspeech, 2938-2942, 2018 | 73 | 2018 |
| Voice Conversion Across Arbitrary Speakers Based on a Single Target-Speaker Utterance. S Liu, J Zhong, L Sun, X Wu, X Liu, H Meng Interspeech, 496-500, 2018 | 71 | 2018 |
| Improved end-to-end dysarthric speech recognition via meta-learning based model re-initialization D Wang, J Yu, X Wu, L Sun, X Liu, H Meng 2021 12th International Symposium on Chinese Spoken Language Processing …, 2021 | 59 | 2021 |
| End-to-end voice conversion via cross-modal knowledge distillation for dysarthric speech reconstruction D Wang, J Yu, X Wu, S Liu, L Sun, X Liu, H Meng ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 59 | 2020 |
| End-to-end accent conversion without using native utterances S Liu, D Wang, Y Cao, L Sun, X Wu, S Kang, Z Wu, X Liu, D Su, D Yu, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 59 | 2020 |
| Interpretable unified language checking T Zhang, H Luo, YS Chuang, W Fang, L Gaitskell, T Hartvigsen, X Wu, ... arXiv preprint arXiv:2304.03728, 2023 | 58 | 2023 |
| Investigating robustness of adversarial samples detection for automatic speaker verification X Li, N Li, J Zhong, X Wu, X Liu, D Su, D Yu, H Meng arXiv preprint arXiv:2006.06186, 2020 | 57 | 2020 |
| Uniaudio: Towards universal audio generation with large language models D Yang, J Tian, X Tan, R Huang, S Liu, H Guo, X Chang, J Shi, J Bian, ... Forty-first International Conference on Machine Learning, 2024 | 55 | 2024 |
| End-to-end code-switched tts with mix of monolingual recordings Y Cao, X Wu, S Liu, J Yu, X Li, Z Wu, X Liu, H Meng ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 54 | 2019 |
| Simplespeech 2: Towards simple and efficient text-to-speech with flow-based scalar latent transformer diffusion models D Yang, R Huang, Y Wang, H Guo, D Chong, S Liu, X Wu, H Meng IEEE Transactions on Audio, Speech and Language Processing, 2025 | 49 | 2025 |
| Rethinking Machine Ethics–Can LLMs Perform Moral Reasoning through the Lens of Moral Theories? J Zhou, M Hu, J Li, X Zhang, X Wu, I King, H Meng Findings of the Association for Computational Linguistics: NAACL 2024, 2227-2242, 2024 | 48 | 2024 |
| Speech emotion recognition using sequential capsule networks X Wu, Y Cao, H Lu, S Liu, D Wang, Z Wu, X Liu, H Meng IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 3280-3291, 2021 | 42 | 2021 |