Jianwei Yu

Cited by

	All	Since 2021
Citations	3095	2970
h-index	27	27
i10-index	65	64

1100

550

275

825

2019202020212022202320242025202628 90 224 357 480 761 1100 42

Public access

View all

33 articles

0 articles

available

not available

Based on funding mandates

Jianwei Yu

Tencent AI lab

Verified email at tencent.com

ASR


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Diffsound: Discrete diffusion model for text-to-sound generation D Yang, J Yu, H Wang, W Wang, C Weng, Y Zou, D Yu IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 1720-1733, 2023	441	2023
Music source separation with band-split RNN Y Luo, J Yu IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 1893-1901, 2023	209	2023
Speech emotion recognition using capsule networks X Wu, S Liu, Y Cao, X Li, J Yu, D Dai, X Ma, S Hu, Z Wu, X Liu, H Meng ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	152	2019
Adversarial attacks on GMM i-vector based speaker verification systems X Li, J Zhong, X Wu, J Yu, X Liu, H Meng ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	123	2020
Audio-visual recognition of overlapped speech for the lrs2 dataset J Yu, SX Zhang, J Wu, S Ghorbani, B Wu, S Kang, S Liu, X Liu, H Meng, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	117	2020
Kimi-audio technical report D Ding, Z Ju, Y Leng, S Liu, T Liu, Z Shang, K Shen, W Song, X Tan, ... arXiv preprint arXiv:2504.18425, 2025	113	2025
Recent progress in the CUHK dysarthric speech recognition system S Liu, M Geng, S Hu, X Xie, M Cui, J Yu, X Liu, H Meng IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 2267-2281, 2021	107	2021
Investigation of data augmentation techniques for disordered speech recognition M Geng, X Xie, S Liu, J Yu, S Hu, X Liu, H Meng arXiv preprint arXiv:2201.05562, 2022	105	2022
Development of the CUHK Dysarthric Speech Recognition System for the UA Speech Corpus. J Yu, X Xie, S Liu, S Hu, MWY Lam, X Wu, KH Wong, X Liu, H Meng Interspeech, 2938-2942, 2018	73	2018
Dirichlet graph variational autoencoder J Li, J Yu, J Li, H Zhang, K Zhao, Y Rong, H Cheng, J Huang Advances in Neural Information Processing Systems 33, 5274-5283, 2020	71	2020
Secap: Speech emotion captioning with large language model Y Xu, H Chen, J Yu, Q Huang, Z Wu, SX Zhang, G Li, Y Luo, R Gu Proceedings of the AAAI Conference on Artificial Intelligence 38 (17), 19323 …, 2024	68	2024
The Sound Demixing Challenge 2023$\unicode {x2013} $ Music Demixing Track G Fabbro, S Uhlich, CH Lai, W Choi, M Martínez-Ramírez, W Liao, ... arXiv preprint arXiv:2308.06979, 2023	65*	2023
High fidelity speech enhancement with band-split rnn J Yu, Y Luo, H Chen, R Gu, C Weng arXiv preprint arXiv:2212.00406, 2022	59	2022
Improved end-to-end dysarthric speech recognition via meta-learning based model re-initialization D Wang, J Yu, X Wu, L Sun, X Liu, H Meng 2021 12th International Symposium on Chinese Spoken Language Processing …, 2021	59	2021
A comparative study of acoustic and linguistic features classification for Alzheimer's disease detection J Li, J Yu, Z Ye, S Wong, M Mak, B Mak, X Liu, H Meng ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	58	2021
End-to-end voice conversion via cross-modal knowledge distillation for dysarthric speech reconstruction D Wang, J Yu, X Wu, S Liu, L Sun, X Liu, H Meng ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	58	2020
Development of the cuhk elderly speech recognition system for neurocognitive disorder detection using the dementiabank corpus Z Ye, S Hu, J Li, X Xie, M Geng, J Yu, J Xu, B Xue, S Liu, X Liu, H Meng ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	53	2021
End-to-end code-switched tts with mix of monolingual recordings Y Cao, X Wu, S Liu, J Yu, X Li, Z Wu, X Liu, H Meng ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	53	2019
Muq: Self-supervised music representation learning with mel residual vector quantization H Zhu, Y Zhou, H Chen, J Yu, Z Ma, R Gu, Y Luo, W Tan, X Chen arXiv preprint arXiv:2501.01108, 2025	45	2025
Gaussian process lstm recurrent neural network language models for speech recognition MWY Lam, X Chen, S Hu, J Yu, X Liu, H Meng ICASSP 2019-2019 IEEE international conference on acoustics, speech and …, 2019	42	2019

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by