[go: up one dir, main page]

Follow
Jianwei Yu
Jianwei Yu
Tencent AI lab
Verified email at tencent.com
Title
Cited by
Cited by
Year
Diffsound: Discrete diffusion model for text-to-sound generation
D Yang, J Yu, H Wang, W Wang, C Weng, Y Zou, D Yu
IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 1720-1733, 2023
4412023
Music source separation with band-split RNN
Y Luo, J Yu
IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 1893-1901, 2023
2092023
Speech emotion recognition using capsule networks
X Wu, S Liu, Y Cao, X Li, J Yu, D Dai, X Ma, S Hu, Z Wu, X Liu, H Meng
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
1522019
Adversarial attacks on GMM i-vector based speaker verification systems
X Li, J Zhong, X Wu, J Yu, X Liu, H Meng
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
1232020
Audio-visual recognition of overlapped speech for the lrs2 dataset
J Yu, SX Zhang, J Wu, S Ghorbani, B Wu, S Kang, S Liu, X Liu, H Meng, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
1172020
Kimi-audio technical report
D Ding, Z Ju, Y Leng, S Liu, T Liu, Z Shang, K Shen, W Song, X Tan, ...
arXiv preprint arXiv:2504.18425, 2025
1132025
Recent progress in the CUHK dysarthric speech recognition system
S Liu, M Geng, S Hu, X Xie, M Cui, J Yu, X Liu, H Meng
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 2267-2281, 2021
1072021
Investigation of data augmentation techniques for disordered speech recognition
M Geng, X Xie, S Liu, J Yu, S Hu, X Liu, H Meng
arXiv preprint arXiv:2201.05562, 2022
1052022
Development of the CUHK Dysarthric Speech Recognition System for the UA Speech Corpus.
J Yu, X Xie, S Liu, S Hu, MWY Lam, X Wu, KH Wong, X Liu, H Meng
Interspeech, 2938-2942, 2018
732018
Dirichlet graph variational autoencoder
J Li, J Yu, J Li, H Zhang, K Zhao, Y Rong, H Cheng, J Huang
Advances in Neural Information Processing Systems 33, 5274-5283, 2020
712020
Secap: Speech emotion captioning with large language model
Y Xu, H Chen, J Yu, Q Huang, Z Wu, SX Zhang, G Li, Y Luo, R Gu
Proceedings of the AAAI Conference on Artificial Intelligence 38 (17), 19323 …, 2024
682024
The Sound Demixing Challenge 2023$\unicode {x2013} $ Music Demixing Track
G Fabbro, S Uhlich, CH Lai, W Choi, M Martínez-Ramírez, W Liao, ...
arXiv preprint arXiv:2308.06979, 2023
65*2023
High fidelity speech enhancement with band-split rnn
J Yu, Y Luo, H Chen, R Gu, C Weng
arXiv preprint arXiv:2212.00406, 2022
592022
Improved end-to-end dysarthric speech recognition via meta-learning based model re-initialization
D Wang, J Yu, X Wu, L Sun, X Liu, H Meng
2021 12th International Symposium on Chinese Spoken Language Processing …, 2021
592021
A comparative study of acoustic and linguistic features classification for Alzheimer's disease detection
J Li, J Yu, Z Ye, S Wong, M Mak, B Mak, X Liu, H Meng
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
582021
End-to-end voice conversion via cross-modal knowledge distillation for dysarthric speech reconstruction
D Wang, J Yu, X Wu, S Liu, L Sun, X Liu, H Meng
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
582020
Development of the cuhk elderly speech recognition system for neurocognitive disorder detection using the dementiabank corpus
Z Ye, S Hu, J Li, X Xie, M Geng, J Yu, J Xu, B Xue, S Liu, X Liu, H Meng
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
532021
End-to-end code-switched tts with mix of monolingual recordings
Y Cao, X Wu, S Liu, J Yu, X Li, Z Wu, X Liu, H Meng
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
532019
Muq: Self-supervised music representation learning with mel residual vector quantization
H Zhu, Y Zhou, H Chen, J Yu, Z Ma, R Gu, Y Luo, W Tan, X Chen
arXiv preprint arXiv:2501.01108, 2025
452025
Gaussian process lstm recurrent neural network language models for speech recognition
MWY Lam, X Chen, S Hu, J Yu, X Liu, H Meng
ICASSP 2019-2019 IEEE international conference on acoustics, speech and …, 2019
422019
The system can't perform the operation now. Try again later.
Articles 1–20