[go: up one dir, main page]

Follow
Xixin Wu
Xixin Wu
Verified email at se.cuhk.edu.hk - Homepage
Title
Cited by
Cited by
Year
Uniaudio: An audio foundation model toward universal audio generation
D Yang, J Tian, X Tan, R Huang, S Liu, X Chang, J Shi, S Zhao, J Bian, ...
arXiv preprint arXiv:2310.00704, 2023
1982023
Speech emotion recognition using capsule networks
X Wu, S Liu, Y Cao, X Li, J Yu, D Dai, X Ma, S Hu, Z Wu, X Liu, H Meng
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
1532019
Any-to-many voice conversion with location-relative sequence-to-sequence modeling
S Liu, Y Cao, D Wang, X Wu, X Liu, H Meng
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 1717-1728, 2021
1442021
Adversarial attacks on GMM i-vector based speaker verification systems
X Li, J Zhong, X Wu, J Yu, X Liu, H Meng
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
1242020
Sail: Search-augmented instruction learning
H Luo, YS Chuang, Y Gong, T Zhang, Y Kim, X Wu, D Fox, H Meng, ...
arXiv preprint arXiv:2305.15225, 2023
1122023
Channel-wise gated res2net: Towards robust detection of synthetic speech attacks
X Li, X Wu, H Lu, X Liu, H Meng
arXiv preprint arXiv:2107.08803, 2021
1012021
Autoregressive speech synthesis without vector quantization
L Meng, L Zhou, S Liu, S Chen, B Han, S Hu, Y Liu, J Li, S Zhao, X Wu, ...
Proceedings of the 63rd Annual Meeting of the Association for Computational …, 2025
842025
Learning discriminative features from spectrograms using center loss for speech emotion recognition
D Dai, Z Wu, R Li, X Wu, J Jia, H Meng
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
732019
Development of the CUHK Dysarthric Speech Recognition System for the UA Speech Corpus.
J Yu, X Xie, S Liu, S Hu, MWY Lam, X Wu, KH Wong, X Liu, H Meng
Interspeech, 2938-2942, 2018
732018
Voice Conversion Across Arbitrary Speakers Based on a Single Target-Speaker Utterance.
S Liu, J Zhong, L Sun, X Wu, X Liu, H Meng
Interspeech, 496-500, 2018
712018
Improved end-to-end dysarthric speech recognition via meta-learning based model re-initialization
D Wang, J Yu, X Wu, L Sun, X Liu, H Meng
2021 12th International Symposium on Chinese Spoken Language Processing …, 2021
592021
End-to-end voice conversion via cross-modal knowledge distillation for dysarthric speech reconstruction
D Wang, J Yu, X Wu, S Liu, L Sun, X Liu, H Meng
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
592020
End-to-end accent conversion without using native utterances
S Liu, D Wang, Y Cao, L Sun, X Wu, S Kang, Z Wu, X Liu, D Su, D Yu, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
592020
Interpretable unified language checking
T Zhang, H Luo, YS Chuang, W Fang, L Gaitskell, T Hartvigsen, X Wu, ...
arXiv preprint arXiv:2304.03728, 2023
582023
Investigating robustness of adversarial samples detection for automatic speaker verification
X Li, N Li, J Zhong, X Wu, X Liu, D Su, D Yu, H Meng
arXiv preprint arXiv:2006.06186, 2020
572020
Uniaudio: Towards universal audio generation with large language models
D Yang, J Tian, X Tan, R Huang, S Liu, H Guo, X Chang, J Shi, J Bian, ...
Forty-first International Conference on Machine Learning, 2024
552024
End-to-end code-switched tts with mix of monolingual recordings
Y Cao, X Wu, S Liu, J Yu, X Li, Z Wu, X Liu, H Meng
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
542019
Simplespeech 2: Towards simple and efficient text-to-speech with flow-based scalar latent transformer diffusion models
D Yang, R Huang, Y Wang, H Guo, D Chong, S Liu, X Wu, H Meng
IEEE Transactions on Audio, Speech and Language Processing, 2025
492025
Rethinking Machine Ethics–Can LLMs Perform Moral Reasoning through the Lens of Moral Theories?
J Zhou, M Hu, J Li, X Zhang, X Wu, I King, H Meng
Findings of the Association for Computational Linguistics: NAACL 2024, 2227-2242, 2024
482024
Speech emotion recognition using sequential capsule networks
X Wu, Y Cao, H Lu, S Liu, D Wang, Z Wu, X Liu, H Meng
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 3280-3291, 2021
422021
The system can't perform the operation now. Try again later.
Articles 1–20