
Wei Zou
PKU, Samsung, Baidu, Didi, Ke
Title
Cited by
Year
GigaSpeech: An evolving, multi-domain ASR corpus with 10,000 hours of transcribed audio
G Chen, S Chai, G Wang, J Du, WQ Zhang, C Weng, D Su, D Povey, ...
arXiv preprint arXiv:2106.06909, 2021
Cited by: 376; Year: 2021
Improving transformer-based speech recognition using unsupervised pre-training
D Jiang, X Lei, W Li, N Luo, Y Hu, W Zou, X Li
arXiv preprint arXiv:1910.09932, 2019
Cited by: 111; Year: 2019
C3oT: Generating shorter chain-of-thought without compromising effectiveness
Y Kang, X Sun, L Chen, W Zou
Proceedings of the AAAI Conference on Artificial Intelligence 39 (23), 24312 …, 2025
Cited by: 109; Year: 2025
Speech SimCLR: Combining contrastive and reconstruction objective for self-supervised speech representation learning
D Jiang, W Li, M Cao, W Zou, X Li
arXiv preprint arXiv:2010.13991, 2020
Cited by: 102; Year: 2020
From LLM to conversational agent: A memory enhanced architecture with fine-tuning of large language models
N Liu, L Chen, X Tian, W Zou, K Chen, M Cui
arXiv preprint arXiv:2401.02777, 2024
Cited by: 73; Year: 2024
KeSpeech: An open source speech dataset of Mandarin and its eight subdialects
Z Tang, D Wang, Y Xu, J Sun, X Lei, S Zhao, C Wen, X Tan, C Xie, S Zhou, ...
Thirty-fifth Conference on Neural Information Processing Systems Datasets …, 2021
Cited by: 70; Year: 2021
DiDiSpeech: A large scale Mandarin speech corpus
T Guo, C Wen, D Jiang, N Luo, R Zhang, S Zhao, W Li, C Gong, W Zou, ...
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 65; Year: 2021
Towards end-to-end code-switching speech recognition
N Luo, D Jiang, S Zhao, C Gong, W Zou, X Li
arXiv preprint arXiv:1810.13091, 2018
Cited by: 65; Year: 2018
A further study of unsupervised pretraining for transformer based speech recognition
D Jiang, W Li, R Zhang, M Cao, N Luo, Y Han, W Zou, K Han, X Li
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 51; Year: 2021
Transformer based unsupervised pre-training for acoustic representation learning
R Zhang, H Wu, W Li, D Jiang, W Zou, X Li
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 44; Year: 2021
Comparable study of modeling units for end-to-end Mandarin speech recognition
W Zou, D Jiang, S Zhao, G Yang, X Li
2018 11th International Symposium on Chinese Spoken Language Processing …, 2018
Cited by: 40; Year: 2018
ChatHome: Development and evaluation of a domain-specific language model for home renovation
C Wen, X Sun, S Zhao, X Fang, L Chen, W Zou
arXiv preprint arXiv:2307.15290, 2023
Cited by: 37; Year: 2023
Audio deepfake detection system with neural stitching for ADD 2022
R Yan, C Wen, S Zhou, T Guo, W Zou, X Li
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 35; Year: 2022
DELTA: A deep learning based language technology platform
K Han, J Chen, H Zhang, H Xu, Y Peng, Y Wang, N Ding, H Deng, Y Gao, ...
arXiv preprint arXiv:1908.01853, 2019
Cited by: 16; Year: 2019
SARI: Structured audio reasoning via curriculum-guided reinforcement learning
C Wen, T Guo, S Zhao, W Zou, X Li
arXiv preprint arXiv:2504.15900, 2025
Cited by: 12; Year: 2025
Semantic data augmentation for end-to-end Mandarin speech recognition
J Sun, Z Tang, H Yin, W Wang, X Zhao, S Zhao, X Lei, W Zou, X Li
arXiv preprint arXiv:2104.12521, 2021
Cited by: 12; Year: 2021
Audio-visual wake word spotting system for MISP Challenge 2021
Y Xu, J Sun, Y Han, S Zhao, C Mei, T Guo, S Zhou, C Xie, W Zou, X Li
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 11; Year: 2022
GigaSpeech: An Evolving
G Chen, S Chai, G Wang, J Du, WQ Zhang, C Weng, D Su, D Povey, ...
Multi-domain ASR Corpus with 10, 2021-1965, 2021
Cited by: 11; Year: 2021
Why Not Transform Chat Large Language Models to Non-English?
X Geng, M Zhu, J Li, Z Lai, W Zou, S She, J Guo, X Zhao, Y Li, Y Li, C Su, ...
arXiv preprint arXiv:2405.13923, 2024
Cited by: 10; Year: 2024
DUMA: A dual-mind conversational agent with fast and slow thinking
X Tian, L Chen, N Liu, Y Liu, W Zou, K Chen, M Cui
arXiv preprint arXiv:2310.18075, 2023
Cited by: 7; Year: 2023