Wei Zou

Cited by

	All	Since 2021
Citations	1282	1222
h-index	14	14
i10-index	19	19

Citations per year
Year	Citations
2019	13
2020	41
2021	102
2022	159
2023	210
2024	275
2025	443
2026	29

Public access

0 articles

1 article

available

not available

Based on funding mandates

Co-authors

Shuaijiang Zhao	KE DIDI BAIDU PKU	Verified email at pku.edu.cn
Dongwei Jiang	Johns Hopkins University	Verified email at jhu.edu
Cheng Wen（文成）		Verified email at ke.com
Jiayu DU	Alibaba DAMO Academy	Verified email at alibaba-inc.com
Guanbo Wang	Johns Hopkins University	Verified email at jhu.edu
Shuran Zhou	University of Washington	Verified email at uw.edu
Kun Han	Facebook	Verified email at cse.ohio-state.edu
Jan "Yenda" Trmal	AppTek AI	Verified email at apptek.com
Dan Su	Tencent AI Lab	Verified email at tencent.com
Daniel Povey	Chief Speech Scientist, Xiaomi Corp.	Verified email at xiaomi.com
Wei-Qiang Zhang (张卫强)	Tsinghua University (清华大学)	Verified email at tsinghua.edu.cn
zhao you	tencent ai-lab	Verified email at tencent.com
Shinji Watanabe	Carnegie Mellon University	Verified email at cmu.edu
Guoguo Chen	Seasalt.ai, Vobil.com, Baidu, KITT.AI	Verified email at seasalt.ai
Sanjeev Khudanpur	The Johns Hopkins University	Verified email at jhu.edu
Chao Weng	Anuttacon	Verified email at anuttacon.com
Zhiyuan Tang	Tencent, Tsinghua University, University of Chinese Academy of Sciences	Verified email at tsinghua.edu.cn
Haiyang Xu	Tongyi Lab, Alibaba Group, DIDI AI LABS, SEU	Verified email at seu.edu.cn
Ying Lyu	DiDi Research America, University of Southern California	Verified email at airbnb.com
Longbiao Wang	Professor, Tianjin University	Verified email at tju.edu.cn

Wei Zou

PKU, Samsung, Baidu, Didi, Ke

No verified email

Speech, NLP, LLM, Multimodal


Title	Authors	Venue	Cited by	Year
GigaSpeech: An evolving, multi-domain ASR corpus with 10,000 hours of transcribed audio	G Chen, S Chai, G Wang, J Du, WQ Zhang, C Weng, D Su, D Povey, ...	arXiv preprint arXiv:2106.06909, 2021	376	2021
Improving transformer-based speech recognition using unsupervised pre-training	D Jiang, X Lei, W Li, N Luo, Y Hu, W Zou, X Li	arXiv preprint arXiv:1910.09932, 2019	111	2019
C3oT: Generating shorter chain-of-thought without compromising effectiveness	Y Kang, X Sun, L Chen, W Zou	Proceedings of the AAAI Conference on Artificial Intelligence 39 (23), 24312 …, 2025	109	2025
Speech SimCLR: Combining contrastive and reconstruction objective for self-supervised speech representation learning	D Jiang, W Li, M Cao, W Zou, X Li	arXiv preprint arXiv:2010.13991, 2020	102	2020
From LLM to conversational agent: A memory enhanced architecture with fine-tuning of large language models	N Liu, L Chen, X Tian, W Zou, K Chen, M Cui	arXiv preprint arXiv:2401.02777, 2024	73	2024
KeSpeech: An open source speech dataset of Mandarin and its eight subdialects	Z Tang, D Wang, Y Xu, J Sun, X Lei, S Zhao, C Wen, X Tan, C Xie, S Zhou, ...	Thirty-fifth Conference on Neural Information Processing Systems Datasets …, 2021	70	2021
DiDiSpeech: A large scale Mandarin speech corpus	T Guo, C Wen, D Jiang, N Luo, R Zhang, S Zhao, W Li, C Gong, W Zou, ...	ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	65	2021
Towards end-to-end code-switching speech recognition	N Luo, D Jiang, S Zhao, C Gong, W Zou, X Li	arXiv preprint arXiv:1810.13091, 2018	65	2018
A further study of unsupervised pretraining for transformer based speech recognition	D Jiang, W Li, R Zhang, M Cao, N Luo, Y Han, W Zou, K Han, X Li	ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	51	2021
Transformer based unsupervised pre-training for acoustic representation learning	R Zhang, H Wu, W Li, D Jiang, W Zou, X Li	ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	44	2021
Comparable study of modeling units for end-to-end Mandarin speech recognition	W Zou, D Jiang, S Zhao, G Yang, X Li	2018 11th International Symposium on Chinese Spoken Language Processing …, 2018	40	2018
ChatHome: Development and evaluation of a domain-specific language model for home renovation	C Wen, X Sun, S Zhao, X Fang, L Chen, W Zou	arXiv preprint arXiv:2307.15290, 2023	37	2023
Audio deepfake detection system with neural stitching for ADD 2022	R Yan, C Wen, S Zhou, T Guo, W Zou, X Li	ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	35	2022
DELTA: A deep learning based language technology platform	K Han, J Chen, H Zhang, H Xu, Y Peng, Y Wang, N Ding, H Deng, Y Gao, ...	arXiv preprint arXiv:1908.01853, 2019	16	2019
SARI: Structured audio reasoning via curriculum-guided reinforcement learning	C Wen, T Guo, S Zhao, W Zou, X Li	arXiv preprint arXiv:2504.15900, 2025	12	2025
Semantic data augmentation for end-to-end Mandarin speech recognition	J Sun, Z Tang, H Yin, W Wang, X Zhao, S Zhao, X Lei, W Zou, X Li	arXiv preprint arXiv:2104.12521, 2021	12	2021
Audio-visual wake word spotting system for MISP Challenge 2021	Y Xu, J Sun, Y Han, S Zhao, C Mei, T Guo, S Zhou, C Xie, W Zou, X Li	ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	11	2022
GigaSpeech: An Evolving	G Chen, S Chai, G Wang, J Du, WQ Zhang, C Weng, D Su, D Povey, ...	Multi-domain ASR Corpus with 10, 2021-1965, 2021	11	2021
Why Not Transform Chat Large Language Models to Non-English?	X Geng, M Zhu, J Li, Z Lai, W Zou, S She, J Guo, X Zhao, Y Li, Y Li, C Su, ...	arXiv preprint arXiv:2405.13923, 2024	10	2024
DUMA: A dual-mind conversational agent with fast and slow thinking	X Tian, L Chen, N Liu, Y Liu, W Zou, K Chen, M Cui	arXiv preprint arXiv:2310.18075, 2023	7	2023
