Zhendong Peng

Cited by

	All	Since 2021
Citations	1090	1089
h-index	11	11
i10-index	11	11

400

200

100

300

20212022202320242025202622 126 222 385 317 17

Public access

View all

3 articles

1 article

available

not available

Based on funding mandates

Co-authors

Lei XieNorthwestern Polytechnical UniversityVerified email at nwpu.edu.cn
Shu-Tao XiaSIGS, Tsinghua UniversityVerified email at sz.tsinghua.edu.cn

Zhendong Peng

Tsinghua University

Verified email at tsinghua.org.cn

ASR


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
WeNet: Production Oriented Streaming and Non-Streaming End-to-End Speech Recognition Toolkit. Z Yao, Di Wu 0061, X Wang, B Zhang, F Yu, C Yang, Z Peng, X Chen, ... interspeech 2021, 4054-4058, 2021	352	2021
Wenetspeech: A 10000+ hours multi-domain mandarin corpus for speech recognition B Zhang, H Lv, P Guo, Q Shao, C Yang, L Xie, X Xu, H Bu, X Chen, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	323	2022
Wenet 2.0: More productive end-to-end speech recognition toolkit B Zhang, D Wu, Z Peng, X Song, Z Yao, H Lv, L Xie, C Yang, F Pan, J Niu arXiv preprint arXiv:2203.15455, 2022	146	2022
U2++: Unified two-pass bidirectional end-to-end model for speech recognition D Wu, B Zhang, C Yang, Z Peng, W Xia, X Chen, X Lei arXiv preprint arXiv:2106.05642, 2021	69	2021
Zeroprompt: Streaming acoustic encoders are zero-shot masked lms X Song, D Wu, B Zhang, Z Peng, B Dang, F Pan, Z Wu arXiv preprint arXiv:2305.10649, 2023	32	2023
Wenet: Production first and production ready end-to-end speech recognition toolkit B Zhang, D Wu, C Yang, X Chen, Z Peng, X Wang, Z Yao, X Wang, F Yu, ... arXiv e-prints, arXiv: 2102.01547, 2021	30	2021
Lightgrad: Lightweight diffusion probabilistic model for text-to-speech J Chen, X Song, Z Peng, B Zhang, F Pan, Z Wu ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	28	2023
ABFL: an autoencoder based practical approach for software fault localization Z Peng, X Xiao, G Hu, AK Sangaiah, M Atiquzzaman, S Xia Information sciences 510, 108-121, 2020	27	2020
Branch-ECAPA-TDNN: A Parallel Branch Architecture to Capture Local and Global Features for Speaker Verification. J Yao, C Liang, Z Peng, B Zhang, XL Zhang INTERSPEECH, 1943-1947, 2023	26	2023
Fast-u2++: Fast and accurate end-to-end speech recognition in joint ctc/attention frames C Liang, XL Zhang, BB Zhang, D Wu, S Li, X Song, Z Peng, F Pan ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	15	2023
Trimtail: Low-latency streaming asr with simple but effective spectrogram-level length penalty X Song, D Wu, Z Wu, B Zhang, Y Zhang, Z Peng, W Li, F Pan, C Zhu ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	13	2023
Touchtts: An embarrassingly simple tts framework that everyone can touch X Song, M Xing, C Ma, S Li, D Wu, B Zhang, F Pan, D Zhou, Y Zhang, ... arXiv preprint arXiv:2412.08237, 2024	9	2024
U2++ moe: Scaling 4.7 x parameters with minimal impact on rtf X Song, D Wu, B Zhang, D Zhou, Z Peng, B Dang, F Pan, C Yang arXiv preprint arXiv:2404.16407, 2024	9	2024
TouchASP: Elastic Automatic Speech Perception that Everyone Can Touch X Song, C Liang, B Zhang, P Zhang, ZY Wang, Y Ma, M Xu, L Wang, ... arXiv preprint arXiv:2412.15622, 2024	3	2024
Fusionformer: Fusing operations in transformer for efficient streaming speech recognition X Song, D Wu, B Zhang, Z Wu, W Li, D Li, P Zhang, Z Peng, F Pan, C Zhu, ... arXiv preprint arXiv:2210.17079, 2022	2	2022
Non-local self-attention structure for function approximation in deep reinforcement learning Z Wang, X Xiao, G Hu, Y Yao, D Zhang, Z Peng, Q Li, S Xia ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	2	2019
MELA-TTS: Joint transformer-diffusion model with representation alignment for speech synthesis K An, Z Zhang, C Gao, Y Li, Z Peng, H Wang, Z Du, H Zhao, Z Gao, X Li arXiv preprint arXiv:2509.14784, 2025	1	2025
FunAudio-ASR Technical Report K An, Y Chen, C Deng, C Gao, Z Gao, B Gong, X Li, Y Li, X Lv, Y Ji, ... arXiv e-prints, arXiv: 2509.12508, 2025	1	2025
Adapting Whisper for Streaming Speech Recognition via Two-Pass Decoding H Zhou, X Song, B Fahy, Q Song, B Zhang, Z Peng, A Wadhawan, ... arXiv preprint arXiv:2506.12154, 2025	1	2025
Hydraformer: One Encoder for All Subsampling Rates Y Xu, X Song, Z Wu, D Wu, Z Peng, B Zhang 2024 IEEE International Conference on Multimedia and Expo (ICME), 1-6, 2024	1	2024

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors