| WeNet: Production Oriented Streaming and Non-Streaming End-to-End Speech Recognition Toolkit. Z Yao, D Wu, X Wang, B Zhang, F Yu, C Yang, Z Peng, X Chen, ... Interspeech 2021, 4054-4058, 2021 | 352 | 2021 |
| WenetSpeech: A 10000+ hours multi-domain Mandarin corpus for speech recognition B Zhang, H Lv, P Guo, Q Shao, C Yang, L Xie, X Xu, H Bu, X Chen, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 323 | 2022 |
| WeNet 2.0: More productive end-to-end speech recognition toolkit B Zhang, D Wu, Z Peng, X Song, Z Yao, H Lv, L Xie, C Yang, F Pan, J Niu arXiv preprint arXiv:2203.15455, 2022 | 146 | 2022 |
| U2++: Unified two-pass bidirectional end-to-end model for speech recognition D Wu, B Zhang, C Yang, Z Peng, W Xia, X Chen, X Lei arXiv preprint arXiv:2106.05642, 2021 | 69 | 2021 |
| ZeroPrompt: Streaming acoustic encoders are zero-shot masked LMs X Song, D Wu, B Zhang, Z Peng, B Dang, F Pan, Z Wu arXiv preprint arXiv:2305.10649, 2023 | 32 | 2023 |
| WeNet: Production first and production ready end-to-end speech recognition toolkit B Zhang, D Wu, C Yang, X Chen, Z Peng, X Wang, Z Yao, X Wang, F Yu, ... arXiv e-prints, arXiv:2102.01547, 2021 | 30 | 2021 |
| LightGrad: Lightweight diffusion probabilistic model for text-to-speech J Chen, X Song, Z Peng, B Zhang, F Pan, Z Wu ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 28 | 2023 |
| ABFL: an autoencoder based practical approach for software fault localization Z Peng, X Xiao, G Hu, AK Sangaiah, M Atiquzzaman, S Xia Information sciences 510, 108-121, 2020 | 27 | 2020 |
| Branch-ECAPA-TDNN: A Parallel Branch Architecture to Capture Local and Global Features for Speaker Verification. J Yao, C Liang, Z Peng, B Zhang, XL Zhang INTERSPEECH, 1943-1947, 2023 | 26 | 2023 |
| Fast-U2++: Fast and accurate end-to-end speech recognition in joint CTC/attention frames C Liang, XL Zhang, BB Zhang, D Wu, S Li, X Song, Z Peng, F Pan ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 15 | 2023 |
| TrimTail: Low-latency streaming ASR with simple but effective spectrogram-level length penalty X Song, D Wu, Z Wu, B Zhang, Y Zhang, Z Peng, W Li, F Pan, C Zhu ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 13 | 2023 |
| TouchTTS: An embarrassingly simple TTS framework that everyone can touch X Song, M Xing, C Ma, S Li, D Wu, B Zhang, F Pan, D Zhou, Y Zhang, ... arXiv preprint arXiv:2412.08237, 2024 | 9 | 2024 |
| U2++ MoE: Scaling 4.7x parameters with minimal impact on RTF X Song, D Wu, B Zhang, D Zhou, Z Peng, B Dang, F Pan, C Yang arXiv preprint arXiv:2404.16407, 2024 | 9 | 2024 |
| TouchASP: Elastic Automatic Speech Perception that Everyone Can Touch X Song, C Liang, B Zhang, P Zhang, ZY Wang, Y Ma, M Xu, L Wang, ... arXiv preprint arXiv:2412.15622, 2024 | 3 | 2024 |
| FusionFormer: Fusing operations in transformer for efficient streaming speech recognition X Song, D Wu, B Zhang, Z Wu, W Li, D Li, P Zhang, Z Peng, F Pan, C Zhu, ... arXiv preprint arXiv:2210.17079, 2022 | 2 | 2022 |
| Non-local self-attention structure for function approximation in deep reinforcement learning Z Wang, X Xiao, G Hu, Y Yao, D Zhang, Z Peng, Q Li, S Xia ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 2 | 2019 |
| MELA-TTS: Joint transformer-diffusion model with representation alignment for speech synthesis K An, Z Zhang, C Gao, Y Li, Z Peng, H Wang, Z Du, H Zhao, Z Gao, X Li arXiv preprint arXiv:2509.14784, 2025 | 1 | 2025 |
| FunAudio-ASR Technical Report K An, Y Chen, C Deng, C Gao, Z Gao, B Gong, X Li, Y Li, X Lv, Y Ji, ... arXiv e-prints, arXiv:2509.12508, 2025 | 1 | 2025 |
| Adapting Whisper for Streaming Speech Recognition via Two-Pass Decoding H Zhou, X Song, B Fahy, Q Song, B Zhang, Z Peng, A Wadhawan, ... arXiv preprint arXiv:2506.12154, 2025 | 1 | 2025 |
| Hydraformer: One Encoder for All Subsampling Rates Y Xu, X Song, Z Wu, D Wu, Z Peng, B Zhang 2024 IEEE International Conference on Multimedia and Expo (ICME), 1-6, 2024 | 1 | 2024 |