| Prompttts 2: Describing and generating voices with text prompt Y Leng, Z Guo, K Shen, X Tan, Z Ju, Y Liu, Y Liu, D Yang, L Zhang, ... arXiv preprint arXiv:2309.02285, 2023 | 67 | 2023 |
| The sjtu x-lance lab system for cnsrc 2022 Z Chen, B Liu, B Han, L Zhang, Y Qian arXiv preprint arXiv:2206.11699, 2022 | 27 | 2022 |
| Knowledge Distillation from Multi-Modality to Single-Modality for Person Verification L Zhang, Z Chen, Y Qian Proc. Interspeech 2021, 1897-1901, 2021 | 18 | 2021 |
| CoVoMix: Advancing zero-shot speech generation for human-like multi-talker conversations L Zhang, Y Qian, L Zhou, S Liu, D Wang, X Wang, M Yousefi, Y Qian, J Li, ... Advances in Neural Information Processing Systems 37, 100291-100317, 2024 | 14 | 2024 |
| DDTSE: Discriminative Diffusion Model for Target Speech Extraction L Zhang, Y Qian, L Yu, H Wang, X Wang, H Yang, L Zhou, S Liu, Y Qian, ... arXiv preprint arXiv:2309.13874, 2023 | 14* | 2023 |
| Adaptive large margin fine-tuning for robust speaker verification L Zhang, Z Chen, Y Qian ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 11 | 2023 |
| Generation-based target speech extraction with speech discretization and vocoder L Yu, W Zhang, C Du, L Zhang, Z Liang, Y Qian ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 9 | 2024 |
| Enroll-Aware Attentive Statistics Pooling for Target Speaker Verification L Zhang, Z Chen, Y Qian Proc. Interspeech 2022, 311-315, 2022 | 7 | 2022 |
| Slide: Integrating speech language model with llm for spontaneous spoken dialogue generation H Lu, G Cheng, L Luo, L Zhang, Y Qian, P Zhang ICASSP 2025-2025 IEEE International Conference on Acoustics, Speech and …, 2025 | 6 | 2025 |
| CoVoMix2: Advancing Zero-Shot Dialogue Generation with Fully Non-Autoregressive Flow Matching L Zhang, Y Qian, X Wang, M Thakker, D Wang, J Yu, H Wu, Y Hu, J Li, ... arXiv preprint arXiv:2506.00885, 2025 | 4 | 2025 |
| Advanced zero-shot text-to-speech for background removal and preservation with controllable masked speech prediction L Zhang, W Zhang, Z Chen, Y Qian ICASSP 2025-2025 IEEE International Conference on Acoustics, Speech and …, 2025 | 4 | 2025 |
| Scale This, Not That: Investigating Key Dataset Attributes for Efficient Speech Enhancement Scaling L Zhang, W Zhang, C Li, Y Qian arXiv preprint arXiv:2412.14890, 2024 | 3 | 2024 |
| FlexiCodec: A Dynamic Neural Audio Codec for Low Frame Rates J Li, Y Qian, Y Hu, L Zhang, X Wang, H Lu, M Thakker, J Li, S Zhao, Z Wu arXiv preprint arXiv:2510.00981, 2025 | 1 | 2025 |
| E2E-BPVC: End-to-End Background-Preserving Voice Conversion via In-Context Learning Y Liu, Z Chen, L Zhang, Y Qian Proc. Interspeech 2025, 1378-1382, 2025 | 1 | 2025 |
| Knowledge Distillation from Discriminative Model to Generative Model with Parallel Architecture for Speech Enhancement T Zhou, L Zhang, Y Qian 2024 IEEE 14th International Symposium on Chinese Spoken Language Processing …, 2024 | 1 | 2024 |
| Training Text-to-Speech Model with Purely Synthetic Data: Feasibility, Sensitivity, and Generalization Capability T Zhou, L Zhang, Z Chen, Y Qian arXiv preprint arXiv:2512.17356, 2025 | | 2025 |