‪Leying ZHANG‬ - ‪Google Scholar‬

Get my own profile

Cited by

	All	Since 2021
Citations	187	187
h-index	7	7
i10-index	6	6

0

120

60

30

90

202220232024202520264 17 57 106 3

Public access

6 articles

1 article

available

not available

Based on funding mandates

Co-authors

Yanmin QianProfessor, Shanghai Jiao Tong UniversityVerified email at sjtu.edu.cn
Zhengyang Chen (陈正阳)ByteDance; Shanghai Jiao Tong UniversityVerified email at bytedance.com
Yao QianMicrosoftVerified email at microsoft.com

Leying ZHANG

Leying ZHANG

Shanghai Jiao Tong University

Verified email at sjtu.edu.cn

Machine learning Speech generation


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Prompttts 2: Describing and generating voices with text prompt Y Leng, Z Guo, K Shen, X Tan, Z Ju, Y Liu, Y Liu, D Yang, L Zhang, ... arXiv preprint arXiv:2309.02285, 2023	67	2023
The sjtu x-lance lab system for cnsrc 2022 Z Chen, B Liu, B Han, L Zhang, Y Qian arXiv preprint arXiv:2206.11699, 2022	27	2022
Knowledge Distillation from Multi-Modality to Single-Modality for Person Verification L Zhang, Z Chen, Y Qian Proc. Interspeech 2021, 1897-1901, 2021	18	2021
CoVoMix: Advancing zero-shot speech generation for human-like multi-talker conversations L Zhang, Y Qian, L Zhou, S Liu, D Wang, X Wang, M Yousefi, Y Qian, J Li, ... Advances in Neural Information Processing Systems 37, 100291-100317, 2024	14	2024
DDTSE: Discriminative Diffusion Model for Target Speech Extraction L Zhang, Y Qian, L Yu, H Wang, X Wang, H Yang, L Zhou, S Liu, Y Qian, ... arXiv preprint arXiv:2309.13874, 2023	14*	2023
Adaptive large margin fine-tuning for robust speaker verification L Zhang, Z Chen, Y Qian ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	11	2023
Generation-based target speech extraction with speech discretization and vocoder L Yu, W Zhang, C Du, L Zhang, Z Liang, Y Qian ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	9	2024
Enroll-Aware Attentive Statistics Pooling for Target Speaker Verification L Zhang, Z Chen, Y Qian Proc. Interspeech 2022, 311-315, 2022	7	2022
Slide: Integrating speech language model with llm for spontaneous spoken dialogue generation H Lu, G Cheng, L Luo, L Zhang, Y Qian, P Zhang ICASSP 2025-2025 IEEE International Conference on Acoustics, Speech and …, 2025	6	2025
CoVoMix2: Advancing Zero-Shot Dialogue Generation with Fully Non-Autoregressive Flow Matching L Zhang, Y Qian, X Wang, M Thakker, D Wang, J Yu, H Wu, Y Hu, J Li, ... arXiv preprint arXiv:2506.00885, 2025	4	2025
Advanced zero-shot text-to-speech for background removal and preservation with controllable masked speech prediction L Zhang, W Zhang, Z Chen, Y Qian ICASSP 2025-2025 IEEE International Conference on Acoustics, Speech and …, 2025	4	2025
Scale This, Not That: Investigating Key Dataset Attributes for Efficient Speech Enhancement Scaling L Zhang, W Zhang, C Li, Y Qian arXiv preprint arXiv:2412.14890, 2024	3	2024
FlexiCodec: A Dynamic Neural Audio Codec for Low Frame Rates J Li, Y Qian, Y Hu, L Zhang, X Wang, H Lu, M Thakker, J Li, S Zhao, Z Wu arXiv preprint arXiv:2510.00981, 2025	1	2025
E2E-BPVC: End-to-End Background-Preserving Voice Conversion via In-Context Learning Y Liu, Z Chen, L Zhang, Y Qian Proc. Interspeech 2025, 1378-1382, 2025	1	2025
Knowledge Distillation from Discriminative Model to Generative Model with Parallel Architecture for Speech Enhancement T Zhou, L Zhang, Y Qian 2024 IEEE 14th International Symposium on Chinese Spoken Language Processing …, 2024	1	2024
Training Text-to-Speech Model with Purely Synthetic Data: Feasibility, Sensitivity, and Generalization Capability T Zhou, L Zhang, Z Chen, Y Qian arXiv preprint arXiv:2512.17356, 2025		2025

The system can't perform the operation now. Try again later.

Articles 1–16