
Leying ZHANG
Title
Cited by
Year
PromptTTS 2: Describing and generating voices with text prompt
Y Leng, Z Guo, K Shen, X Tan, Z Ju, Y Liu, Y Liu, D Yang, L Zhang, ...
arXiv preprint arXiv:2309.02285, 2023
Cited by: 67; Year: 2023
The SJTU X-LANCE Lab system for CNSRC 2022
Z Chen, B Liu, B Han, L Zhang, Y Qian
arXiv preprint arXiv:2206.11699, 2022
Cited by: 27; Year: 2022
Knowledge Distillation from Multi-Modality to Single-Modality for Person Verification
L Zhang, Z Chen, Y Qian
Proc. Interspeech 2021, 1897-1901, 2021
Cited by: 18; Year: 2021
CoVoMix: Advancing zero-shot speech generation for human-like multi-talker conversations
L Zhang, Y Qian, L Zhou, S Liu, D Wang, X Wang, M Yousefi, Y Qian, J Li, ...
Advances in Neural Information Processing Systems 37, 100291-100317, 2024
Cited by: 14; Year: 2024
DDTSE: Discriminative Diffusion Model for Target Speech Extraction
L Zhang, Y Qian, L Yu, H Wang, X Wang, H Yang, L Zhou, S Liu, Y Qian, ...
arXiv preprint arXiv:2309.13874, 2023
Cited by: 14*; Year: 2023
Adaptive large margin fine-tuning for robust speaker verification
L Zhang, Z Chen, Y Qian
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 11; Year: 2023
Generation-based target speech extraction with speech discretization and vocoder
L Yu, W Zhang, C Du, L Zhang, Z Liang, Y Qian
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
Cited by: 9; Year: 2024
Enroll-Aware Attentive Statistics Pooling for Target Speaker Verification
L Zhang, Z Chen, Y Qian
Proc. Interspeech 2022, 311-315, 2022
Cited by: 7; Year: 2022
SLIDE: Integrating speech language model with LLM for spontaneous spoken dialogue generation
H Lu, G Cheng, L Luo, L Zhang, Y Qian, P Zhang
ICASSP 2025-2025 IEEE International Conference on Acoustics, Speech and …, 2025
Cited by: 6; Year: 2025
CoVoMix2: Advancing Zero-Shot Dialogue Generation with Fully Non-Autoregressive Flow Matching
L Zhang, Y Qian, X Wang, M Thakker, D Wang, J Yu, H Wu, Y Hu, J Li, ...
arXiv preprint arXiv:2506.00885, 2025
Cited by: 4; Year: 2025
Advanced zero-shot text-to-speech for background removal and preservation with controllable masked speech prediction
L Zhang, W Zhang, Z Chen, Y Qian
ICASSP 2025-2025 IEEE International Conference on Acoustics, Speech and …, 2025
Cited by: 4; Year: 2025
Scale This, Not That: Investigating Key Dataset Attributes for Efficient Speech Enhancement Scaling
L Zhang, W Zhang, C Li, Y Qian
arXiv preprint arXiv:2412.14890, 2024
Cited by: 3; Year: 2024
FlexiCodec: A Dynamic Neural Audio Codec for Low Frame Rates
J Li, Y Qian, Y Hu, L Zhang, X Wang, H Lu, M Thakker, J Li, S Zhao, Z Wu
arXiv preprint arXiv:2510.00981, 2025
Cited by: 1; Year: 2025
E2E-BPVC: End-to-End Background-Preserving Voice Conversion via In-Context Learning
Y Liu, Z Chen, L Zhang, Y Qian
Proc. Interspeech 2025, 1378-1382, 2025
Cited by: 1; Year: 2025
Knowledge Distillation from Discriminative Model to Generative Model with Parallel Architecture for Speech Enhancement
T Zhou, L Zhang, Y Qian
2024 IEEE 14th International Symposium on Chinese Spoken Language Processing …, 2024
Cited by: 1; Year: 2024
Training Text-to-Speech Model with Purely Synthetic Data: Feasibility, Sensitivity, and Generalization Capability
T Zhou, L Zhang, Z Chen, Y Qian
arXiv preprint arXiv:2512.17356, 2025
Year: 2025