| Multi-band melgan: Faster waveform generation for high-quality text-to-speech G Yang, S Yang, K Liu, P Fang, W Chen, L Xie 2021 IEEE Spoken Language Technology Workshop (SLT), 492-498, 2021 | 302 | 2021 |
| Controllable emotion transfer for end-to-end speech synthesis T Li, S Yang, L Xue, L Xie 2021 12th International Symposium on Chinese Spoken Language Processing …, 2021 | 125 | 2021 |
| Msemotts: Multi-scale emotion transfer, prediction, and control for emotional speech synthesis Y Lei, S Yang, X Wang, L Xie IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 853-864, 2022 | 112 | 2022 |
| Fine-grained emotion strength transfer, control and prediction for emotional speech synthesis Y Lei, S Yang, L Xie 2021 IEEE Spoken Language Technology Workshop (SLT), 423-430, 2021 | 83 | 2021 |
| A deep bidirectional LSTM approach for video-realistic talking head B Fan, L Xie, S Yang, L Wang, FK Soong Multimedia Tools and Applications 75 (9), 5287-5309, 2016 | 79 | 2016 |
| The role of blood vessels in high-resolution volume conductor head modeling of EEG LDJ Fiederer, J Vorwerk, F Lucka, M Dannhauer, S Yang, M Dümpelmann, ... NeuroImage 128, 193-208, 2016 | 72 | 2016 |
| Controlling emotion strength with relative attribute for end-to-end speech synthesis Z Xiaolian, Y Shan, X Geng, Yang, Lei 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019 | 71 | 2019 |
| Statistical parametric speech synthesis using generative adversarial networks under a multi-task learning framework S Yang, L Xie, X Chen, X Lou, X Zhu, D Huang, H Li 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017 | 69 | 2017 |
| Accent and speaker disentanglement in many-to-many voice conversion Z Wang, W Ge, X Wang, S Yang, W Gan, H Chen, H Li, L Xie, X Li 2021 12th International Symposium on Chinese Spoken Language Processing …, 2021 | 45 | 2021 |
| Controllable context-aware conversational speech synthesis J Cong, S Yang, N Hu, G Li, L Xie, D Su Interspeech, 2021, 4658-4662, 2021 | 44 | 2021 |
| Pre-alignment guided attention for improving training efficiency and model stability in end-to-end speech synthesis X Zhu, Y Zhang, S Yang, L Xue, L Xie IEEE Access 7, 65955-65964, 2019 | 42 | 2019 |
| Data efficient voice cloning from noisy samples with domain adversarial training J Cong, S Yang, L Xie, G Yu, G Wan arXiv preprint arXiv:2008.04265, 2020 | 40 | 2020 |
| On the localness modeling for the self-attention based end-to-end speech synthesis S Yang, H Lu, S Kang, L Xue, J Xiao, D Su, L Xie, D Yu Neural Networks 125, 121-130, 2020 | 38 | 2020 |
| Glow-wavegan: Learning speech representations from gan-based variational auto-encoder for high fidelity flow-based speech synthesis J Cong, S Yang, L Xie, D Su Interspeech, 2021, 2021 | 36 | 2021 |
| Learning Hierarchical Representations for Expressive Speaking Style in End-to-End Speech Synthesis X An, Y Wang, S Yang, Z Ma, L Xie 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019 | 26 | 2019 |
| Cross-speaker emotion transfer through information perturbation in emotional speech synthesis Y Lei, S Yang, X Zhu, L Xie, D Su IEEE Signal Processing Letters 29, 1948-1952, 2022 | 24 | 2022 |
| Glow-WaveGAN 2: High-quality Zero-shot Text-to-speech Synthesis and Any-to-any Voice Conversion Y Lei, S Yang, J Cong, L Xie, D Su Interspeech, 2022, 2022 | 21 | 2022 |
| Improving Mandarin End-to-End Speech Synthesis by Self-Attention and Learnable Gaussian Bias F Yang, S Yang, P Zhu, P Yan, L Xie 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019 | 21 | 2019 |
| Enhancing Hybrid Self-attention Structure with Relative-position-aware Bias for Speech Synthesis S Yang, H Lu, S Kang, L Xie, D Yu 2019 IEEE International Conference on Acoustics, Speech and Signal …, 2019 | 19 | 2019 |
| On the training of DNN-based average voice model for speech synthesis S Yang, Z Wu, L Xie 2016 Asia-Pacific Signal and Information Processing Association Annual …, 2016 | 18 | 2016 |