Shan Yang
Tencent AI Lab
Verified email at nwpu-aslp.org
Title
Cited by
Year
Multi-band MelGAN: Faster waveform generation for high-quality text-to-speech
G Yang, S Yang, K Liu, P Fang, W Chen, L Xie
2021 IEEE Spoken Language Technology Workshop (SLT), 492-498, 2021
Cited by: 302, Year: 2021
Controllable emotion transfer for end-to-end speech synthesis
T Li, S Yang, L Xue, L Xie
2021 12th International Symposium on Chinese Spoken Language Processing …, 2021
Cited by: 125, Year: 2021
MsEmoTTS: Multi-scale emotion transfer, prediction, and control for emotional speech synthesis
Y Lei, S Yang, X Wang, L Xie
IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 853-864, 2022
Cited by: 112, Year: 2022
Fine-grained emotion strength transfer, control and prediction for emotional speech synthesis
Y Lei, S Yang, L Xie
2021 IEEE Spoken Language Technology Workshop (SLT), 423-430, 2021
Cited by: 83, Year: 2021
A deep bidirectional LSTM approach for video-realistic talking head
B Fan, L Xie, S Yang, L Wang, FK Soong
Multimedia Tools and Applications 75 (9), 5287-5309, 2016
Cited by: 79, Year: 2016
The role of blood vessels in high-resolution volume conductor head modeling of EEG
LDJ Fiederer, J Vorwerk, F Lucka, M Dannhauer, S Yang, M Dümpelmann, ...
NeuroImage 128, 193-208, 2016
Cited by: 72, Year: 2016
Controlling emotion strength with relative attribute for end-to-end speech synthesis
X Zhu, S Yang, G Yang, L Xie
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019
Cited by: 71, Year: 2019
Statistical parametric speech synthesis using generative adversarial networks under a multi-task learning framework
S Yang, L Xie, X Chen, X Lou, X Zhu, D Huang, H Li
2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017
Cited by: 69, Year: 2017
Accent and speaker disentanglement in many-to-many voice conversion
Z Wang, W Ge, X Wang, S Yang, W Gan, H Chen, H Li, L Xie, X Li
2021 12th International Symposium on Chinese Spoken Language Processing …, 2021
Cited by: 45, Year: 2021
Controllable context-aware conversational speech synthesis
J Cong, S Yang, N Hu, G Li, L Xie, D Su
Interspeech, 2021, 4658-4662, 2021
Cited by: 44, Year: 2021
Pre-alignment guided attention for improving training efficiency and model stability in end-to-end speech synthesis
X Zhu, Y Zhang, S Yang, L Xue, L Xie
IEEE Access 7, 65955-65964, 2019
Cited by: 42, Year: 2019
Data efficient voice cloning from noisy samples with domain adversarial training
J Cong, S Yang, L Xie, G Yu, G Wan
arXiv preprint arXiv:2008.04265, 2020
Cited by: 40, Year: 2020
On the localness modeling for the self-attention based end-to-end speech synthesis
S Yang, H Lu, S Kang, L Xue, J Xiao, D Su, L Xie, D Yu
Neural Networks 125, 121-130, 2020
Cited by: 38, Year: 2020
Glow-WaveGAN: Learning speech representations from GAN-based variational auto-encoder for high fidelity flow-based speech synthesis
J Cong, S Yang, L Xie, D Su
Interspeech, 2021, 2021
Cited by: 36, Year: 2021
Learning Hierarchical Representations for Expressive Speaking Style in End-to-End Speech Synthesis
X An, Y Wang, S Yang, Z Ma, L Xie
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019
Cited by: 26, Year: 2019
Cross-speaker emotion transfer through information perturbation in emotional speech synthesis
Y Lei, S Yang, X Zhu, L Xie, D Su
IEEE Signal Processing Letters 29, 1948-1952, 2022
Cited by: 24, Year: 2022
Glow-WaveGAN 2: High-quality Zero-shot Text-to-speech Synthesis and Any-to-any Voice Conversion
Y Lei, S Yang, J Cong, L Xie, D Su
Interspeech, 2022, 2022
Cited by: 21, Year: 2022
Improving Mandarin End-to-End Speech Synthesis by Self-Attention and Learnable Gaussian Bias
F Yang, S Yang, P Zhu, P Yan, L Xie
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019
Cited by: 21, Year: 2019
Enhancing Hybrid Self-attention Structure with Relative-position-aware Bias for Speech Synthesis
S Yang, H Lu, S Kang, L Xie, D Yu
2019 IEEE International Conference on Acoustics, Speech and Signal …, 2019
Cited by: 19, Year: 2019
On the training of DNN-based average voice model for speech synthesis
S Yang, Z Wu, L Xie
2016 Asia-Pacific Signal and Information Processing Association Annual …, 2016
Cited by: 18, Year: 2016