| An unsupervised deep domain adaptation approach for robust speech recognition S Sun, B Zhang, L Xie, Y Zhang Neurocomputing, 2017 | 206 | 2017 |
| Unsupervised domain adaptation via domain adversarial training for speaker recognition Q Wang, W Rao, S Sun, L Xie, ES Chng, H Li 2018 IEEE international conference on acoustics, speech and signal …, 2018 | 188 | 2018 |
| Domain adversarial training for accented speech recognition S Sun, CF Yeh, MY Hwang, M Ostendorf, L Xie 2018 IEEE international conference on acoustics, speech and signal …, 2018 | 164 | 2018 |
| Training Augmentation with Adversarial Examples for Robust Speech Recognition S Sun, CF Yeh, M Ostendorf, MY Hwang, L Xie | 87* | |
| An attention-based neural network approach for single channel speech enhancement X Hao, C Shan, Y Xu, S Sun, L Xie ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 69 | 2019 |
| A study of learning based beamforming methods for speech recognition X Xiao, C Xu, Z Zhang, S Zhao, S Sun, S Watanabe, L Wang, L Xie, ... CHiME 2016 workshop, 26-31, 2016 | 55 | 2016 |
| Investigating generative adversarial networks based speech dereverberation for robust speech recognition K Wang, J Zhang, S Sun, Y Wang, F Xiang, L Xie arXiv preprint arXiv:1803.10132, 2018 | 54 | 2018 |
| Adversarial Regularization for End-to-End Robust Speaker Verification. Q Wang, P Guo, S Sun, L Xie, JHL Hansen Interspeech, 4010-4014, 2019 | 51 | 2019 |
| Adversarial examples for improving end-to-end attention-based small-footprint keyword spotting X Wang, S Sun, C Shan, J Hou, L Xie, S Li, X Lei ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 49 | 2019 |
| Tiny transducer: A highly-efficient speech recognition model on edge devices Y Zhang, S Sun, L Ma ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 40 | 2021 |
| Adversarial regularization for attention based end-to-end robust speech recognition S Sun, P Guo, L Xie, MY Hwang IEEE/ACM Transactions on Audio, Speech, and Language Processing 27 (11 …, 2019 | 38 | 2019 |
| Efficient conformer with prob-sparse attention mechanism for end-to-endspeech recognition X Wang, S Sun, L Xie, L Ma arXiv preprint arXiv:2106.09236, 2021 | 29 | 2021 |
| Weighted Spatial Covariance Matrix Estimation for MUSIC Based TDOA Estimation of Speech Source. C Xu, X Xiao, S Sun, W Rao, ES Chng, H Li Interspeech, 1894-1898, 2017 | 25 | 2017 |
| The NNI Query-by-Example System for MediaEval 2015. J Hou, CCL Van Tung Pham, CC Leung, L Wang, H Xu, H Lv, L Xie, Z Fu, ... MediaEval, 2015 | 22 | 2015 |
| Improving streaming transformer based asr under a framework of self-supervised learning S Cao, Y Kang, Y Fu, X Xu, S Sun, Y Zhang, L Ma arXiv preprint arXiv:2109.07327, 2021 | 21 | 2021 |
| Two stage contextual word filtering for context bias in unified streaming and non-streaming transducer Z Yang, S Sun, X Wang, Y Zhang, L Ma, L Xie arXiv preprint arXiv:2301.06735, 2023 | 16 | 2023 |
| Self-supervised disentangled representation learning for robust target speech extraction Z Mu, X Yang, S Sun, Q Yang Proceedings of the AAAI Conference on Artificial Intelligence 38 (17), 18815 …, 2024 | 14 | 2024 |
| CaTT-KWS: A multi-stage customized keyword spotting framework based on cascaded transducer-transformer Z Yang, S Sun, J Li, X Zhang, X Wang, L Ma, L Xie arXiv preprint arXiv:2207.01267, 2022 | 14 | 2022 |
| Virtual adversarial training for DS-CNN based small-footprint keyword spotting X Wang, S Sun, L Xie 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019 | 14 | 2019 |
| Leveraging acoustic contextual representation by audio-textual cross-modal learning for conversational ASR K Wei, Y Zhang, S Sun, L Xie, L Ma arXiv preprint arXiv:2207.01039, 2022 | 13 | 2022 |