| Cn-celeb: a challenging chinese speaker recognition dataset Y Fan, JW Kang, LT Li, KC Li, HL Chen, ST Cheng, PY Zhang, ZY Zhou, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 301 | 2020 |
| Spark-tts: An efficient llm-based text-to-speech model with single-stream decoupled speech tokens X Wang, M Jiang, Z Ma, Z Zhang, S Liu, L Li, Z Liang, Q Zheng, R Wang, ... arXiv preprint arXiv:2503.01710, 2025 | 94 | 2025 |
| Harmony: Heterogeneous multi-modal federated learning through disentangled model training X Ouyang, Z Xie, H Fu, S Cheng, L Pan, N Ling, G Xing, J Zhou, J Huang Proceedings of the 21st Annual International Conference on Mobile Systems …, 2023 | 80 | 2023 |
| ADMarker: A Multi-Modal Federated Learning System for Monitoring Digital Biomarkers of Alzheimer's Disease X Ouyang, X Shuai, Y Li, L Pan, X Zhang, H Fu, S Cheng, X Wang, S Cao, ... Proceedings of the 30th Annual International Conference on Mobile Computing …, 2024 | 51 | 2024 |
| ASR-free pronunciation assessment S Cheng, Z Liu, L Li, Z Tang, D Wang, TF Zheng arXiv preprint arXiv:2005.11902, 2020 | 44 | 2020 |
| Both ears wide open: Towards language-driven spatial audio generation P Sun, S Cheng, X Li, Z Ye, H Liu, H Zhang, W Xue, Y Guo arXiv preprint arXiv:2410.10676, 2024 | 25 | 2024 |
| Audio-flan: A preliminary release L Xue, Z Zhou, J Pan, Z Li, S Fan, Y Ma, S Cheng, D Yang, H Guo, Y Xiao, ... arXiv preprint arXiv:2502.16584, 2025 | 3 | 2025 |
| AquaScan: A Sonar-based Underwater Sensing System for Human Activity Monitoring H Hou, B Zheng, S Cheng, X Zhao, P Wu, L He, Y Guo, G Xing, Z Yan Proceedings of the 31st Annual International Conference on Mobile Computing …, 2025 | | 2025 |
| UniSS: Unified Expressive Speech-to-Speech Translation with Your Voice S Cheng, W Bian, X Wang, R Yuan, J Chen, S Yin, Y Guo, W Xue arXiv preprint arXiv:2509.21144, 2025 | | 2025 |