[go: up one dir, main page]

Follow
Sitong CHENG
Title
Cited by
Cited by
Year
Cn-celeb: a challenging chinese speaker recognition dataset
Y Fan, JW Kang, LT Li, KC Li, HL Chen, ST Cheng, PY Zhang, ZY Zhou, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
3012020
Spark-tts: An efficient llm-based text-to-speech model with single-stream decoupled speech tokens
X Wang, M Jiang, Z Ma, Z Zhang, S Liu, L Li, Z Liang, Q Zheng, R Wang, ...
arXiv preprint arXiv:2503.01710, 2025
942025
Harmony: Heterogeneous multi-modal federated learning through disentangled model training
X Ouyang, Z Xie, H Fu, S Cheng, L Pan, N Ling, G Xing, J Zhou, J Huang
Proceedings of the 21st Annual International Conference on Mobile Systems …, 2023
802023
ADMarker: A Multi-Modal Federated Learning System for Monitoring Digital Biomarkers of Alzheimer's Disease
X Ouyang, X Shuai, Y Li, L Pan, X Zhang, H Fu, S Cheng, X Wang, S Cao, ...
Proceedings of the 30th Annual International Conference on Mobile Computing …, 2024
512024
ASR-free pronunciation assessment
S Cheng, Z Liu, L Li, Z Tang, D Wang, TF Zheng
arXiv preprint arXiv:2005.11902, 2020
442020
Both ears wide open: Towards language-driven spatial audio generation
P Sun, S Cheng, X Li, Z Ye, H Liu, H Zhang, W Xue, Y Guo
arXiv preprint arXiv:2410.10676, 2024
252024
Audio-flan: A preliminary release
L Xue, Z Zhou, J Pan, Z Li, S Fan, Y Ma, S Cheng, D Yang, H Guo, Y Xiao, ...
arXiv preprint arXiv:2502.16584, 2025
32025
AquaScan: A Sonar-based Underwater Sensing System for Human Activity Monitoring
H Hou, B Zheng, S Cheng, X Zhao, P Wu, L He, Y Guo, G Xing, Z Yan
Proceedings of the 31st Annual International Conference on Mobile Computing …, 2025
2025
UniSS: Unified Expressive Speech-to-Speech Translation with Your Voice
S Cheng, W Bian, X Wang, R Yuan, J Chen, S Yin, Y Guo, W Xue
arXiv preprint arXiv:2509.21144, 2025
2025
The system can't perform the operation now. Try again later.
Articles 1–9