Sitong CHENG

Cited by

	All	Since 2021
Citations	598	587
h-index	6	6
i10-index	6	6

280

140

210

202020212022202320242025202610 32 88 74 108 264 21

Public access

View all

3 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Wei XueHKUSTVerified email at ust.hk
Yike GuoDept of CSE, The Hong Kong University of Science and TechnologyVerified email at ust.hk

Sitong CHENG

Other namesCheng Sitong, S.T Cheng

The Hong Kong University of Science and Technology

Verified email at connect.ust.hk - Homepage

audio generation speech translation audio understanding


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Cn-celeb: a challenging chinese speaker recognition dataset Y Fan, JW Kang, LT Li, KC Li, HL Chen, ST Cheng, PY Zhang, ZY Zhou, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	301	2020
Spark-tts: An efficient llm-based text-to-speech model with single-stream decoupled speech tokens X Wang, M Jiang, Z Ma, Z Zhang, S Liu, L Li, Z Liang, Q Zheng, R Wang, ... arXiv preprint arXiv:2503.01710, 2025	94	2025
Harmony: Heterogeneous multi-modal federated learning through disentangled model training X Ouyang, Z Xie, H Fu, S Cheng, L Pan, N Ling, G Xing, J Zhou, J Huang Proceedings of the 21st Annual International Conference on Mobile Systems …, 2023	80	2023
ADMarker: A Multi-Modal Federated Learning System for Monitoring Digital Biomarkers of Alzheimer's Disease X Ouyang, X Shuai, Y Li, L Pan, X Zhang, H Fu, S Cheng, X Wang, S Cao, ... Proceedings of the 30th Annual International Conference on Mobile Computing …, 2024	51	2024
ASR-free pronunciation assessment S Cheng, Z Liu, L Li, Z Tang, D Wang, TF Zheng arXiv preprint arXiv:2005.11902, 2020	44	2020
Both ears wide open: Towards language-driven spatial audio generation P Sun, S Cheng, X Li, Z Ye, H Liu, H Zhang, W Xue, Y Guo arXiv preprint arXiv:2410.10676, 2024	25	2024
Audio-flan: A preliminary release L Xue, Z Zhou, J Pan, Z Li, S Fan, Y Ma, S Cheng, D Yang, H Guo, Y Xiao, ... arXiv preprint arXiv:2502.16584, 2025	3	2025
AquaScan: A Sonar-based Underwater Sensing System for Human Activity Monitoring H Hou, B Zheng, S Cheng, X Zhao, P Wu, L He, Y Guo, G Xing, Z Yan Proceedings of the 31st Annual International Conference on Mobile Computing …, 2025		2025
UniSS: Unified Expressive Speech-to-Speech Translation with Your Voice S Cheng, W Bian, X Wang, R Yuan, J Chen, S Yin, Y Guo, W Xue arXiv preprint arXiv:2509.21144, 2025		2025

The system can't perform the operation now. Try again later.

Articles 1–9

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors