Ting Cao 曹婷

Cited by

	All	Since 2021
Citations	2189	1710
h-index	27	23
i10-index	46	42

Citations per year
Year	Citations
2012	18
2013	48
2014	38
2015	56
2016	69
2017	69
2018	48
2019	60
2020	50
2021	65
2022	121
2023	184
2024	353
2025	968
2026	16

Public access (based on funding mandates)
Available	22 articles
Not available	3 articles

Co-authors

Mao Yang 杨懋 - Microsoft Research - Verified email at microsoft.com
Yunxin Liu - IEEE Fellow, Guoqiang Professor, Institute for AI Industry Research (AIR), Tsinghua University - Verified email at air.tsinghua.edu.cn
Shiqi Jiang - Microsoft Research - Verified email at microsoft.com
Jianyu Wei - USTC & MSRA Joint PhD - Verified email at mail.ustc.edu.cn
Yuqing Yang - Microsoft - Verified email at microsoft.com
Li Lyna Zhang - Microsoft Research Asia - Verified email at microsoft.com
Ju Ren - Department of Computer Science and Technology, Tsinghua University - Verified email at tsinghua.edu.cn
Kathryn S McKinley - Google - Verified email at cs.utexas.edu
Fan Yang - Microsoft Research - Verified email at microsoft.com
Ningxin Zheng - Bytedance AML - Verified email at bytedance.com
Steve Blackburn - Research Scientist, Google | Professor of Computer Science, Australian National University - Verified email at google.com
Yuanchun Li - Institute for AI Industry Research (AIR), Tsinghua University - Verified email at air.tsinghua.edu.cn
Lingxiao Ma - Senior Researcher, Microsoft Research - Verified email at pku.edu.cn
Kun Li - Assistant Professor, Tsinghua University - Verified email at air.tsinghua.edu.cn
Liang Yuan - Institute of Computing Technology - Verified email at ict.ac.cn
Lili Qiu - NAI Fellow, ACM Fellow, IEEE Fellow, Professor, Dept. of Computer Science, The University of Texas - Verified email at cs.utexas.edu
Huiqiang Jiang - Microsoft Research Asia - Verified email at microsoft.com
Dayou Du - University of Edinburgh - Verified email at sms.ed.ac.uk
Yijia Zhang - Shanghai Jiao Tong University - Verified email at sjtu.edu.cn
Shihao Han - The University of Hong Kong - Verified email at connect.hku.hk

Ting Cao 曹婷

Professor, Tsinghua University

Verified email at mail.tsinghua.edu.cn

Deep learning, Edge AI, Computer Architecture, Energy Efficiency


Title	Cited by	Year
nn-Meter: Towards accurate latency prediction of deep-learning model inference on diverse edge devices. LL Zhang, S Han, J Wei, N Zheng, T Cao, Y Yang, Y Liu. Proceedings of the 19th Annual International Conference on Mobile Systems …, 2021	198	2021
The yin and yang of power and performance for asymmetric hardware and managed software. T Cao, SM Blackburn, T Gao, KS McKinley. ACM SIGARCH Computer Architecture News 40 (3), 225-236, 2012	145	2012
Parallel processing systems for big data: a survey. Y Zhang, T Cao, S Li, X Tian, L Yuan, H Jia, AV Vasilakos. Proceedings of the IEEE 104 (11), 2114-2136, 2016	128	2016
Looking back on the language and hardware revolutions: measured power, performance, and scaling. H Esmaeilzadeh, T Cao, Y Xi, SM Blackburn, KS McKinley. ACM SIGARCH Computer Architecture News 39 (1), 319-332, 2011	115	2011
Panthera: Holistic memory management for big data processing over hybrid memories. C Wang, H Cui, T Cao, J Zigman, H Volos, O Mutlu, F Lv, X Feng, GH Xu. Proceedings of the 40th ACM SIGPLAN Conference on Programming Language …, 2019	93	2019
CoDL: Efficient CPU-GPU co-execution for deep learning inference on mobile devices. F Jia, D Zhang, T Cao, S Jiang, Y Liu, J Ren, Y Zhang. MobiSys 22, 209-221, 2022	82	2022
AsyMo: Scalable and efficient deep-learning inference on asymmetric mobile CPUs. M Wang, S Ding, T Cao, Y Liu, F Xu. Proceedings of the 27th Annual International Conference on Mobile Computing …, 2021	81	2021
Pre-gated MoE: An algorithm-system co-design for fast and scalable mixture-of-expert inference. R Hwang, J Wei, S Cao, C Hwang, X Tang, T Cao, M Yang. 2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture …, 2024	73	2024
BitDistiller: Unleashing the potential of sub-4-bit LLMs via self-distillation. D Du, Y Zhang, S Cao, J Guo, T Cao, X Chu, N Xu. arXiv preprint arXiv:2402.10631, 2024	66	2024
SeerAttention: Learning intrinsic sparse attention in your LLMs. Y Gao, Z Zeng, D Du, S Cao, P Zhou, J Qi, J Lai, HKH So, T Cao, F Yang, ... arXiv preprint arXiv:2410.13276, 2024	64	2024
Hybrid SLM and LLM for edge-cloud collaborative inference. Z Hao, H Jiang, S Jiang, J Ren, T Cao. Proceedings of the Workshop on Edge and Mobile Foundation Models, 36-41, 2024	61	2024
WADE: Writeback-aware dynamic cache management for NVM-based main memory system. Z Wang, S Shan, T Cao, J Gu, Y Xu, S Mu, Y Xie, DA Jiménez. ACM Transactions on Architecture and Code Optimization (TACO) 10 (4), 1-21, 2013	59	2013
T-MAC: CPU renaissance via table lookup for low-bit LLM deployment on edge. J Wei, S Cao, T Cao, L Ma, L Wang, Y Zhang, M Yang. Proceedings of the Twentieth European Conference on Computer Systems, 278-292, 2025	45	2025
Looking back and looking forward: power, performance, and upheaval. H Esmaeilzadeh, T Cao, X Yang, SM Blackburn, KS McKinley. Communications of the ACM 55 (7), 105-114, 2012	45	2012
Ladder: Enabling efficient low-precision deep learning computing through hardware-aware tensor transformation. L Wang, L Ma, S Cao, Q Zhang, J Xue, Y Shi, N Zheng, Z Miao, F Yang, ... 18th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2024	43	2024
Integer or floating point? New outlooks for low-bit quantization on large language models. Y Zhang, L Zhao, S Cao, S Zhang, W Wang, T Cao, F Yang, M Yang, ... 2024 IEEE International Conference on Multimedia and Expo (ICME), 1-6, 2024	40	2024
FlexNN: Efficient and adaptive DNN inference on memory-constrained edge devices. X Li, Y Li, Y Li, T Cao, Y Liu. Proceedings of the 30th Annual International Conference on Mobile Computing …, 2024	40	2024
Pre-gated MoE: An algorithm-system co-design for fast and scalable mixture-of-expert inference. R Hwang, J Wei, S Cao, C Hwang, X Tang, T Cao, M Yang. 2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA), IEEE 2, 1018-1031, 2024	40	2024
LUT-NN: Empower efficient neural network inference with centroid learning and table lookup. X Tang, Y Wang, T Cao, LL Zhang, Q Chen, D Cai, Y Liu, M Yang. Proceedings of the 29th Annual International Conference on Mobile Computing …, 2023	36	2023
VPTQ: Extreme low-bit vector post-training quantization for large language models. Y Liu, J Wen, Y Wang, S Ye, LL Zhang, T Cao, C Li, M Yang. arXiv preprint arXiv:2409.17066, 2024	34	2024
