[go: up one dir, main page]

Follow
Cody (Hao) Yu
Cody (Hao) Yu
Member of Technical Staff @ OpenAI | ex-Amazon,Anyscale | UCLA PhD ‘19
Verified email at openai.com - Homepage
Title
Cited by
Cited by
Year
Efficient memory management for large language model serving with pagedattention
W Kwon, Z Li, S Zhuang, Y Sheng, L Zheng, CH Yu, J Gonzalez, H Zhang, ...
Proceedings of the 29th symposium on operating systems principles, 611-626, 2023
43872023
Ansor: Generating {High-Performance} tensor programs for deep learning
L Zheng, C Jia, M Sun, Z Wu, CH Yu, A Haj-Ali, Y Wang, J Yang, D Zhuo, ...
14th USENIX symposium on operating systems design and implementation (OSDI …, 2020
6072020
Automated systolic array architecture synthesis for high throughput CNN inference on FPGAs
X Wei, CH Yu, P Zhang, Y Chen, Y Wang, H Hu, Y Liang, J Cong
Proceedings of the 54th Annual Design Automation Conference 2017, 1-6, 2017
5482017
Sglang: Efficient execution of structured language model programs
L Zheng, L Yin, Z Xie, CL Sun, J Huang, CH Yu, S Cao, C Kozyrakis, ...
Advances in neural information processing systems 37, 62557-62583, 2024
3702024
AutoDSE: Enabling software programmers to design efficient FPGA accelerators
A Sohrabizadeh, CH Yu, M Gao, J Cong
ACM Transactions on Design Automation of Electronic Systems (TODAES) 27 (4 …, 2022
1422022
HeteroCL: A multi-paradigm programming infrastructure for software-defined reconfigurable computing
YH Lai, Y Chi, Y Hu, J Wang, CH Yu, Y Zhou, J Cong, Z Zhang
Proceedings of the 2019 ACM/SIGDA International Symposium on Field …, 2019
1422019
Efficiently Programming Large Language Models using SGLang.
L Zheng, L Yin, Z Xie, J Huang, C Sun, CH Yu, S Cao, C Kozyrakis, ...
arXiv, 2023
1242023
Tensorir: An abstraction for automatic tensorized program optimization
S Feng, B Hou, H Jin, W Lin, J Shao, R Lai, Z Ye, L Zheng, CH Yu, Y Yu, ...
Proceedings of the 28th ACM International Conference on Architectural …, 2023
1222023
Programming and runtime support to blaze FPGA accelerator deployment at datacenter scale
M Huang, D Wu, CH Yu, Z Fang, M Interlandi, T Condie, J Cong
Proceedings of the Seventh ACM Symposium on Cloud Computing, 456-469, 2016
1192016
TGPA: Tile-grained pipeline architecture for low latency CNN inference
X Wei, Y Liang, X Li, CH Yu, P Zhang, J Cong
2018 IEEE/ACM International Conference on Computer-Aided Design (ICCAD), 1-8, 2018
992018
Automated accelerator generation and optimization with composable, parallel and pipeline architecture
J Cong, P Wei, CH Yu, P Zhang
Proceedings of the 55th Annual Design Automation Conference, 1-6, 2018
902018
Systems and methods for systolic array design from a high-level program
P Zhang, CH Yu, X Wei, P Pan
US Patent 10,838,910, 2020
862020
The SMEM Seeding Acceleration for DNA Sequence Alignment
MCF Chang, YT Chen, J Cong, PT Huang, CL Kuo, CH Yu
The 24th IEEE International Symposium on Field-Programmable Custom Computing …, 2016
732016
Bandwidth Optimization Through On-Chip Memory Restructuring for HLS
J Cong, P Wei, CH Yu, P Zhou
692017
S2FA: An accelerator automation framework for heterogeneous computing in datacenters
CH Yu, P Wei, M Grossman, P Zhang, V Sarker, J Cong
Proceedings of the 55th Annual Design Automation Conference, 1-6, 2018
622018
Tensor program optimization with probabilistic programs
J Shao, X Zhou, S Feng, B Hou, R Lai, H Jin, W Lin, M Masuda, CH Yu, ...
Advances in Neural Information Processing Systems 35, 35783-35796, 2022
612022
DietCode: Automatic optimization for dynamic tensor programs
B Zheng, Z Jiang, CH Yu, H Shen, J Fromm, Y Liu, Y Wang, L Ceze, ...
Proceedings of Machine Learning and Systems 4, 848-863, 2022
612022
Hidet: Task-mapping programming paradigm for deep learning tensor programs
Y Ding, CH Yu, B Zheng, Y Liu, Y Wang, G Pekhimenko
Proceedings of the 28th ACM International Conference on Architectural …, 2023
592023
On the preconditioner of conjugate gradient method: a power grid simulation perspective
CH Chou, NY Tsai, H Yu, CR Lee, Y Shi, SC Chang
Proceedings of the International Conference on Computer-Aided Design, 494-497, 2010
442010
Best-effort FPGA programming: A few steps can go a long way
J Cong, Z Fang, Y Hao, P Wei, CH Yu, C Zhang, P Zhou
arXiv preprint arXiv:1807.01340, 2018
382018
The system can't perform the operation now. Try again later.
Articles 1–20