
Bodun Hu
Title
Cited by
Year
Towards a Machine Learning-Assisted Kernel with LAKE
H Fingler, I Tarte, H Yu, A Szekely, B Hu, A Akella, CJ Rossbach
Proceedings of the 28th ACM International Conference on Architectural …, 2023
Cited by: 40; Year: 2023
Altis: Modernizing GPGPU Benchmarks
B Hu, CJ Rossbach
Proceedings of the 2020 IEEE International Symposium on Performance Analysis …, 2020
Cited by: 37*; Year: 2020
FFN-SkipLLM: A Hidden Gem for Autoregressive Decoding with Adaptive Feed Forward Skipping
A Jaiswal, B Hu, L Yin, Y Ro, S Liu, T Chen, A Akella
Proceedings of the 2024 Conference on Empirical Methods in Natural Language …, 2024
Cited by: 18; Year: 2024
BlockLLM: Multi-tenant Finer-grained Serving for Large Language Models
B Hu, J Li, L Xu, M Lee, A Jajoo, GW Kim, H Xu, A Akella
arXiv preprint arXiv:2404.18322, 2024
Cited by: 12; Year: 2024
ChartMuseum: Testing Visual Reasoning Capabilities of Large Vision-Language Models
L Tang, G Kim, X Zhao, T Lake, W Ding, F Yin, P Singhal, M Wadhwa, ...
arXiv preprint arXiv:2505.13444, 2025
Cited by: 11; Year: 2025
MOSEL: Inference Serving Using Dynamic Modality Selection
B Hu, L Xu, J Moon, NJ Yadwadkar, A Akella
Proceedings of the 2024 Conference on Empirical Methods in Natural Language …, 2024
Cited by: 6; Year: 2024
Patchwork: A Unified Framework for RAG Serving
B Hu, L Pabon, S Agarwal, A Akella
arXiv preprint arXiv:2505.07833, 2025
Cited by: 1; Year: 2025
StitchLLM: Serving LLMs, One Block at a Time
B Hu, S Li, S Agarwal, M Lee, A Jajoo, J Li, L Xu, GW Kim, D Kim, H Xu, ...
Proceedings of the 63rd Annual Meeting of the Association for Computational …, 2025
Year: 2025