| Bitmod: Bit-serial mixture-of-datatype llm acceleration Y Chen, AF AbouElhamayed, X Dai, Y Wang, M Andronic, ... 2025 IEEE International Symposium on High Performance Computer Architecture …, 2025 | 20 | 2025 |
| Shadowllm: Predictor-based contextual sparsity for large language models Y Akhauri, AF AbouElhamayed, J Dotzel, Z Zhang, AM Rush, S Huda, ... arXiv preprint arXiv:2406.16635, 2024 | 15 | 2024 |
| PQA: Exploring the Potential of Product Quantization in DNN Hardware Acceleration AF AbouElhamayed, A Cui, J Fernandez-Marques, ND Lane, ... ArXiv, abs/2305.18334, 2023 | 14* | 2023 |
| An enhanced genetic algorithm-based timetabling system with incremental changes AF AbouElhamayed, AS Mahmoud, TT Shaaban, C Salama, AH Yousef 2016 11th International Conference on Computer Engineering & Systems (ICCES …, 2016 | 9 | 2016 |
| Splitreason: Learning to offload reasoning Y Akhauri, A Fei, CC Chang, AF AbouElhamayed, Y Li, MS Abdelfattah arXiv preprint arXiv:2504.16379, 2025 | 8 | 2025 |
| Tokenbutler: Token importance is predictable Y Akhauri, AF AbouElhamayed, Y Gao, CC Chang, N Jain, ... arXiv preprint arXiv:2503.07518, 2025 | 2 | 2025 |
| SparAMX: Accelerating Compressed LLMs Token Generation on AMX-powered CPUs AF AbouElhamayed, J Dotzel, Y Akhauri, CC Chang, S Gobriel, JP Muñoz, ... arXiv preprint arXiv:2502.12444, 2025 | 2 | 2025 |
| Beyond Inference: Performance Analysis of DNN Server Overheads for Computer Vision A Abouelhamayed, S Balle, D Singh, M Abdelfattah Proceedings of the 61st ACM/IEEE Design Automation Conference, 1-6, 2024 | 2 | 2024 |
| Low-Cost Traffic Control using Reinforcement Learning AF AbouElhamayed, H Mahdi, C Salama 2019 14th International Conference on Computer Engineering and Systems …, 2019 | | 2019 |
| Bit-serial Acceleration of LLM Inference with Mixture-of-Datatype Quantization Y Chen, CC Chang, X Dai, AF AbouElhamayed, M Andronic | | |
| Low-Cost Traffic Control Using Reinforcement Learning With Limited State AF AbouElhamayed, H Mahdi, C Salama | | |