| LogicNets: Co-designed neural networks and circuits for extreme-throughput applications Y Akhauri *, Y Umuroglu *, NJ Fraser, M Blott 2020 30th International Conference on Field-Programmable Logic and …, 2020 | 160* | 2020 |
| Accelerating diffusion language model inference via efficient KV caching and guided diffusion Z Hu, J Meng, Y Akhauri, MS Abdelfattah, J Seo, Z Zhang, U Gupta arXiv preprint arXiv:2505.21467, 2025 | 23 | 2025 |
| EZNAS: Evolving zero-cost proxies for neural architecture scoring Y Akhauri, J Munoz, N Jain, R Iyer Advances in Neural Information Processing Systems 35, 30459-30470, 2022 | 23* | 2022 |
| Enabling NAS with automated super-network generation JP Muñoz, N Lyalyushkin, Y Akhauri, A Senina, A Kozlov, N Jain arXiv preprint arXiv:2112.10878, 2021 | 20 | 2021 |
| ShadowLLM: Predictor-based contextual sparsity for large language models Y Akhauri, AF AbouElhamayed, J Dotzel, Z Zhang, AM Rush, S Huda, ... arXiv preprint arXiv:2406.16635, 2024 | 15 | 2024 |
| Esoteric Language Models SS Sahoo, Z Yang, Y Akhauri, J Liu, D Singh, Z Cheng, Z Liu, E Xing, ... arXiv preprint arXiv:2506.01928, 2025 | 12 | 2025 |
| Encodings for prediction-based neural architecture search Y Akhauri, MS Abdelfattah arXiv preprint arXiv:2403.02484, 2024 | 12 | 2024 |
| HadaNets: Flexible quantization strategies for neural networks Y Akhauri Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019 | 12 | 2019 |
| xKV: Cross-layer SVD for KV-cache compression CC Chang, CY Lin, Y Akhauri, WC Lin, KC Wu, L Ceze, MS Abdelfattah arXiv preprint arXiv:2503.18893, 2025 | 10 | 2025 |
| High-throughput DNN inference with LogicNets Y Umuroglu, Y Akhauri, NJ Fraser, M Blott 2020 IEEE 28th Annual International Symposium on Field-Programmable Custom …, 2020 | 9 | 2020 |
| SplitReason: Learning to offload reasoning Y Akhauri, A Fei, CC Chang, AF AbouElhamayed, Y Li, MS Abdelfattah arXiv preprint arXiv:2504.16379, 2025 | 8 | 2025 |
| Multi-Predict: Few shot predictors for efficient neural architecture search Y Akhauri, MS Abdelfattah arXiv preprint arXiv:2306.02459, 2023 | 8 | 2023 |
| On latency predictors for neural architecture search Y Akhauri, MS Abdelfattah Proceedings of Machine Learning and Systems 6, 512-523, 2024 | 7 | 2024 |
| Methods and apparatus to perform weight and activation compression and decompression N Jain, M Adelman, R Sade, R Iyer, R Poornachandran, Y Akhauri US Patent 12,425,047, 2025 | 6 | 2025 |
| Apparatus, articles of manufacture, and methods for composable machine learning compute nodes E Nurvitadhi, R Poornachandran, A Davare, N Jain, C Lacewell, ... US Patent App. 17/558,284, 2022 | 6 | 2022 |
| Performance Prediction for Large Systems via Text-to-Text Regression Y Akhauri, B Lewandowski, CH Lin, AN Reyes, GC Forbes, ... arXiv preprint arXiv:2506.21718, 2025 | 4 | 2025 |
| RHNAS: Realizable hardware and neural architecture search Y Akhauri, A Niranjan, JP Muñoz, S Banerjee, A Davare, P Cocchini, ... arXiv preprint arXiv:2106.09180, 2021 | 3 | 2021 |
| Regression language models for code Y Akhauri, X Song, A Wongpanich, B Lewandowski, MS Abdelfattah arXiv preprint arXiv:2509.26476, 2025 | 2 | 2025 |
| TokenButler: Token importance is predictable Y Akhauri, AF AbouElhamayed, Y Gao, CC Chang, N Jain, ... arXiv preprint arXiv:2503.07518, 2025 | 2 | 2025 |
| SparAMX: Accelerating Compressed LLMs Token Generation on AMX-powered CPUs AF AbouElhamayed, J Dotzel, Y Akhauri, CC Chang, S Gobriel, JP Muñoz, ... arXiv preprint arXiv:2502.12444, 2025 | 2 | 2025 |