Yash Akhauri

Cited by

	All	Since 2021
Citations	349	343
h-index	9	9
i10-index	9	9

Citations per year: 2019: 1, 2020: 3, 2021: 30, 2022: 32, 2023: 47, 2024: 66, 2025: 158, 2026: 7

Co-authors

Mohamed S. Abdelfattah	Cornell University	Verified email at cornell.edu
Nilesh Jain	Principal Engineer - AI Systems, Intel Labs	Verified email at intel.com
Yaman Umuroglu	Norwegian University of Science and Technology	Verified email at ntnu.no
Michaela Blott	AMD Research	Verified email at amd.com
Alexander M. Rush	Associate Professor, Cornell University	Verified email at cornell.edu
Ravi Iyer	Google	Verified email at google.com
Xingyou (Richard) Song	Research Scientist, Google DeepMind	Verified email at google.com

Yash Akhauri

Google Research, PhD Candidate at Cornell University

Verified email at cornell.edu

Computer Vision, Quantized Deep Learning, AutoML, Intelligent Systems


Title	Authors	Venue	Cited by	Year
LogicNets: Co-designed neural networks and circuits for extreme-throughput applications	Y Akhauri, Y Umuroglu, NJ Fraser, M Blott	2020 30th International Conference on Field-Programmable Logic and …, 2020	160*	2020
Accelerating diffusion language model inference via efficient kv caching and guided diffusion	Z Hu, J Meng, Y Akhauri, MS Abdelfattah, J Seo, Z Zhang, U Gupta	arXiv preprint arXiv:2505.21467, 2025	23	2025
EZNAS: evolving zero-cost proxies for neural architecture scoring	Y Akhauri, J Munoz, N Jain, R Iyer	Advances in Neural Information Processing Systems 35, 30459-30470, 2022	23*	2022
Enabling nas with automated super-network generation	JP Muñoz, N Lyalyushkin, Y Akhauri, A Senina, A Kozlov, N Jain	arXiv preprint arXiv:2112.10878, 2021	20	2021
Shadowllm: Predictor-based contextual sparsity for large language models	Y Akhauri, AF AbouElhamayed, J Dotzel, Z Zhang, AM Rush, S Huda, ...	arXiv preprint arXiv:2406.16635, 2024	15	2024
Esoteric Language Models	SS Sahoo, Z Yang, Y Akhauri, J Liu, D Singh, Z Cheng, Z Liu, E Xing, ...	arXiv preprint arXiv:2506.01928, 2025	12	2025
Encodings for prediction-based neural architecture search	Y Akhauri, MS Abdelfattah	arXiv preprint arXiv:2403.02484, 2024	12	2024
HadaNets: Flexible quantization strategies for neural networks	Y Akhauri	Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019	12	2019
xkv: Cross-layer svd for kv-cache compression	CC Chang, CY Lin, Y Akhauri, WC Lin, KC Wu, L Ceze, MS Abdelfattah	arXiv preprint arXiv:2503.18893, 2025	10	2025
High-throughput dnn inference with logicnets	Y Umuroglu, Y Akhauri, NJ Fraser, M Blott	2020 IEEE 28th Annual International Symposium on Field-Programmable Custom …, 2020	9	2020
Splitreason: Learning to offload reasoning	Y Akhauri, A Fei, CC Chang, AF AbouElhamayed, Y Li, MS Abdelfattah	arXiv preprint arXiv:2504.16379, 2025	8	2025
Multi-predict: few shot predictors for efficient neural architecture search	Y Akhauri, MS Abdelfattah	arXiv preprint arXiv:2306.02459, 2023	8	2023
On latency predictors for neural architecture search	Y Akhauri, MS Abdelfattah	Proceedings of Machine Learning and Systems 6, 512-523, 2024	7	2024
Methods and apparatus to perform weight and activation compression and decompression	N Jain, M Adelman, R Sade, R Iyer, R Poornachandran, Y Akhauri	US Patent 12,425,047, 2025	6	2025
Apparatus, articles of manufacture, and methods for composable machine learning compute nodes	E Nurvitadhi, R Poornachandran, A Davare, N Jain, C Lacewell, ...	US Patent App. 17/558,284, 2022	6	2022
Performance Prediction for Large Systems via Text-to-Text Regression	Y Akhauri, B Lewandowski, CH Lin, AN Reyes, GC Forbes, ...	arXiv preprint arXiv:2506.21718, 2025	4	2025
Rhnas: Realizable hardware and neural architecture search	Y Akhauri, A Niranjan, JP Muñoz, S Banerjee, A Davare, P Cocchini, ...	arXiv preprint arXiv:2106.09180, 2021	3	2021
Regression language models for code	Y Akhauri, X Song, A Wongpanich, B Lewandowski, MS Abdelfattah	arXiv preprint arXiv:2509.26476, 2025	2	2025
Tokenbutler: Token importance is predictable	Y Akhauri, AF AbouElhamayed, Y Gao, CC Chang, N Jain, ...	arXiv preprint arXiv:2503.07518, 2025	2	2025
SparAMX: Accelerating Compressed LLMs Token Generation on AMX-powered CPUs	AF AbouElhamayed, J Dotzel, Y Akhauri, CC Chang, S Gobriel, JP Muñoz, ...	arXiv preprint arXiv:2502.12444, 2025	2	2025
