[go: up one dir, main page]

Follow
Yash Akhauri
Yash Akhauri
Google Research, PhD Candidate at Cornell University
Verified email at cornell.edu
Title
Cited by
Cited by
Year
LogicNets: Co-designed neural networks and circuits for extreme-throughput applications
Y Akhauri *, Y Umuroglu *, NJ Fraser, M Blott
2020 30th International Conference on Field-Programmable Logic and …, 2020
160*2020
Accelerating diffusion language model inference via efficient kv caching and guided diffusion
Z Hu, J Meng, Y Akhauri, MS Abdelfattah, J Seo, Z Zhang, U Gupta
arXiv preprint arXiv:2505.21467, 2025
232025
EZNAS: evolving zero-cost proxies for neural architecture scoring
Y Akhauri, J Munoz, N Jain, R Iyer
Advances in Neural Information Processing Systems 35, 30459-30470, 2022
23*2022
Enabling nas with automated super-network generation
JP Muñoz, N Lyalyushkin, Y Akhauri, A Senina, A Kozlov, N Jain
arXiv preprint arXiv:2112.10878, 2021
202021
Shadowllm: Predictor-based contextual sparsity for large language models
Y Akhauri, AF AbouElhamayed, J Dotzel, Z Zhang, AM Rush, S Huda, ...
arXiv preprint arXiv:2406.16635, 2024
152024
Esoteric Language Models
SS Sahoo, Z Yang, Y Akhauri, J Liu, D Singh, Z Cheng, Z Liu, E Xing, ...
arXiv preprint arXiv:2506.01928, 2025
122025
Encodings for prediction-based neural architecture search
Y Akhauri, MS Abdelfattah
arXiv preprint arXiv:2403.02484, 2024
122024
HadaNets: Flexible quantization strategies for neural networks
Y Akhauri
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019
122019
xkv: Cross-layer svd for kv-cache compression
CC Chang, CY Lin, Y Akhauri, WC Lin, KC Wu, L Ceze, MS Abdelfattah
arXiv preprint arXiv:2503.18893, 2025
102025
High-throughput dnn inference with logicnets
Y Umuroglu, Y Akhauri, NJ Fraser, M Blott
2020 IEEE 28th Annual International Symposium on Field-Programmable Custom …, 2020
92020
Splitreason: Learning to offload reasoning
Y Akhauri, A Fei, CC Chang, AF AbouElhamayed, Y Li, MS Abdelfattah
arXiv preprint arXiv:2504.16379, 2025
82025
Multi-predict: few shot predictors for efficient neural architecture search
Y Akhauri, MS Abdelfattah
arXiv preprint arXiv:2306.02459, 2023
82023
On latency predictors for neural architecture search
Y Akhauri, MS Abdelfattah
Proceedings of Machine Learning and Systems 6, 512-523, 2024
72024
Methods and apparatus to perform weight and activation compression and decompression
N Jain, M Adelman, R Sade, R Iyer, R Poornachandran, Y Akhauri
US Patent 12,425,047, 2025
62025
Apparatus, articles of manufacture, and methods for composable machine learning compute nodes
E Nurvitadhi, R Poornachandran, A Davare, N Jain, C Lacewell, ...
US Patent App. 17/558,284, 2022
62022
Performance Prediction for Large Systems via Text-to-Text Regression
Y Akhauri, B Lewandowski, CH Lin, AN Reyes, GC Forbes, ...
arXiv preprint arXiv:2506.21718, 2025
42025
Rhnas: Realizable hardware and neural architecture search
Y Akhauri, A Niranjan, JP Muñoz, S Banerjee, A Davare, P Cocchini, ...
arXiv preprint arXiv:2106.09180, 2021
32021
Regression language models for code
Y Akhauri, X Song, A Wongpanich, B Lewandowski, MS Abdelfattah
arXiv preprint arXiv:2509.26476, 2025
22025
Tokenbutler: Token importance is predictable
Y Akhauri, AF AbouElhamayed, Y Gao, CC Chang, N Jain, ...
arXiv preprint arXiv:2503.07518, 2025
22025
SparAMX: Accelerating Compressed LLMs Token Generation on AMX-powered CPUs
AF AbouElhamayed, J Dotzel, Y Akhauri, CC Chang, S Gobriel, JP Muñoz, ...
arXiv preprint arXiv:2502.12444, 2025
22025
The system can't perform the operation now. Try again later.
Articles 1–20