| Searching for activation functions P Ramachandran, B Zoph, QV Le arXiv preprint arXiv:1710.05941, 2017 | 6253* | 2017 |
| Stand-alone self-attention in vision models P Ramachandran, N Parmar, A Vaswani, I Bello, A Levskaya, J Shlens Advances in neural information processing systems 32, 2019 | 1831 | 2019 |
| Gemini 2.5: Pushing the frontier with advanced reasoning, multimodality, long context, and next generation agentic capabilities G Comanici, E Bieber, M Schaekermann, I Pasupat, N Sachdeva, I Dhillon, ... arXiv preprint arXiv:2507.06261, 2025 | 1217 | 2025 |
| Scaling local self-attention for parameter efficient visual backbones A Vaswani, P Ramachandran, A Srinivas, N Parmar, B Hechtman, ... Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021 | 601 | 2021 |
| Revisiting fundamentals of experience replay W Fedus, P Ramachandran, R Agarwal, Y Bengio, H Larochelle, ... International conference on machine learning, 3061-3071, 2020 | 422 | 2020 |
| Seq-nms for video object detection W Han, P Khorrami, TL Paine, P Ramachandran, M Babaeizadeh, H Shi, ... arXiv preprint arXiv:1602.08465, 2016 | 420 | 2016 |
| Unsupervised pretraining for sequence to sequence learning P Ramachandran, PJ Liu, Q Le Proceedings of the 2017 Conference on Empirical Methods in Natural Language …, 2017 | 375 | 2017 |
| Fast generation for convolutional autoregressive models P Ramachandran, TL Paine, P Khorrami, M Babaeizadeh, S Chang, ... arXiv preprint arXiv:1704.06001, 2017 | 208* | 2017 |
| Le Q. Searching for activation functions. ArXiv P Ramachandran preprint., 2017 | 198 | 2017 |
| Stein variational policy gradient Y Liu, P Ramachandran, Q Liu, J Peng arXiv preprint arXiv:1704.02399, 2017 | 186 | 2017 |
| Revisiting spatial invariance with low-rank local connectivity G Elsayed, P Ramachandran, J Shlens, S Kornblith International Conference on Machine Learning, 2868-2879, 2020 | 70 | 2020 |
| Searching for activation functions, 2018 P Ramachandran, B Zoph, QV Le URL https://openreview. net/forum, 18, 2018 | 53 | 2018 |
| Diversity and depth in per-example routing models P Ramachandran, QV Le International Conference on Learning Representations, 2018 | 43 | 2018 |
| Backprop evolution M Alber, I Bello, B Zoph, PJ Kindermans, P Ramachandran, Q Le arXiv preprint arXiv:1808.02822, 2018 | 19 | 2018 |
| Thermal erosion of magnetoplasmadynamic thruster cathode RC Mehta, S Andrews, PV Ramachandran International journal of heat and mass transfer 39 (8), 1767-1769, 1996 | 5 | 1996 |
| Fully attentional computer vision J Shlens, AT Vaswani, NJ Parmar, P Ramachandran, AC Levskaya, ... US Patent 12,354,340, 2025 | 3 | 2025 |
| Object Detection in Video using Faster R-CNN P Ramachandran ICCV Workshop, India, 2015 | 3 | 2015 |
| Local self-attention computer vision neural networks AT Vaswani, P Ramachandran, AS Lakshminarayanan, BA Hechtman, ... US Patent App. 17/347,416, 2021 | 2 | 2021 |
| Fully attentional computer vision J Shlens, AT Vaswani, NJ Parmar, P Ramachandran, AC Levskaya, ... US Patent App. 19/226,069, 2025 | | 2025 |
| Neural network layers with a controlled degree of spatial invariance G Elsayed, P Ramachandran, J Shlens, S Kornblith US Patent 12,265,911, 2025 | | 2025 |