[go: up one dir, main page]

Follow
Sneha Kudugunta
Sneha Kudugunta
Google DeepMind
Verified email at google.com - Homepage
Title
Cited by
Cited by
Year
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
G Team, P Georgiev, VI Lei, R Burnell, L Bai, A Gulati, G Tanzer, ...
arXiv preprint arXiv:2403.05530, 2024
34392024
Palm 2 technical report
R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ...
arXiv preprint arXiv:2305.10403, 2023
23072023
Gemini 2.5: Pushing the frontier with advanced reasoning, multimodality, long context, and next generation agentic capabilities
G Comanici, E Bieber, M Schaekermann, I Pasupat, N Sachdeva, I Dhillon, ...
arXiv preprint arXiv:2507.06261, 2025
13372025
Deep neural networks for bot detection
S Kudugunta, E Ferrara
Information Sciences 467, 312-322, 2018
6402018
Quality at a glance: An audit of web-crawled multilingual datasets
J Kreutzer, I Caswell, L Wang, A Wahab, D Van Esch, N Ulzii-Orshikh, ...
Transactions of the Association for Computational Linguistics 10, 50-72, 2022
2612022
Madlad-400: A multilingual and document-level large audited dataset
S Kudugunta, I Caswell, B Zhang, X Garcia, D Xin, A Kusupati, R Stella, ...
Advances in Neural Information Processing Systems 36, 67284-67296, 2023
2152023
Beyond distillation: Task-level mixture-of-experts for efficient inference
S Kudugunta, Y Huang, A Bapna, M Krikun, D Lepikhin, MT Luong, O Firat
arXiv preprint arXiv:2110.03742, 2021
1462021
Investigating Multilingual NMT Representations at Scale
SR Kudugunta, A Bapna, I Caswell, N Arivazhagan, O Firat
arXiv preprint arXiv:1909.02197, 2019
1452019
A loss curvature perspective on training instabilities of deep learning models
J Gilmer, B Ghorbani, A Garg, S Kudugunta, B Neyshabur, D Cardoze, ...
International Conference on Learning Representations, 2022
97*2022
Mural: multimodal, multitask retrieval across languages
A Jain, M Guo, K Srinivasan, T Chen, S Kudugunta, C Jia, Y Yang, ...
arXiv preprint arXiv:2109.05125, 2021
972021
Leveraging monolingual data with self-supervision for multilingual neural machine translation
A Siddhant, A Bapna, Y Cao, O Firat, MX Chen, S Kudugunta, ...
Proceedings of the 58th Annual Meeting of the Association for Computational …, 2020
852020
BUFFET: Benchmarking large language models for few-shot cross-lingual transfer
A Asai, S Kudugunta, X Yu, T Blevins, H Gonen, M Reid, Y Tsvetkov, ...
Proceedings of the 2024 Conference of the North American Chapter of the …, 2024
82*2024
MatFormer: Nested Transformer for Elastic Inference
Devvrit*, S Kudugunta*, A Kusupati*, T Dettmers, K Chen, I Dhillon, ...
arXiv preprint arXiv:2310.07707, 2023
632023
(mis) fitting scaling laws: A survey of scaling law fitting techniques in deep learning
M Li, S Kudugunta, L Zettlemoyer
The Thirteenth International Conference on Learning Representations, 2025
14*2025
ATLAS: Adaptive Transfer Scaling Laws for Multilingual Pretraining, Finetuning, and Decoding the Curse of Multilinguality
S Longpre, S Kudugunta, N Muennighoff, I Hsu, I Caswell, A Pentland, ...
arXiv preprint arXiv:2510.22037, 2025
12025
Systems and methods for routing within multitask mixture-of-experts models
Y Huang, D Lepikhin, M Krikun, O Firat, A Bapna, T Luong, S Kudugunta
US Patent 12,242,948, 2025
12025
MiTTenS: A dataset for evaluating gender mistranslation
K Robinson, S Kudugunta, R Stella, S Dev, J Bastings
arXiv preprint arXiv:2401.06935, 2024
12024
The system can't perform the operation now. Try again later.
Articles 1–17