[go: up one dir, main page]

Follow
Andreas Peter Steiner
Andreas Peter Steiner
Software engineer, Google Research
Verified email at google.com - Homepage
Title
Cited by
Cited by
Year
Mlp-mixer: An all-mlp architecture for vision
IO Tolstikhin, N Houlsby, A Kolesnikov, L Beyer, X Zhai, T Unterthiner, ...
Advances in neural information processing systems 34, 24261-24272, 2021
41452021
Gemini 2.5: Pushing the frontier with advanced reasoning, multimodality, long context, and next generation agentic capabilities
G Comanici, E Bieber, M Schaekermann, I Pasupat, N Sachdeva, I Dhillon, ...
arXiv preprint arXiv:2507.06261, 2025
13372025
Pali: A jointly-scaled multilingual language-image model
X Chen, X Wang, S Changpinyo, AJ Piergiovanni, P Padlewski, D Salz, ...
arXiv preprint arXiv:2209.06794, 2022
10142022
How to train your vit? data, augmentation, and regularization in vision transformers
A Steiner, A Kolesnikov, X Zhai, R Wightman, J Uszkoreit, L Beyer
arXiv preprint arXiv:2106.10270, 2021
9402021
Scaling vision transformers to 22 billion parameters
M Dehghani, J Djolonga, B Mustafa, P Padlewski, J Heek, J Gilmer, ...
International conference on machine learning, 7480-7512, 2023
8842023
Gemma 3 technical report
G Team, A Kamath, J Ferret, S Pathak, N Vieillard, R Merhej, S Perrin, ...
arXiv preprint arXiv:2503.19786, 2025
8182025
Lit: Zero-shot transfer with locked-image text tuning
X Zhai, X Wang, B Mustafa, A Steiner, D Keysers, A Kolesnikov, L Beyer
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022
7822022
Paligemma: A versatile 3b vlm for transfer
L Beyer, A Steiner, AS Pinto, A Kolesnikov, X Wang, D Salz, M Neumann, ...
arXiv preprint arXiv:2407.07726, 2024
5062024
Siglip 2: Multilingual vision-language encoders with improved semantic understanding, localization, and dense features
M Tschannen, A Gritsenko, X Wang, MF Naeem, I Alabdulmohsin, ...
arXiv preprint arXiv:2502.14786, 2025
3962025
Pali-x: On scaling up a multilingual vision and language model
X Chen, J Djolonga, P Padlewski, B Mustafa, S Changpinyo, J Wu, ...
arXiv preprint arXiv:2305.18565, 2023
2742023
Flax: A neural network library and ecosystem for JAX, 2020
J Heek, A Levskaya, A Oliver, M Ritter, B Rondepierre, A Steiner, ...
URL http://github. com/google/flax 1, 2020
2692020
Patch n’pack: Navit, a vision transformer for any aspect ratio and resolution
M Dehghani, B Mustafa, J Djolonga, J Heek, M Minderer, M Caron, ...
Advances in Neural Information Processing Systems 36, 2252-2274, 2023
2122023
Flax: A neural network library and ecosystem for JAX
J Heek, A Levskaya, A Oliver, M Ritter, B Rondepierre, A Steiner, ...
Version 0.3 3, 14-26, 2020
1992020
KvarQ: targeted and direct variant calling from fastq reads of bacterial genomes
A Steiner, D Stucki, M Coscolla, S Borrell, S Gagneux
BMC genomics 15 (1), 881, 2014
1742014
Paligemma 2: A family of versatile vlms for transfer
A Steiner, AS Pinto, M Tschannen, D Keysers, X Wang, Y Bitton, ...
arXiv preprint arXiv:2412.03555, 2024
1152024
Image captioners are scalable vision learners too
M Tschannen, M Kumar, A Steiner, X Zhai, N Houlsby, L Beyer
Advances in Neural Information Processing Systems 36, 46830-46855, 2023
972023
Gemma 3 technical report
A Kamath, J Ferret, S Pathak, N Vieillard, R Merhej, S Perrin, ...
CoRR, 2025
862025
How to train your ViT
A Steiner, A Kolesnikov, X Zhai, R Wightman, J Uszkoreit, L Beyer
Data, augmentation, and regularization in vision transformers 4, 5, 2021
742021
Mlp-mixer: An all-mlp architecture for vision, 2021
I Tolstikhin, N Houlsby, A Kolesnikov, L Beyer, X Zhai, T Unterthiner, ...
arXiv preprint arXiv:2105.01601, 0
41
No filter: Cultural and socioeconomic diversity in contrastive vision-language models
A Pouget, L Beyer, E Bugliarello, X Wang, A Steiner, X Zhai, ...
Advances in Neural Information Processing Systems 37, 106474-106496, 2024
362024
The system can't perform the operation now. Try again later.
Articles 1–20