| Scalable and efficient moe training for multitask multilingual models YJ Kim, AA Awan, A Muzio, AFC Salinas, L Lu, A Hendy, S Rajbhandari, ... arXiv preprint arXiv:2109.10465, 2021 | 114 | 2021 |
| Efficacy of a computer-based cognitive training program in older people with subjective memory complaints: a randomized study AJ Pereira-Morales, AF Cruz-Salinas, J Aponte, F Pereira-Manrique International Journal of Neuroscience 128 (1), 1-9, 2018 | 54 | 2018 |
| Aya expanse: Combining research breakthroughs for a new multilingual frontier J Dang, S Singh, D D'souza, A Ahmadian, A Salamanca, M Smith, ... arXiv preprint arXiv:2412.04261, 2024 | 43 | 2024 |
| Multifusion: Fusing pre-trained models for multi-lingual, multi-modal image generation M Bellagente, M Brack, H Teufel, F Friedrich, B Deiseroth, C Eichenberg, ... Advances in Neural Information Processing Systems 36, 59502-59521, 2023 | 35 | 2023 |
| Command a: An enterprise-ready large language model T Cohere, A Ahmadian, M Ahmed, J Alammar, M Alizadeh, Y Alnumay, ... arXiv preprint arXiv:2504.00698, 2025 | 27 | 2025 |
| u-P: The Unit-Scaled Maximal Update Parametrization C Blake, C Eichenberg, J Dean, L Balles, LY Prince, B Deiseroth, ... arXiv preprint arXiv:2407.17465, 2024 | 21 | 2024 |
| Self-adaptation of genetic operators through genetic programming techniques AF Cruz-Salinas, JG Perdomo Proceedings of the Genetic and Evolutionary Computation Conference, 913-920, 2017 | 21 | 2017 |
| Building a great multi-lingual teacher with sparsely-gated mixture of experts for speech recognition K Kumatani, R Gmyr, FC Salinas, L Liu, W Zuo, D Patel, E Sun, Y Shi arXiv preprint arXiv:2112.05820, 2021 | 18 | 2021 |
| M-vader: A model for diffusion with multimodal context S Weinbach, M Bellagente, C Eichenberg, A Dai, R Baldock, S Nanda, ... arXiv preprint arXiv:2212.02936, 2022 | 11 | 2022 |
| An interactive tool to support student assessment in programming assignments LF Rosales-Castro, LA Chaparro-Gutiérrez, AF Cruz-Salinas, ... Ibero-American Conference on Artificial Intelligence, 404-414, 2016 | 5 | 2016 |
| Knowledge distillation for mixture of experts models in speech recognition FC Salinas, K Kumatani, R Gmyr, L Liu, Y Shi Technical Report MSR-TR-2022-6, Microsoft Research, May 2022. https://www …, 2022 | 4 | 2022 |
| One Tokenizer To Rule Them All: Emergent Language Plasticity via Multilingual Tokenizers D Abagyan, AR Salamanca, AF Cruz-Salinas, K Cao, H Lin, A Locatelli, ... arXiv preprint arXiv:2506.10766, 2025 | 3 | 2025 |