| Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context G Team, P Georgiev, VI Lei, R Burnell, L Bai, A Gulati, G Tanzer, ... arXiv preprint arXiv:2403.05530, 2024 | 3462 | 2024 |
| Gemini 2.5: Pushing the frontier with advanced reasoning, multimodality, long context, and next generation agentic capabilities G Comanici, E Bieber, M Schaekermann, I Pasupat, N Sachdeva, I Dhillon, ... arXiv preprint arXiv:2507.06261, 2025 | 1266 | 2025 |
| AudioPaLM: A large language model that can speak and listen PK Rubenstein, C Asawaroengchai, DD Nguyen, A Bapna, Z Borsos, ... arXiv preprint arXiv:2306.12925, 2023 | 355 | 2023 |
| Spoken question answering and speech continuation using spectrogram-powered LLM E Nachmani, A Levkovitch, R Hirsch, J Salazar, C Asawaroengchai, ... arXiv preprint arXiv:2305.15255, 2023 | 95 | 2023 |
| Translatotron 3: Speech to speech translation with monolingual data E Nachmani, A Levkovitch, Y Ding, C Asawaroengchai, H Zen, ... ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 18 | 2024 |
| STAB: speech tokenizer assessment benchmark S Vashishth, H Singh, S Bharadwaj, S Ganapathy, C Asawaroengchai, ... arXiv preprint arXiv:2409.02384, 2024 | 8 | 2024 |
| Probabilistic learning models for topic extraction in Thai language C Asawaroengchai, W Chaisangmongkon, D Laowattana 2018 5th International Conference on Business and Industrial Research (ICBIR …, 2018 | 7 | 2018 |
| Generating 360 degree interactive content V Trairattanapa, P Leelaphattarakij, S Phanvilai, J Sukkasem, ... US Patent App. 16/714,354, 2021 | 6 | 2021 |
| Ramanovich PK Rubenstein, C Asawaroengchai, DD Nguyen, A Bapna, Z Borsos, ... Marco Tagliasacchi, Alexandru Tudor, Mihajlo Velimirovic, Damien Vincent …, 2023 | 5 | 2023 |
| Artificial intelligence for generating depth map C Asawaroengchai, S Phanvilai, P Leelaphattarakij, V Trairattanapa, ... US Patent App. 16/698,731, 2021 | 4 | 2021 |
| Performing tasks using generative neural networks PK Rubenstein, M Sharifi, A Tudor, C Asawaroengchai, DD Nguyen, ... US Patent App. 18/750,973, 2024 | | 2024 |
| Language models using spoken language modeling MD Tadmor, E Nachmani, A Levkovitch, J Salazar, C Asawaroengchai, ... US Patent App. 18/662,442, 2024 | | 2024 |
| Speech-to-speech translation with monolingual data MT Ramanovich, E Nachmani, A Levkovitch, B Chun, Y Ding, N Bar, ... US Patent App. 18/589,358, 2024 | | 2024 |