| Few-shot learning with multilingual language models XV Lin, T Mihaylov, M Artetxe, T Wang, S Chen, D Simig, M Ott, N Goyal, ... arXiv preprint arXiv:2112.10668, 2021 | 17491* | 2021 |
| Opt: Open pre-trained transformer language models S Zhang, S Roller, N Goyal, M Artetxe, M Chen, S Chen, C Dewan, ... arXiv preprint arXiv:2205.01068, 2022 | 4202 | 2022 |
| Multilingual denoising pre-training for neural machine translation Y Liu, J Gu, N Goyal, X Li, S Edunov, M Ghazvininejad, M Lewis, ... Transactions of the Association for Computational Linguistics 8, 726-742, 2020 | 2409 | 2020 |
| Xi Victoria Lin, Todor Mihaylov, Myle Ott, Sam Shleifer, Kurt Shuster, Daniel Simig, Punit Singh Koura, Anjali Sridhar, Tianlu Wang, and Luke Zettlemoyer. 2022 S Zhang, S Roller, N Goyal, M Artetxe, M Chen, S Chen, C Dewan, ... Opt: Open pretrained transformer language models 1, 2022 | 1168 | 2022 |
| Chameleon: Mixed-modal early-fusion foundation models C Team arXiv preprint arXiv:2405.09818, 2024 | 653 | 2024 |
| Self-rewarding language models W Yuan, RY Pang, K Cho, X Li, S Sukhbaatar, J Xu, JE Weston Forty-first International Conference on Machine Learning, 2024 | 630 | 2024 |
| Multilingual translation with extensible multilingual pretraining and finetuning Y Tang, C Tran, X Li, PJ Chen, N Goyal, V Chaudhary, J Gu, A Fan arXiv preprint arXiv:2008.00401, 2020 | 552 | 2020 |
| Self-alignment with instruction backtranslation X Li, P Yu, C Zhou, T Schick, O Levy, L Zettlemoyer, J Weston, M Lewis arXiv preprint arXiv:2308.06259, 2023 | 321 | 2023 |
| Training large language models to reason in a continuous latent space S Hao, S Sukhbaatar, DJ Su, X Li, Z Hu, J Weston, Y Tian arXiv preprint arXiv:2412.06769, 2024 | 301 | 2024 |
| TWC LOGD: A portal for linked open government data ecosystems L Ding, T Lebo, JS Erickson, D DiFranzo, GT Williams, X Li, J Michaelis, ... Journal of Web Semantics 9 (3), 325-333, 2011 | 269 | 2011 |
| Flowseq: Non-autoregressive conditional sequence generation with generative flow X Ma, C Zhou, X Li, G Neubig, E Hovy arXiv preprint arXiv:1909.02480, 2019 | 235 | 2019 |
| Efficient large scale language modeling with mixtures of experts M Artetxe, S Bhosale, N Goyal, T Mihaylov, M Ott, S Shleifer, XV Lin, J Du, ... arXiv preprint arXiv:2112.10684, 2021 | 205 | 2021 |
| Multilingual translation from denoising pre-training Y Tang, C Tran, X Li, PJ Chen, N Goyal, V Chaudhary, J Gu, A Fan Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 …, 2021 | 202 | 2021 |
| A corpus for multilingual document classification in eight languages H Schwenk, X Li arXiv preprint arXiv:1805.09821, 2018 | 179 | 2018 |
| Multilingual speech translation from efficient finetuning of pretrained models X Li, C Wang, Y Tang, C Tran, Y Tang, J Pino, A Baevski, A Conneau, ... Proceedings of the 59th Annual Meeting of the Association for Computational …, 2021 | 173 | 2021 |
| On evaluation of adversarial perturbations for sequence-to-sequence models P Michel, X Li, G Neubig, J Pino Proceedings of the 2019 Conference of the North American Chapter of the …, 2019 | 172 | 2019 |
| Lifting the curse of multilinguality by pre-training modular transformers J Pfeiffer, N Goyal, X Lin, X Li, J Cross, S Riedel, M Artetxe Proceedings of the 2022 Conference of the North American Chapter of the …, 2022 | 146 | 2022 |
| Data-gov Wiki: Towards Linking Government Data. L Ding, D DiFranzo, A Graves, J Michaelis, X Li, DL McGuinness, ... AAAI Spring Symposium: Linked data meets artificial intelligence 10, 1-1, 2010 | 134 | 2010 |
| Jingfei Du, et al. 2021. Few-shot learning with multilingual language models XV Lin, T Mihaylov, M Artetxe, T Wang, S Chen, D Simig, M Ott, N Goyal, ... arXiv preprint arXiv:2112.10668, 35-40, 2021 | 127 | 2021 |
| Self-taught evaluators T Wang, I Kulikov, O Golovneva, P Yu, W Yuan, J Dwivedi-Yu, RY Pang, ... arXiv preprint arXiv:2408.02666, 2024 | 123 | 2024 |