[go: up one dir, main page]

Follow
Fajri Koto
Fajri Koto
Assistant Professor (tenure-track), MBZUAI
Verified email at mbzuai.ac.ae - Homepage
Title
Cited by
Cited by
Year
NusaCrowd: Open source initiative for Indonesian NLP resources
S Cahyawijaya, H Lovenia, AF Aji, GI Winata, B Wilie, F Koto, R Mahendra, ...
Findings of the Association for Computational Linguistics: ACL 2023, 13745-13818, 2023
8172023
IndoLEM and IndoBERT: A Benchmark Dataset and Pre-trained Language Model for Indonesian NLP
F Koto, A Rahimi, JH Lau, T Baldwin
Proceedings of the 28th COLING 2020, 757-770, 2020
4552020
CMMLU: Measuring Massive Multitask Language Understanding in Chinese
H Li, Y Zhang, F Koto, Y Yang, H Zhao, Y Gong, N Duan, T Baldwin
Findings of ACL 2024, 2024
4192024
Inset lexicon: Evaluation of a word list for Indonesian sentiment analysis in microblogs
F Koto, GY Rahmaningtyas
2017 International Conference on Asian Language Processing (IALP), 391-394, 2017
2302017
Jais and Jais-chat: Arabic-Centric Foundation and Instruction-Tuned Open Generative Large Language Models
N Sengupta, SK Sahu, B Jia, S Katipomu, H Li, F Koto, OM Afzal, ...
Technical Report, 2023
186*2023
One Country, 700+ Languages: NLP Challenges for Underrepresented Languages and Dialects in Indonesia
AF Aji, GI Winata, F Koto, S Cahyawijaya, A Romadhony, R Mahendra, ...
Proceedings of ACL 2022, 2022
1532022
IndoBERTweet: A Pretrained Language Model for Indonesian Twitter with Effective Domain-Specific Vocabulary Initialization
F Koto, JH Lau, T Baldwin
Proceedings of EMNLP 2021, 2021
1412021
Llm360: Towards fully transparent open-source llms
Z Liu, A Qiao, W Neiswanger, H Wang, B Tan, T Tao, J Li, Y Wang, S Sun, ...
Proceedings of the First Conference on Language Modeling (COLM 2024), 2023
1122023
Bactrian-x: Multilingual replicable instruction-following models with low-rank adaptation
H Li, F Koto, M Wu, AF Aji, T Baldwin
arXiv preprint arXiv:2305.15011, 2023
1002023
Nusax: Multilingual parallel sentiment dataset for 10 indonesian local languages
GI Winata, AF Aji, S Cahyawijaya, R Mahendra, F Koto, A Romadhony, ...
Proceedings of the 17th EACL 2023, 2022
992022
Are multilingual llms culturally-diverse reasoners? an investigation into multicultural proverbs and sayings
CC Liu, F Koto, T Baldwin, I Gurevych
Proceedings of NAACL 2024, 2024
922024
A comparative study on twitter sentiment analysis: Which features are good?
F Koto, M Adriani
Proceedings of the 20th NLDB 2015, 453-457, 2015
912015
SMOTE-Out, SMOTE-Cosine, and Selected-SMOTE: An Enhancement Strategy to Handle Imbalance in Data Level
F Koto
The 6th ICACSIS, 2014
812014
CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark
D Romero, C Lyu, HA Wibowo, T Lynn, I Hamed, AN Kishore, A Mandal, ...
Proceedings of NeurIPS 2024, 2024
742024
ArabicMMLU: Assessing Massive Multitask Language Understanding in Arabic
F Koto, H Li, S Shatnawi, J Doughman, AB Sadallah, A Alraeesi, ...
Findings of ACL 2024, 2024
722024
Discourse Probing of Pretrained Language Models
F Koto, JH Lau, T Baldwin
Proceedings of NAACL 2021, 2021
632021
Liputan6: A Large-scale Indonesian Dataset for Text Summarization
F Koto, JH Lau, T Baldwin
Proceedings of AACL 2020, 2020
612020
Apparatus and method for sharing personal electronic-data of health
A Kurniawan, O ABDILLAH, Fajri
US Patent App. 15/221,140, 2017
52*2017
Large Language Models Only Pass Primary School Exams in Indonesia: A Comprehensive Test on IndoMMLU
F Koto, N Aisyah, H Li, T Baldwin
Proceedings of EMNLP 2023, 2023
462023
Zero-shot Sentiment Analysis in Low-Resource Languages Using a Multilingual Sentiment Lexicon
F Koto, T Beck, Z Talat, I Gurevych, T Baldwin
Proceedings of EACL 2024, 2024
422024
The system can't perform the operation now. Try again later.
Articles 1–20