Alert button
Picture for Fajri Koto

Fajri Koto

Alert button

Cendol: Open Instruction-tuned Generative Large Language Models for Indonesian Languages

Add code
Bookmark button
Alert button
Apr 09, 2024
Samuel Cahyawijaya, Holy Lovenia, Fajri Koto, Rifki Afina Putri, Emmanuel Dave, Jhonson Lee, Nuur Shadieq, Wawan Cenggoro, Salsabil Maulana Akbar, Muhammad Ihza Mahendra, Dea Annisayanti Putri, Bryan Wilie, Genta Indra Winata, Alham Fikri Aji, Ayu Purwarianti, Pascale Fung

Viaarxiv icon

IndoCulture: Exploring Geographically-Influenced Cultural Commonsense Reasoning Across Eleven Indonesian Provinces

Add code
Bookmark button
Alert button
Apr 02, 2024
Fajri Koto, Rahmad Mahendra, Nurul Aisyah, Timothy Baldwin

Viaarxiv icon

ArabicMMLU: Assessing Massive Multitask Language Understanding in Arabic

Add code
Bookmark button
Alert button
Feb 20, 2024
Fajri Koto, Haonan Li, Sara Shatnawi, Jad Doughman, Abdelrahman Boda Sadallah, Aisha Alraeesi, Khalid Almubarak, Zaid Alyafeai, Neha Sengupta, Shady Shehata, Nizar Habash, Preslav Nakov, Timothy Baldwin

Viaarxiv icon

Zero-shot Sentiment Analysis in Low-Resource Languages Using a Multilingual Sentiment Lexicon

Add code
Bookmark button
Alert button
Feb 03, 2024
Fajri Koto, Tilman Beck, Zeerak Talat, Iryna Gurevych, Timothy Baldwin

Viaarxiv icon

LLM360: Towards Fully Transparent Open-Source LLMs

Add code
Bookmark button
Alert button
Dec 11, 2023
Zhengzhong Liu, Aurick Qiao, Willie Neiswanger, Hongyi Wang, Bowen Tan, Tianhua Tao, Junbo Li, Yuqi Wang, Suqi Sun, Omkar Pangarkar, Richard Fan, Yi Gu, Victor Miller, Yonghao Zhuang, Guowei He, Haonan Li, Fajri Koto, Liping Tang, Nikhil Ranjan, Zhiqiang Shen, Xuguang Ren, Roberto Iriondo, Cun Mu, Zhiting Hu, Mark Schulze, Preslav Nakov, Tim Baldwin, Eric P. Xing

Figure 1 for LLM360: Towards Fully Transparent Open-Source LLMs
Figure 2 for LLM360: Towards Fully Transparent Open-Source LLMs
Figure 3 for LLM360: Towards Fully Transparent Open-Source LLMs
Figure 4 for LLM360: Towards Fully Transparent Open-Source LLMs
Viaarxiv icon

Large Language Models Only Pass Primary School Exams in Indonesia: A Comprehensive Test on IndoMMLU

Add code
Bookmark button
Alert button
Oct 07, 2023
Fajri Koto, Nurul Aisyah, Haonan Li, Timothy Baldwin

Viaarxiv icon

NusaWrites: Constructing High-Quality Corpora for Underrepresented and Extremely Low-Resource Languages

Add code
Bookmark button
Alert button
Sep 20, 2023
Samuel Cahyawijaya, Holy Lovenia, Fajri Koto, Dea Adhista, Emmanuel Dave, Sarah Oktavianti, Salsabil Maulana Akbar, Jhonson Lee, Nuur Shadieq, Tjeng Wawan Cenggoro, Hanung Wahyuning Linuwih, Bryan Wilie, Galih Pradipta Muridan, Genta Indra Winata, David Moeljadi, Alham Fikri Aji, Ayu Purwarianti, Pascale Fung

Figure 1 for NusaWrites: Constructing High-Quality Corpora for Underrepresented and Extremely Low-Resource Languages
Figure 2 for NusaWrites: Constructing High-Quality Corpora for Underrepresented and Extremely Low-Resource Languages
Figure 3 for NusaWrites: Constructing High-Quality Corpora for Underrepresented and Extremely Low-Resource Languages
Figure 4 for NusaWrites: Constructing High-Quality Corpora for Underrepresented and Extremely Low-Resource Languages
Viaarxiv icon

Are Multilingual LLMs Culturally-Diverse Reasoners? An Investigation into Multicultural Proverbs and Sayings

Add code
Bookmark button
Alert button
Sep 15, 2023
Chen Cecilia Liu, Fajri Koto, Timothy Baldwin, Iryna Gurevych

Viaarxiv icon

Jais and Jais-chat: Arabic-Centric Foundation and Instruction-Tuned Open Generative Large Language Models

Add code
Bookmark button
Alert button
Aug 30, 2023
Neha Sengupta, Sunil Kumar Sahu, Bokang Jia, Satheesh Katipomu, Haonan Li, Fajri Koto, Osama Mohammed Afzal, Samta Kamboj, Onkar Pandit, Rahul Pal, Lalit Pradhan, Zain Muhammad Mujahid, Massa Baali, Alham Fikri Aji, Zhengzhong Liu, Andy Hock, Andrew Feldman, Jonathan Lee, Andrew Jackson, Preslav Nakov, Timothy Baldwin, Eric Xing

Figure 1 for Jais and Jais-chat: Arabic-Centric Foundation and Instruction-Tuned Open Generative Large Language Models
Figure 2 for Jais and Jais-chat: Arabic-Centric Foundation and Instruction-Tuned Open Generative Large Language Models
Figure 3 for Jais and Jais-chat: Arabic-Centric Foundation and Instruction-Tuned Open Generative Large Language Models
Figure 4 for Jais and Jais-chat: Arabic-Centric Foundation and Instruction-Tuned Open Generative Large Language Models
Viaarxiv icon

CMMLU: Measuring massive multitask language understanding in Chinese

Add code
Bookmark button
Alert button
Jun 15, 2023
Haonan Li, Yixuan Zhang, Fajri Koto, Yifei Yang, Hai Zhao, Yeyun Gong, Nan Duan, Timothy Baldwin

Figure 1 for CMMLU: Measuring massive multitask language understanding in Chinese
Figure 2 for CMMLU: Measuring massive multitask language understanding in Chinese
Figure 3 for CMMLU: Measuring massive multitask language understanding in Chinese
Figure 4 for CMMLU: Measuring massive multitask language understanding in Chinese
Viaarxiv icon