Alert button
Picture for David Moeljadi

David Moeljadi

Alert button

NusaWrites: Constructing High-Quality Corpora for Underrepresented and Extremely Low-Resource Languages

Add code
Bookmark button
Alert button
Sep 20, 2023
Samuel Cahyawijaya, Holy Lovenia, Fajri Koto, Dea Adhista, Emmanuel Dave, Sarah Oktavianti, Salsabil Maulana Akbar, Jhonson Lee, Nuur Shadieq, Tjeng Wawan Cenggoro, Hanung Wahyuning Linuwih, Bryan Wilie, Galih Pradipta Muridan, Genta Indra Winata, David Moeljadi, Alham Fikri Aji, Ayu Purwarianti, Pascale Fung

Figure 1 for NusaWrites: Constructing High-Quality Corpora for Underrepresented and Extremely Low-Resource Languages
Figure 2 for NusaWrites: Constructing High-Quality Corpora for Underrepresented and Extremely Low-Resource Languages
Figure 3 for NusaWrites: Constructing High-Quality Corpora for Underrepresented and Extremely Low-Resource Languages
Figure 4 for NusaWrites: Constructing High-Quality Corpora for Underrepresented and Extremely Low-Resource Languages
Viaarxiv icon

NusaCrowd: Open Source Initiative for Indonesian NLP Resources

Add code
Bookmark button
Alert button
Dec 20, 2022
Samuel Cahyawijaya, Holy Lovenia, Alham Fikri Aji, Genta Indra Winata, Bryan Wilie, Rahmad Mahendra, Christian Wibisono, Ade Romadhony, Karissa Vincentio, Fajri Koto, Jennifer Santoso, David Moeljadi, Cahya Wirawan, Frederikus Hudi, Ivan Halim Parmonangan, Ika Alfina, Muhammad Satrio Wicaksono, Ilham Firdausi Putra, Samsul Rahmadani, Yulianti Oenang, Ali Akbar Septiandri, James Jaya, Kaustubh D. Dhole, Arie Ardiyanti Suryani, Rifki Afina Putri, Dan Su, Keith Stevens, Made Nindyatama Nityasya, Muhammad Farid Adilazuarda, Ryan Ignatius, Ryandito Diandaru, Tiezheng Yu, Vito Ghifari, Wenliang Dai, Yan Xu, Dyah Damapuspita, Cuk Tho, Ichwanul Muslim Karo Karo, Tirana Noor Fatyanosa, Ziwei Ji, Pascale Fung, Graham Neubig, Timothy Baldwin, Sebastian Ruder, Herry Sujaini, Sakriani Sakti, Ayu Purwarianti

Figure 1 for NusaCrowd: Open Source Initiative for Indonesian NLP Resources
Figure 2 for NusaCrowd: Open Source Initiative for Indonesian NLP Resources
Figure 3 for NusaCrowd: Open Source Initiative for Indonesian NLP Resources
Figure 4 for NusaCrowd: Open Source Initiative for Indonesian NLP Resources
Viaarxiv icon

NusaCrowd: A Call for Open and Reproducible NLP Research in Indonesian Languages

Add code
Bookmark button
Alert button
Aug 01, 2022
Samuel Cahyawijaya, Alham Fikri Aji, Holy Lovenia, Genta Indra Winata, Bryan Wilie, Rahmad Mahendra, Fajri Koto, David Moeljadi, Karissa Vincentio, Ade Romadhony, Ayu Purwarianti

Figure 1 for NusaCrowd: A Call for Open and Reproducible NLP Research in Indonesian Languages
Figure 2 for NusaCrowd: A Call for Open and Reproducible NLP Research in Indonesian Languages
Figure 3 for NusaCrowd: A Call for Open and Reproducible NLP Research in Indonesian Languages
Figure 4 for NusaCrowd: A Call for Open and Reproducible NLP Research in Indonesian Languages
Viaarxiv icon

NusaX: Multilingual Parallel Sentiment Dataset for 10 Indonesian Local Languages

Add code
Bookmark button
Alert button
May 31, 2022
Genta Indra Winata, Alham Fikri Aji, Samuel Cahyawijaya, Rahmad Mahendra, Fajri Koto, Ade Romadhony, Kemal Kurniawan, David Moeljadi, Radityo Eko Prasojo, Pascale Fung, Timothy Baldwin, Jey Han Lau, Rico Sennrich, Sebastian Ruder

Figure 1 for NusaX: Multilingual Parallel Sentiment Dataset for 10 Indonesian Local Languages
Figure 2 for NusaX: Multilingual Parallel Sentiment Dataset for 10 Indonesian Local Languages
Figure 3 for NusaX: Multilingual Parallel Sentiment Dataset for 10 Indonesian Local Languages
Figure 4 for NusaX: Multilingual Parallel Sentiment Dataset for 10 Indonesian Local Languages
Viaarxiv icon

One Country, 700+ Languages: NLP Challenges for Underrepresented Languages and Dialects in Indonesia

Add code
Bookmark button
Alert button
Mar 24, 2022
Alham Fikri Aji, Genta Indra Winata, Fajri Koto, Samuel Cahyawijaya, Ade Romadhony, Rahmad Mahendra, Kemal Kurniawan, David Moeljadi, Radityo Eko Prasojo, Timothy Baldwin, Jey Han Lau, Sebastian Ruder

Figure 1 for One Country, 700+ Languages: NLP Challenges for Underrepresented Languages and Dialects in Indonesia
Figure 2 for One Country, 700+ Languages: NLP Challenges for Underrepresented Languages and Dialects in Indonesia
Figure 3 for One Country, 700+ Languages: NLP Challenges for Underrepresented Languages and Dialects in Indonesia
Figure 4 for One Country, 700+ Languages: NLP Challenges for Underrepresented Languages and Dialects in Indonesia
Viaarxiv icon