Genta Indra Winata

Shammie

ProxyLM: Predicting Language Model Performance on Multilingual Tasks via Proxy Models

Jun 14, 2024
Via arXiv

MINERS: Multilingual Language Models as Semantic Retrievers

Jun 11, 2024
Via arXiv

Lessons from the Trenches on Reproducible Evaluation of Language Models

May 23, 2024
Via arXiv

Cendol: Open Instruction-tuned Generative Large Language Models for Indonesian Languages

Apr 09, 2024
Via arXiv

LinguAlchemy: Fusing Typological and Geographical Elements for Unseen Language Generalization

Feb 01, 2024
Via arXiv

IndoRobusta: Towards Robustness Against Diverse Code-Mixed Indonesian Local Languages

Nov 21, 2023
Via arXiv

IndoToD: A Multi-Domain Indonesian Benchmark For End-to-End Task-Oriented Dialogue Systems

Nov 02, 2023
Via arXiv

NusaWrites: Constructing High-Quality Corpora for Underrepresented and Extremely Low-Resource Languages

Sep 20, 2023
Via arXiv

Multilingual Few-Shot Learning via Language Model Retrieval

Jun 19, 2023
Via arXiv

On "Scientific Debt" in NLP: A Case for More Rigour in Language Model Pre-Training Research

Jun 05, 2023
Via arXiv