Genta Indra Winata

Shammie

SEADialogues: A Multilingual Culturally Grounded Multi-turn Dialogue Dataset on Southeast Asian Languages

Aug 09, 2025

IndoPref: A Multi-Domain Pairwise Preference Dataset for Indonesian

Jul 29, 2025

Language Surgery in Multilingual Large Language Models

Jun 14, 2025

T1: A Tool-Oriented Conversational Dataset for Multi-Turn Agentic Planning

May 22, 2025

R3: Robust Rubric-Agnostic Reward Models

May 19, 2025

Behind Maya: Building a Multilingual Vision Language Model

May 15, 2025

Crosslingual Reasoning through Test-Time Scaling

May 08, 2025

What Causes Knowledge Loss in Multilingual Language Models?

Apr 29, 2025

Fine-Tuning Diffusion Generative Models via Rich Preference Optimization

Mar 13, 2025

Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia

Mar 10, 2025