Picture for Nevasini Sasikumar

Nevasini Sasikumar

SWE-Marathon: Can Agents Autonomously Complete Ultra-Long-Horizon Software Work?

Add code
Jun 05, 2026
Viaarxiv icon

Echelon: Auditable Aggregate-Only Language-Model Adaptation Across Privacy Boundaries

Add code
Jun 01, 2026
Viaarxiv icon

All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages

Add code
Nov 25, 2024
Figure 1 for All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages
Figure 2 for All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages
Figure 3 for All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages
Figure 4 for All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages
Viaarxiv icon