Picture for Sanchit Ahuja

Sanchit Ahuja

Every Eval Ever: A Unifying Schema and Community Repository for AI Evaluation Results

Add code
Jun 12, 2026
Viaarxiv icon

Parameter Alignment Mitigates Catastrophic Forgetting in Multilingual Expert Language Models

Add code
May 29, 2026
Viaarxiv icon

When AI Benchmarks Plateau: A Systematic Study of Benchmark Saturation

Add code
Feb 18, 2026
Viaarxiv icon

Contamination Report for Multilingual Benchmarks

Add code
Oct 21, 2024
Figure 1 for Contamination Report for Multilingual Benchmarks
Figure 2 for Contamination Report for Multilingual Benchmarks
Viaarxiv icon

Scaling Laws for Multilingual Language Models

Add code
Oct 15, 2024
Figure 1 for Scaling Laws for Multilingual Language Models
Figure 2 for Scaling Laws for Multilingual Language Models
Figure 3 for Scaling Laws for Multilingual Language Models
Figure 4 for Scaling Laws for Multilingual Language Models
Viaarxiv icon

sPhinX: Sample Efficient Multilingual Instruction Fine-Tuning Through N-shot Guided Prompting

Add code
Jul 16, 2024
Viaarxiv icon

SemEval Task 1: Semantic Textual Relatedness for African and Asian Languages

Add code
Apr 04, 2024
Figure 1 for SemEval Task 1: Semantic Textual Relatedness for African and Asian Languages
Figure 2 for SemEval Task 1: Semantic Textual Relatedness for African and Asian Languages
Figure 3 for SemEval Task 1: Semantic Textual Relatedness for African and Asian Languages
Figure 4 for SemEval Task 1: Semantic Textual Relatedness for African and Asian Languages
Viaarxiv icon

SemRel2024: A Collection of Semantic Textual Relatedness Datasets for 14 Languages

Add code
Feb 15, 2024
Figure 1 for SemRel2024: A Collection of Semantic Textual Relatedness Datasets for 14 Languages
Figure 2 for SemRel2024: A Collection of Semantic Textual Relatedness Datasets for 14 Languages
Figure 3 for SemRel2024: A Collection of Semantic Textual Relatedness Datasets for 14 Languages
Figure 4 for SemRel2024: A Collection of Semantic Textual Relatedness Datasets for 14 Languages
Viaarxiv icon

MEGAVERSE: Benchmarking Large Language Models Across Languages, Modalities, Models and Tasks

Add code
Nov 13, 2023
Viaarxiv icon