Lucas Bandarkar

Extracting Small Translation Specialists from LLMs by Aggressively Pruning Experts

May 27, 2026
Via arXiv

Large Reasoning Models Struggle to Transfer Parametric Knowledge Across Scripts

Mar 17, 2026
Via arXiv

Knowledge Localization in Mixture-of-Experts LLMs Using Cross-Lingual Inconsistency

Mar 17, 2026
Via arXiv

Translation as a Scalable Proxy for Multilingual Evaluation

Jan 16, 2026
Via arXiv

Multilingual Routing in Mixture-of-Experts

Oct 06, 2025
Via arXiv

The Unreasonable Effectiveness of Model Merging for Cross-Lingual Transfer in LLMs

May 23, 2025
Via arXiv

FIG: Forward-Inverse Generation for Low-Resource Domain-specific Event Detection

Feb 24, 2025
Via arXiv

Layer Swapping for Zero-Shot Cross-Lingual Transfer in Large Language Models

Oct 02, 2024
Via arXiv

The Belebele Benchmark: a Parallel Reading Comprehension Dataset in 122 Language Variants

Aug 31, 2023
Via arXiv

Can Transformer Models Measure Coherence In Text? Re-Thinking the Shuffle Test

Jul 07, 2021
Via arXiv