Picture for Leshem Choshen

Leshem Choshen

Can Gradient Descent Simulate Prompting?

Add code
Jun 26, 2025
Viaarxiv icon

TextArena

Add code
Apr 15, 2025
Figure 1 for TextArena
Figure 2 for TextArena
Figure 3 for TextArena
Figure 4 for TextArena
Viaarxiv icon

Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora

Add code
Apr 10, 2025
Figure 1 for Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora
Figure 2 for Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora
Figure 3 for Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora
Figure 4 for Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora
Viaarxiv icon

Pretraining Language Models for Diachronic Linguistic Change Discovery

Add code
Apr 09, 2025
Figure 1 for Pretraining Language Models for Diachronic Linguistic Change Discovery
Figure 2 for Pretraining Language Models for Diachronic Linguistic Change Discovery
Figure 3 for Pretraining Language Models for Diachronic Linguistic Change Discovery
Figure 4 for Pretraining Language Models for Diachronic Linguistic Change Discovery
Viaarxiv icon

DOVE: A Large-Scale Multi-Dimensional Predictions Dataset Towards Meaningful LLM Evaluation

Add code
Mar 04, 2025
Viaarxiv icon

The Mighty ToRR: A Benchmark for Table Reasoning and Robustness

Add code
Feb 26, 2025
Figure 1 for The Mighty ToRR: A Benchmark for Table Reasoning and Robustness
Figure 2 for The Mighty ToRR: A Benchmark for Table Reasoning and Robustness
Figure 3 for The Mighty ToRR: A Benchmark for Table Reasoning and Robustness
Figure 4 for The Mighty ToRR: A Benchmark for Table Reasoning and Robustness
Viaarxiv icon

Sloth: scaling laws for LLM skills to predict multi-benchmark performance across families

Add code
Dec 09, 2024
Viaarxiv icon

Findings of the Second BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora

Add code
Dec 06, 2024
Viaarxiv icon

Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation

Add code
Dec 04, 2024
Figure 1 for Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation
Figure 2 for Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation
Figure 3 for Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation
Figure 4 for Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation
Viaarxiv icon

ZipNN: Lossless Compression for AI Models

Add code
Nov 07, 2024
Figure 1 for ZipNN: Lossless Compression for AI Models
Figure 2 for ZipNN: Lossless Compression for AI Models
Figure 3 for ZipNN: Lossless Compression for AI Models
Figure 4 for ZipNN: Lossless Compression for AI Models
Viaarxiv icon