Picture for Roberta Raileanu

Roberta Raileanu

Jack

Procedural Generation of Algorithm Discovery Tasks in Machine Learning

Add code
Mar 18, 2026
Viaarxiv icon

AIRS-Bench: a Suite of Tasks for Frontier AI Research Science Agents

Add code
Feb 09, 2026
Viaarxiv icon

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Add code
Jan 15, 2026
Viaarxiv icon

Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance

Add code
Nov 17, 2025
Viaarxiv icon

AI Research Agents for Machine Learning: Search, Exploration, and Generalization in MLE-bench

Add code
Jul 03, 2025
Viaarxiv icon

LLM-First Search: Self-Guided Exploration of the Solution Space

Add code
Jun 05, 2025
Viaarxiv icon

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Add code
Feb 20, 2025
Figure 1 for MLGym: A New Framework and Benchmark for Advancing AI Research Agents
Figure 2 for MLGym: A New Framework and Benchmark for Advancing AI Research Agents
Figure 3 for MLGym: A New Framework and Benchmark for Advancing AI Research Agents
Figure 4 for MLGym: A New Framework and Benchmark for Advancing AI Research Agents
Viaarxiv icon

MaestroMotif: Skill Design from Artificial Intelligence Feedback

Add code
Dec 11, 2024
Figure 1 for MaestroMotif: Skill Design from Artificial Intelligence Feedback
Figure 2 for MaestroMotif: Skill Design from Artificial Intelligence Feedback
Figure 3 for MaestroMotif: Skill Design from Artificial Intelligence Feedback
Figure 4 for MaestroMotif: Skill Design from Artificial Intelligence Feedback
Viaarxiv icon

Source2Synth: Synthetic Data Generation and Curation Grounded in Real Data Sources

Add code
Sep 12, 2024
Figure 1 for Source2Synth: Synthetic Data Generation and Curation Grounded in Real Data Sources
Figure 2 for Source2Synth: Synthetic Data Generation and Curation Grounded in Real Data Sources
Figure 3 for Source2Synth: Synthetic Data Generation and Curation Grounded in Real Data Sources
Figure 4 for Source2Synth: Synthetic Data Generation and Curation Grounded in Real Data Sources
Viaarxiv icon

The Llama 3 Herd of Models

Add code
Jul 31, 2024
Viaarxiv icon