Picture for David Alvarez-Melis

David Alvarez-Melis

RL Excursions during Pre-Training: Re-examining Policy Optimization for LLM training

Add code
Jun 02, 2026
Viaarxiv icon

Low-Frequency Shortcuts in Texture-Driven Visual Learning

Add code
Jun 02, 2026
Viaarxiv icon

Why Larger Models Learn More: Effects of Capacity, Interference, and Rare-Task Retention

Add code
May 28, 2026
Viaarxiv icon

A Critical Look at Targeted Instruction Selection: Disentangling What Matters (and What Doesn't)

Add code
Feb 16, 2026
Viaarxiv icon

Stop Training for the Worst: Progressive Unmasking Accelerates Masked Diffusion Training

Add code
Feb 10, 2026
Viaarxiv icon

Reliable and Responsible Foundation Models: A Comprehensive Survey

Add code
Feb 04, 2026
Viaarxiv icon

KerJEPA: Kernel Discrepancies for Euclidean Self-Supervised Learning

Add code
Dec 22, 2025
Viaarxiv icon

Let's (not) just put things in Context: Test-Time Training for Long-Context LLMs

Add code
Dec 15, 2025
Viaarxiv icon

Boomerang Distillation Enables Zero-Shot Model Size Interpolation

Add code
Oct 06, 2025
Viaarxiv icon

Can Interpretation Predict Behavior on Unseen Data?

Add code
Jul 08, 2025
Viaarxiv icon