Picture for Dylan J. Foster

Dylan J. Foster

Learning to Reason with Curriculum I: Provable Benefits of Autocurriculum

Add code
Mar 18, 2026
Viaarxiv icon

Reject, Resample, Repeat: Understanding Parallel Reasoning in Language Model Inference

Add code
Mar 09, 2026
Viaarxiv icon

Is Best-of-N the Best of Them? Coverage, Scaling, and Optimality in Inference-Time Alignment

Add code
Mar 27, 2025
Figure 1 for Is Best-of-N the Best of Them? Coverage, Scaling, and Optimality in Inference-Time Alignment
Figure 2 for Is Best-of-N the Best of Them? Coverage, Scaling, and Optimality in Inference-Time Alignment
Figure 3 for Is Best-of-N the Best of Them? Coverage, Scaling, and Optimality in Inference-Time Alignment
Figure 4 for Is Best-of-N the Best of Them? Coverage, Scaling, and Optimality in Inference-Time Alignment
Viaarxiv icon

Is a Good Foundation Necessary for Efficient Reinforcement Learning? The Computational Role of the Base Model in Exploration

Add code
Mar 10, 2025
Viaarxiv icon

Computational-Statistical Tradeoffs at the Next-Token Prediction Barrier: Autoregressive and Imitation Learning under Misspecification

Add code
Feb 18, 2025
Viaarxiv icon

Necessary and Sufficient Oracles: Toward a Computational Taxonomy For Reinforcement Learning

Add code
Feb 12, 2025
Figure 1 for Necessary and Sufficient Oracles: Toward a Computational Taxonomy For Reinforcement Learning
Figure 2 for Necessary and Sufficient Oracles: Toward a Computational Taxonomy For Reinforcement Learning
Figure 3 for Necessary and Sufficient Oracles: Toward a Computational Taxonomy For Reinforcement Learning
Viaarxiv icon

Self-Improvement in Language Models: The Sharpening Mechanism

Add code
Dec 02, 2024
Figure 1 for Self-Improvement in Language Models: The Sharpening Mechanism
Figure 2 for Self-Improvement in Language Models: The Sharpening Mechanism
Figure 3 for Self-Improvement in Language Models: The Sharpening Mechanism
Figure 4 for Self-Improvement in Language Models: The Sharpening Mechanism
Viaarxiv icon

Reinforcement Learning under Latent Dynamics: Toward Statistical and Algorithmic Modularity

Add code
Oct 23, 2024
Figure 1 for Reinforcement Learning under Latent Dynamics: Toward Statistical and Algorithmic Modularity
Viaarxiv icon

Assouad, Fano, and Le Cam with Interaction: A Unifying Lower Bound Framework and Characterization for Bandit Learnability

Add code
Oct 07, 2024
Viaarxiv icon

Is Behavior Cloning All You Need? Understanding Horizon in Imitation Learning

Add code
Jul 20, 2024
Figure 1 for Is Behavior Cloning All You Need? Understanding Horizon in Imitation Learning
Figure 2 for Is Behavior Cloning All You Need? Understanding Horizon in Imitation Learning
Figure 3 for Is Behavior Cloning All You Need? Understanding Horizon in Imitation Learning
Figure 4 for Is Behavior Cloning All You Need? Understanding Horizon in Imitation Learning
Viaarxiv icon