Jason E Weston

Hybrid Reinforcement: When Reward Is Sparse, It's Better to Be Dense

Oct 08, 2025
Via arXiv

Bridging Offline and Online Reinforcement Learning for LLMs

Jun 26, 2025
Via arXiv

An Overview of Large Language Models for Statisticians

Feb 25, 2025
Via arXiv

NaturalReasoning: Reasoning in the Wild with 2.8M Challenging Questions

Feb 18, 2025
Via arXiv