Picture for Jason Weston

Jason Weston

Google

OptimalThinkingBench: Evaluating Over and Underthinking in LLMs

Add code
Aug 18, 2025
Viaarxiv icon

Learning to Reason for Factuality

Add code
Aug 07, 2025
Viaarxiv icon

CoT-Self-Instruct: Building high-quality synthetic prompts for reasoning and non-reasoning tasks

Add code
Jul 31, 2025
Viaarxiv icon

MetaCLIP 2: A Worldwide Scaling Recipe

Add code
Jul 29, 2025
Viaarxiv icon

NaturalThoughts: Selecting and Distilling Reasoning Traces for General Reasoning Tasks

Add code
Jul 02, 2025
Viaarxiv icon

J1: Incentivizing Thinking in LLM-as-a-Judge via Reinforcement Learning

Add code
May 15, 2025
Viaarxiv icon

Multi-Token Attention

Add code
Apr 01, 2025
Figure 1 for Multi-Token Attention
Figure 2 for Multi-Token Attention
Figure 3 for Multi-Token Attention
Figure 4 for Multi-Token Attention
Viaarxiv icon

SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks

Add code
Mar 19, 2025
Figure 1 for SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks
Figure 2 for SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks
Figure 3 for SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks
Figure 4 for SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks
Viaarxiv icon

LLM Pretraining with Continuous Concepts

Add code
Feb 12, 2025
Viaarxiv icon

Diverse Preference Optimization

Add code
Jan 31, 2025
Figure 1 for Diverse Preference Optimization
Figure 2 for Diverse Preference Optimization
Figure 3 for Diverse Preference Optimization
Figure 4 for Diverse Preference Optimization
Viaarxiv icon