Picture for Ilia Kulikov

Ilia Kulikov

Safety Alignment of LMs via Non-cooperative Games

Add code
Dec 23, 2025
Viaarxiv icon

Hybrid Reinforcement: When Reward Is Sparse, It's Better to Be Dense

Add code
Oct 08, 2025
Figure 1 for Hybrid Reinforcement: When Reward Is Sparse, It's Better to Be Dense
Figure 2 for Hybrid Reinforcement: When Reward Is Sparse, It's Better to Be Dense
Figure 3 for Hybrid Reinforcement: When Reward Is Sparse, It's Better to Be Dense
Figure 4 for Hybrid Reinforcement: When Reward Is Sparse, It's Better to Be Dense
Viaarxiv icon

OptimalThinkingBench: Evaluating Over and Underthinking in LLMs

Add code
Aug 18, 2025
Figure 1 for OptimalThinkingBench: Evaluating Over and Underthinking in LLMs
Figure 2 for OptimalThinkingBench: Evaluating Over and Underthinking in LLMs
Figure 3 for OptimalThinkingBench: Evaluating Over and Underthinking in LLMs
Figure 4 for OptimalThinkingBench: Evaluating Over and Underthinking in LLMs
Viaarxiv icon

Learning to Reason for Factuality

Add code
Aug 07, 2025
Viaarxiv icon

CoT-Self-Instruct: Building high-quality synthetic prompts for reasoning and non-reasoning tasks

Add code
Jul 31, 2025
Viaarxiv icon

NaturalThoughts: Selecting and Distilling Reasoning Traces for General Reasoning Tasks

Add code
Jul 02, 2025
Figure 1 for NaturalThoughts: Selecting and Distilling Reasoning Traces for General Reasoning Tasks
Figure 2 for NaturalThoughts: Selecting and Distilling Reasoning Traces for General Reasoning Tasks
Figure 3 for NaturalThoughts: Selecting and Distilling Reasoning Traces for General Reasoning Tasks
Figure 4 for NaturalThoughts: Selecting and Distilling Reasoning Traces for General Reasoning Tasks
Viaarxiv icon

Bridging Offline and Online Reinforcement Learning for LLMs

Add code
Jun 26, 2025
Viaarxiv icon

J1: Incentivizing Thinking in LLM-as-a-Judge via Reinforcement Learning

Add code
May 15, 2025
Viaarxiv icon

NaturalReasoning: Reasoning in the Wild with 2.8M Challenging Questions

Add code
Feb 18, 2025
Viaarxiv icon

LLM Pretraining with Continuous Concepts

Add code
Feb 12, 2025
Figure 1 for LLM Pretraining with Continuous Concepts
Figure 2 for LLM Pretraining with Continuous Concepts
Figure 3 for LLM Pretraining with Continuous Concepts
Figure 4 for LLM Pretraining with Continuous Concepts
Viaarxiv icon