Ilia Kulikov

NaturalThoughts: Selecting and Distilling Reasoning Traces for General Reasoning Tasks

Jul 02, 2025

Bridging Offline and Online Reinforcement Learning for LLMs

Jun 26, 2025

J1: Incentivizing Thinking in LLM-as-a-Judge via Reinforcement Learning

May 15, 2025

NaturalReasoning: Reasoning in the Wild with 2.8M Challenging Questions

Feb 18, 2025

LLM Pretraining with Continuous Concepts

Feb 12, 2025

Diverse Preference Optimization

Jan 31, 2025

Adaptive Decoding via Latent Preference Optimization

Nov 14, 2024

Self-Taught Evaluators

Aug 05, 2024

Distilling System 2 into System 1

Jul 09, 2024

Investigating Decoder-only Large Language Models for Speech-to-text Translation

Jul 03, 2024