Picture for Charlie Snell

Charlie Snell

Learning Adaptive Parallel Reasoning with Language Models

Add code
Apr 21, 2025
Viaarxiv icon

Sleep-time Compute: Beyond Inference Scaling at Test-time

Add code
Apr 17, 2025
Viaarxiv icon

Reasoning Models Can Be Effective Without Thinking

Add code
Apr 14, 2025
Viaarxiv icon

Value-Based Deep RL Scales Predictably

Add code
Feb 06, 2025
Viaarxiv icon

Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Thought

Add code
Jan 08, 2025
Viaarxiv icon

Predicting Emergent Capabilities by Finetuning

Add code
Nov 25, 2024
Figure 1 for Predicting Emergent Capabilities by Finetuning
Figure 2 for Predicting Emergent Capabilities by Finetuning
Figure 3 for Predicting Emergent Capabilities by Finetuning
Figure 4 for Predicting Emergent Capabilities by Finetuning
Viaarxiv icon

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Add code
Aug 06, 2024
Figure 1 for Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
Figure 2 for Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
Figure 3 for Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
Figure 4 for Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
Viaarxiv icon

LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models

Add code
Nov 30, 2023
Figure 1 for LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models
Figure 2 for LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models
Figure 3 for LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models
Figure 4 for LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models
Viaarxiv icon

The False Promise of Imitating Proprietary LLMs

Add code
May 25, 2023
Figure 1 for The False Promise of Imitating Proprietary LLMs
Figure 2 for The False Promise of Imitating Proprietary LLMs
Figure 3 for The False Promise of Imitating Proprietary LLMs
Figure 4 for The False Promise of Imitating Proprietary LLMs
Viaarxiv icon

Learning by Distilling Context

Add code
Sep 30, 2022
Figure 1 for Learning by Distilling Context
Figure 2 for Learning by Distilling Context
Figure 3 for Learning by Distilling Context
Figure 4 for Learning by Distilling Context
Viaarxiv icon