Picture for Aviral Kumar

Aviral Kumar

POPE: Learning to Reason on Hard Problems via Privileged On-Policy Exploration

Add code
Jan 26, 2026
Viaarxiv icon

InT: Self-Proposed Interventions Enable Credit Assignment in LLM Reasoning

Add code
Jan 20, 2026
Viaarxiv icon

TRIM: Hybrid Inference via Targeted Stepwise Routing in Multi-Step Reasoning Tasks

Add code
Jan 15, 2026
Viaarxiv icon

WebGym: Scaling Training Environments for Visual Web Agents with Realistic Tasks

Add code
Jan 07, 2026
Viaarxiv icon

RLAD: Training LLMs to Discover Abstractions for Solving Reasoning Problems

Add code
Oct 02, 2025
Figure 1 for RLAD: Training LLMs to Discover Abstractions for Solving Reasoning Problems
Figure 2 for RLAD: Training LLMs to Discover Abstractions for Solving Reasoning Problems
Figure 3 for RLAD: Training LLMs to Discover Abstractions for Solving Reasoning Problems
Figure 4 for RLAD: Training LLMs to Discover Abstractions for Solving Reasoning Problems
Viaarxiv icon

RaC: Robot Learning for Long-Horizon Tasks by Scaling Recovery and Correction

Add code
Sep 09, 2025
Viaarxiv icon

Compute-Optimal Scaling for Value-Based Deep RL

Add code
Aug 20, 2025
Viaarxiv icon

Reasoning as an Adaptive Defense for Safety

Add code
Jul 01, 2025
Figure 1 for Reasoning as an Adaptive Defense for Safety
Figure 2 for Reasoning as an Adaptive Defense for Safety
Figure 3 for Reasoning as an Adaptive Defense for Safety
Figure 4 for Reasoning as an Adaptive Defense for Safety
Viaarxiv icon

e3: Learning to Explore Enables Extrapolation of Test-Time Compute for LLMs

Add code
Jun 10, 2025
Viaarxiv icon

Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction

Add code
Jun 09, 2025
Figure 1 for Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction
Figure 2 for Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction
Figure 3 for Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction
Figure 4 for Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction
Viaarxiv icon