Dhruv Rohatgi

The Power of Test-Time Training for Approximate Sampling

Jun 09, 2026

Reject, Resample, Repeat: Understanding Parallel Reasoning in Language Model Inference

Mar 09, 2026

Steering diffusion models with quadratic rewards: a fine-grained analysis

Feb 18, 2026

Is a Good Foundation Necessary for Efficient Reinforcement Learning? The Computational Role of the Base Model in Exploration

Mar 10, 2025

Computational-Statistical Tradeoffs at the Next-Token Prediction Barrier: Autoregressive and Imitation Learning under Misspecification

Feb 18, 2025

Necessary and Sufficient Oracles: Toward a Computational Taxonomy For Reinforcement Learning

Feb 12, 2025

Self-Improvement in Language Models: The Sharpening Mechanism

Dec 02, 2024

Towards characterizing the value of edge embeddings in Graph Neural Networks

Oct 13, 2024

Online Control in Population Dynamics

Jun 03, 2024

Exploration is Harder than Prediction: Cryptographically Separating Reinforcement Learning from Supervised Learning

Apr 04, 2024