
Alexander Rakhlin

Learning with Simulators: No Regret in a Computationally Bounded World

Jun 11, 2026
Via arXiv

The Sample Complexity of Multiclass and Sparse Contextual Bandits

May 28, 2026
Via arXiv

End-to-End Efficient RL for Linear Bellman Complete MDPs with Deterministic Transitions

Mar 24, 2026
Via arXiv

Characterizing Online and Private Learnability under Distributional Constraints via Generalized Smoothness

Feb 24, 2026
Via arXiv

High-accuracy log-concave sampling with stochastic queries

Feb 15, 2026
Via arXiv

High-accuracy sampling for diffusion models and log-concave distributions

Feb 01, 2026
Via arXiv

Outcome-Based Online Reinforcement Learning: Algorithms and Fundamental Limits

May 26, 2025
Via arXiv

Trajectory Bellman Residual Minimization: A Simple Value-Based Method for LLM Reasoning

May 21, 2025
Via arXiv

Near-Optimal Private Learning in Linear Contextual Bandits

Feb 18, 2025
Via arXiv

Decision Making in Changing Environments: Robustness, Query-Based Learning, and Differential Privacy

Jan 24, 2025
Via arXiv