Alexander Rakhlin

End-to-End Efficient RL for Linear Bellman Complete MDPs with Deterministic Transitions

Mar 24, 2026

Characterizing Online and Private Learnability under Distributional Constraints via Generalized Smoothness

Feb 24, 2026

High-accuracy log-concave sampling with stochastic queries

Feb 15, 2026

High-accuracy sampling for diffusion models and log-concave distributions

Feb 01, 2026

Outcome-Based Online Reinforcement Learning: Algorithms and Fundamental Limits

May 26, 2025

Trajectory Bellman Residual Minimization: A Simple Value-Based Method for LLM Reasoning

May 21, 2025

Near-Optimal Private Learning in Linear Contextual Bandits

Feb 18, 2025

Decision Making in Changing Environments: Robustness, Query-Based Learning, and Differential Privacy

Jan 24, 2025

Refined Risk Bounds for Unbounded Losses via Transductive Priors

Oct 29, 2024

How Does Variance Shape the Regret in Contextual Bandits?

Oct 16, 2024