Quanquan Gu

Online KL-Regularized Reinforcement Learning with Function Approximation under Misspecification

Jun 04, 2026
Via arXiv

Unlocking Feature Learning in Gated Delta Networks at Scale

Jun 02, 2026
Via arXiv

Self-Distilled Policy Gradient

Jun 02, 2026
Via arXiv

Fast Rates for Offline Contextual Bandits with Forward-KL Regularization under Single-Policy Concentrability

May 09, 2026
Via arXiv

On the Optimal Sample Complexity of Offline Multi-Armed Bandits with KL Regularization

May 04, 2026
Via arXiv

Transformers Trained via Gradient Descent Can Provably Learn a Class of Teacher Models

Mar 24, 2026
Via arXiv

Near-Optimal Regret for KL-Regularized Multi-Armed Bandits

Mar 02, 2026
Via arXiv

Dimension-Independent Convergence of Underdamped Langevin Monte Carlo in KL Divergence

Mar 02, 2026
Via arXiv

Protein Autoregressive Modeling via Multiscale Structure Generation

Feb 04, 2026
Via arXiv

Scalable Spatio-Temporal SE(3) Diffusion for Long-Horizon Protein Dynamics

Feb 02, 2026
Via arXiv