Sunil Madhow

Learnable Chernoff Baselines for Inference-Time Alignment

Feb 08, 2026
Via arXiv

Offline Policy Evaluation for Reinforcement Learning with Adaptively Collected Data

Jun 24, 2023
Via arXiv