Christoph Dann

A Minimaximalist Approach to Reinforcement Learning from Human Feedback

Jan 08, 2024
Gokul Swamy, Christoph Dann, Rahul Kidambi, Zhiwei Steven Wu, Alekh Agarwal

Via arXiv

Data-Driven Regret Balancing for Online Model Selection in Bandits

Jun 05, 2023
Aldo Pacchiano, Christoph Dann, Claudio Gentile

Via arXiv

A Blackbox Approach to Best of Both Worlds in Bandits and Beyond

Feb 20, 2023
Christoph Dann, Chen-Yu Wei, Julian Zimmert

Via arXiv

Best of Both Worlds Policy Optimization

Feb 18, 2023
Christoph Dann, Chen-Yu Wei, Julian Zimmert

Via arXiv

Learning in POMDPs is Sample-Efficient with Hindsight Observability

Feb 03, 2023
Jonathan N. Lee, Alekh Agarwal, Christoph Dann, Tong Zhang

Via arXiv

Pseudonorm Approachability and Applications to Regret Minimization

Feb 03, 2023
Christoph Dann, Yishay Mansour, Mehryar Mohri, Jon Schneider, Balasubramanian Sivan

Via arXiv

A Unified Algorithm for Stochastic Path Problems

Oct 17, 2022
Christoph Dann, Chen-Yu Wei, Julian Zimmert

Via arXiv

A Provably Efficient Model-Free Posterior Sampling Method for Episodic Reinforcement Learning

Aug 23, 2022
Christoph Dann, Mehryar Mohri, Tong Zhang, Julian Zimmert

Via arXiv

Best of Both Worlds Model Selection

Jun 29, 2022
Aldo Pacchiano, Christoph Dann, Claudio Gentile

Via arXiv