Julian Zimmert

Optimal cross-learning for contextual bandits with unknown context distributions

Jan 03, 2024
Jon Schneider, Julian Zimmert

Via arXiv

Towards Optimal Regret in Adversarial Linear MDPs with Bandit Feedback

Oct 17, 2023
Haolin Liu, Chen-Yu Wei, Julian Zimmert

Via arXiv

Bypassing the Simulator: Near-Optimal Adversarial Linear Contextual Bandits

Sep 02, 2023
Haolin Liu, Chen-Yu Wei, Julian Zimmert

Via arXiv

An Improved Best-of-both-worlds Algorithm for Bandits with Delayed Feedback

Aug 21, 2023
Saeed Masoudian, Julian Zimmert, Yevgeny Seldin

Via arXiv

A Blackbox Approach to Best of Both Worlds in Bandits and Beyond

Feb 20, 2023
Christoph Dann, Chen-Yu Wei, Julian Zimmert

Via arXiv

Best of Both Worlds Policy Optimization

Feb 18, 2023
Christoph Dann, Chen-Yu Wei, Julian Zimmert

Via arXiv

Refined Regret for Adversarial MDPs with Linear Function Approximation

Jan 30, 2023
Yan Dai, Haipeng Luo, Chen-Yu Wei, Julian Zimmert

Via arXiv

A Unified Algorithm for Stochastic Path Problems

Oct 17, 2022
Christoph Dann, Chen-Yu Wei, Julian Zimmert

Via arXiv

A Provably Efficient Model-Free Posterior Sampling Method for Episodic Reinforcement Learning

Aug 23, 2022
Christoph Dann, Mehryar Mohri, Tong Zhang, Julian Zimmert

Via arXiv

A Best-of-Both-Worlds Algorithm for Bandits with Delayed Feedback

Jun 29, 2022
Saeed Masoudian, Julian Zimmert, Yevgeny Seldin

Via arXiv