Jiafan He

Horizon-free Reinforcement Learning in Adversarial Linear Mixture MDPs

May 15, 2023

Cooperative Multi-Agent Reinforcement Learning: Asynchronous Communication and Linear Function Approximation

May 12, 2023

On the Interplay Between Misspecification and Sub-optimality Gap in Linear Contextual Bandits

Mar 16, 2023

Variance-Dependent Regret Bounds for Linear Bandits and Reinforcement Learning: Adaptivity and Computational Efficiency

Feb 21, 2023

Nearly Minimax Optimal Reinforcement Learning for Linear Markov Decision Processes

Dec 12, 2022

A Simple and Provably Efficient Algorithm for Asynchronous Federated Contextual Linear Bandits

Jul 07, 2022

Nearly Optimal Algorithms for Linear Contextual Bandits with Adversarial Corruptions

May 13, 2022

Bandit Learning with General Function Classes: Heteroscedastic Noise and Variance-dependent Regret Bounds

Feb 28, 2022

Learning Stochastic Shortest Path with Linear Function Approximation

Oct 25, 2021

Locally Differentially Private Reinforcement Learning for Linear Mixture Markov Decision Processes

Oct 19, 2021