Alert button
Picture for Jiantao Jiao

Jiantao Jiao

Alert button

Optimal Conservative Offline RL with General Function Approximation via Augmented Lagrangian

Add code
Bookmark button
Alert button
Nov 01, 2022
Paria Rashidinejad, Hanlin Zhu, Kunhe Yang, Stuart Russell, Jiantao Jiao

Figure 1 for Optimal Conservative Offline RL with General Function Approximation via Augmented Lagrangian
Viaarxiv icon

Beyond the Best: Estimating Distribution Functionals in Infinite-Armed Bandits

Add code
Bookmark button
Alert button
Nov 01, 2022
Yifei Wang, Tavor Baharav, Yanjun Han, Jiantao Jiao, David Tse

Figure 1 for Beyond the Best: Estimating Distribution Functionals in Infinite-Armed Bandits
Figure 2 for Beyond the Best: Estimating Distribution Functionals in Infinite-Armed Bandits
Figure 3 for Beyond the Best: Estimating Distribution Functionals in Infinite-Armed Bandits
Figure 4 for Beyond the Best: Estimating Distribution Functionals in Infinite-Armed Bandits
Viaarxiv icon

Minimax Optimal Online Imitation Learning via Replay Estimation

Add code
Bookmark button
Alert button
Jun 02, 2022
Gokul Swamy, Nived Rajaraman, Matthew Peng, Sanjiban Choudhury, J. Andrew Bagnell, Zhiwei Steven Wu, Jiantao Jiao, Kannan Ramchandran

Figure 1 for Minimax Optimal Online Imitation Learning via Replay Estimation
Figure 2 for Minimax Optimal Online Imitation Learning via Replay Estimation
Figure 3 for Minimax Optimal Online Imitation Learning via Replay Estimation
Figure 4 for Minimax Optimal Online Imitation Learning via Replay Estimation
Viaarxiv icon

Byzantine-Robust Federated Learning with Optimal Statistical Rates and Privacy Guarantees

Add code
Bookmark button
Alert button
May 24, 2022
Banghua Zhu, Lun Wang, Qi Pang, Shuai Wang, Jiantao Jiao, Dawn Song, Michael I. Jordan

Figure 1 for Byzantine-Robust Federated Learning with Optimal Statistical Rates and Privacy Guarantees
Figure 2 for Byzantine-Robust Federated Learning with Optimal Statistical Rates and Privacy Guarantees
Figure 3 for Byzantine-Robust Federated Learning with Optimal Statistical Rates and Privacy Guarantees
Viaarxiv icon

Jump-Start Reinforcement Learning

Add code
Bookmark button
Alert button
Apr 05, 2022
Ikechukwu Uchendu, Ted Xiao, Yao Lu, Banghua Zhu, Mengyuan Yan, Joséphine Simon, Matthew Bennice, Chuyuan Fu, Cong Ma, Jiantao Jiao, Sergey Levine, Karol Hausman

Figure 1 for Jump-Start Reinforcement Learning
Figure 2 for Jump-Start Reinforcement Learning
Figure 3 for Jump-Start Reinforcement Learning
Figure 4 for Jump-Start Reinforcement Learning
Viaarxiv icon

Robust Estimation for Nonparametric Families via Generative Adversarial Networks

Add code
Bookmark button
Alert button
Feb 02, 2022
Banghua Zhu, Jiantao Jiao, Michael I. Jordan

Viaarxiv icon

Nearly Optimal Policy Optimization with Stable at Any Time Guarantee

Add code
Bookmark button
Alert button
Dec 22, 2021
Tianhao Wu, Yunchang Yang, Han Zhong, Liwei Wang, Simon S. Du, Jiantao Jiao

Figure 1 for Nearly Optimal Policy Optimization with Stable at Any Time Guarantee
Viaarxiv icon

Computational Benefits of Intermediate Rewards for Hierarchical Planning

Add code
Bookmark button
Alert button
Jul 08, 2021
Yuexiang Zhai, Christina Baek, Zhengyuan Zhou, Jiantao Jiao, Yi Ma

Figure 1 for Computational Benefits of Intermediate Rewards for Hierarchical Planning
Figure 2 for Computational Benefits of Intermediate Rewards for Hierarchical Planning
Figure 3 for Computational Benefits of Intermediate Rewards for Hierarchical Planning
Figure 4 for Computational Benefits of Intermediate Rewards for Hierarchical Planning
Viaarxiv icon

MADE: Exploration via Maximizing Deviation from Explored Regions

Add code
Bookmark button
Alert button
Jun 18, 2021
Tianjun Zhang, Paria Rashidinejad, Jiantao Jiao, Yuandong Tian, Joseph Gonzalez, Stuart Russell

Figure 1 for MADE: Exploration via Maximizing Deviation from Explored Regions
Figure 2 for MADE: Exploration via Maximizing Deviation from Explored Regions
Figure 3 for MADE: Exploration via Maximizing Deviation from Explored Regions
Figure 4 for MADE: Exploration via Maximizing Deviation from Explored Regions
Viaarxiv icon