Alert button
Picture for Jiantao Jiao

Jiantao Jiao

Alert button

Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of Pessimism

Mar 22, 2021
Paria Rashidinejad, Banghua Zhu, Cong Ma, Jiantao Jiao, Stuart Russell

Figure 1 for Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of Pessimism
Figure 2 for Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of Pessimism
Figure 3 for Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of Pessimism
Figure 4 for Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of Pessimism
Viaarxiv icon

Provably Breaking the Quadratic Error Compounding Barrier in Imitation Learning, Optimally

Feb 25, 2021
Nived Rajaraman, Yanjun Han, Lin F. Yang, Kannan Ramchandran, Jiantao Jiao

Figure 1 for Provably Breaking the Quadratic Error Compounding Barrier in Imitation Learning, Optimally
Figure 2 for Provably Breaking the Quadratic Error Compounding Barrier in Imitation Learning, Optimally
Viaarxiv icon

Minimax Off-Policy Evaluation for Multi-Armed Bandits

Jan 19, 2021
Cong Ma, Banghua Zhu, Jiantao Jiao, Martin J. Wainwright

Figure 1 for Minimax Off-Policy Evaluation for Multi-Armed Bandits
Figure 2 for Minimax Off-Policy Evaluation for Multi-Armed Bandits
Figure 3 for Minimax Off-Policy Evaluation for Multi-Armed Bandits
Figure 4 for Minimax Off-Policy Evaluation for Multi-Armed Bandits
Viaarxiv icon

Linear Representation Meta-Reinforcement Learning for Instant Adaptation

Jan 12, 2021
Matt Peng, Banghua Zhu, Jiantao Jiao

Figure 1 for Linear Representation Meta-Reinforcement Learning for Instant Adaptation
Figure 2 for Linear Representation Meta-Reinforcement Learning for Instant Adaptation
Figure 3 for Linear Representation Meta-Reinforcement Learning for Instant Adaptation
Figure 4 for Linear Representation Meta-Reinforcement Learning for Instant Adaptation
Viaarxiv icon

SLIP: Learning to Predict in Unknown Dynamical Systems with Long-Term Memory

Oct 12, 2020
Paria Rashidinejad, Jiantao Jiao, Stuart Russell

Figure 1 for SLIP: Learning to Predict in Unknown Dynamical Systems with Long-Term Memory
Figure 2 for SLIP: Learning to Predict in Unknown Dynamical Systems with Long-Term Memory
Figure 3 for SLIP: Learning to Predict in Unknown Dynamical Systems with Long-Term Memory
Viaarxiv icon

Toward the Fundamental Limits of Imitation Learning

Sep 13, 2020
Nived Rajaraman, Lin F. Yang, Jiantao Jiao, Kannan Ramachandran

Figure 1 for Toward the Fundamental Limits of Imitation Learning
Figure 2 for Toward the Fundamental Limits of Imitation Learning
Figure 3 for Toward the Fundamental Limits of Imitation Learning
Figure 4 for Toward the Fundamental Limits of Imitation Learning
Viaarxiv icon

Robust estimation via generalized quasi-gradients

May 28, 2020
Banghua Zhu, Jiantao Jiao, Jacob Steinhardt

Figure 1 for Robust estimation via generalized quasi-gradients
Viaarxiv icon

When does the Tukey median work?

Jan 21, 2020
Banghua Zhu, Jiantao Jiao, Jacob Steinhardt

Figure 1 for When does the Tukey median work?
Figure 2 for When does the Tukey median work?
Viaarxiv icon

Generalized Resilience and Robust Statistics

Sep 19, 2019
Banghua Zhu, Jiantao Jiao, Jacob Steinhardt

Figure 1 for Generalized Resilience and Robust Statistics
Figure 2 for Generalized Resilience and Robust Statistics
Figure 3 for Generalized Resilience and Robust Statistics
Figure 4 for Generalized Resilience and Robust Statistics
Viaarxiv icon

Deconstructing Generative Adversarial Networks

Jan 27, 2019
Banghua Zhu, Jiantao Jiao, David Tse

Figure 1 for Deconstructing Generative Adversarial Networks
Figure 2 for Deconstructing Generative Adversarial Networks
Figure 3 for Deconstructing Generative Adversarial Networks
Figure 4 for Deconstructing Generative Adversarial Networks
Viaarxiv icon