Alert button
Picture for Tianhe Yu

Tianhe Yu

Alert button

COMBO: Conservative Offline Model-Based Policy Optimization

Feb 16, 2021
Tianhe Yu, Aviral Kumar, Rafael Rafailov, Aravind Rajeswaran, Sergey Levine, Chelsea Finn

Figure 1 for COMBO: Conservative Offline Model-Based Policy Optimization
Figure 2 for COMBO: Conservative Offline Model-Based Policy Optimization
Figure 3 for COMBO: Conservative Offline Model-Based Policy Optimization
Figure 4 for COMBO: Conservative Offline Model-Based Policy Optimization
Viaarxiv icon

Offline Reinforcement Learning from Images with Latent Space Models

Dec 21, 2020
Rafael Rafailov, Tianhe Yu, Aravind Rajeswaran, Chelsea Finn

Figure 1 for Offline Reinforcement Learning from Images with Latent Space Models
Figure 2 for Offline Reinforcement Learning from Images with Latent Space Models
Figure 3 for Offline Reinforcement Learning from Images with Latent Space Models
Figure 4 for Offline Reinforcement Learning from Images with Latent Space Models
Viaarxiv icon

Variable-Shot Adaptation for Online Meta-Learning

Dec 14, 2020
Tianhe Yu, Xinyang Geng, Chelsea Finn, Sergey Levine

Figure 1 for Variable-Shot Adaptation for Online Meta-Learning
Figure 2 for Variable-Shot Adaptation for Online Meta-Learning
Figure 3 for Variable-Shot Adaptation for Online Meta-Learning
Figure 4 for Variable-Shot Adaptation for Online Meta-Learning
Viaarxiv icon

Measuring and Harnessing Transference in Multi-Task Learning

Oct 29, 2020
Christopher Fifty, Ehsan Amid, Zhe Zhao, Tianhe Yu, Rohan Anil, Chelsea Finn

Figure 1 for Measuring and Harnessing Transference in Multi-Task Learning
Figure 2 for Measuring and Harnessing Transference in Multi-Task Learning
Figure 3 for Measuring and Harnessing Transference in Multi-Task Learning
Figure 4 for Measuring and Harnessing Transference in Multi-Task Learning
Viaarxiv icon

MOPO: Model-based Offline Policy Optimization

May 27, 2020
Tianhe Yu, Garrett Thomas, Lantao Yu, Stefano Ermon, James Zou, Sergey Levine, Chelsea Finn, Tengyu Ma

Figure 1 for MOPO: Model-based Offline Policy Optimization
Figure 2 for MOPO: Model-based Offline Policy Optimization
Figure 3 for MOPO: Model-based Offline Policy Optimization
Figure 4 for MOPO: Model-based Offline Policy Optimization
Viaarxiv icon

Gradient Surgery for Multi-Task Learning

Jan 19, 2020
Tianhe Yu, Saurabh Kumar, Abhishek Gupta, Sergey Levine, Karol Hausman, Chelsea Finn

Figure 1 for Gradient Surgery for Multi-Task Learning
Figure 2 for Gradient Surgery for Multi-Task Learning
Figure 3 for Gradient Surgery for Multi-Task Learning
Figure 4 for Gradient Surgery for Multi-Task Learning
Viaarxiv icon

Meta-Inverse Reinforcement Learning with Probabilistic Context Variables

Oct 26, 2019
Lantao Yu, Tianhe Yu, Chelsea Finn, Stefano Ermon

Figure 1 for Meta-Inverse Reinforcement Learning with Probabilistic Context Variables
Figure 2 for Meta-Inverse Reinforcement Learning with Probabilistic Context Variables
Figure 3 for Meta-Inverse Reinforcement Learning with Probabilistic Context Variables
Figure 4 for Meta-Inverse Reinforcement Learning with Probabilistic Context Variables
Viaarxiv icon

Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning

Oct 24, 2019
Tianhe Yu, Deirdre Quillen, Zhanpeng He, Ryan Julian, Karol Hausman, Chelsea Finn, Sergey Levine

Figure 1 for Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning
Figure 2 for Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning
Figure 3 for Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning
Figure 4 for Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning
Viaarxiv icon

Unsupervised Visuomotor Control through Distributional Planning Networks

Feb 14, 2019
Tianhe Yu, Gleb Shevchuk, Dorsa Sadigh, Chelsea Finn

Figure 1 for Unsupervised Visuomotor Control through Distributional Planning Networks
Figure 2 for Unsupervised Visuomotor Control through Distributional Planning Networks
Figure 3 for Unsupervised Visuomotor Control through Distributional Planning Networks
Figure 4 for Unsupervised Visuomotor Control through Distributional Planning Networks
Viaarxiv icon