Alert button
Picture for Yuping Luo

Yuping Luo

Alert button

Safe Reinforcement Learning by Imagining the Near Future

Add code
Bookmark button
Alert button
Feb 15, 2022
Garrett Thomas, Yuping Luo, Tengyu Ma

Figure 1 for Safe Reinforcement Learning by Imagining the Near Future
Figure 2 for Safe Reinforcement Learning by Imagining the Near Future
Figure 3 for Safe Reinforcement Learning by Imagining the Near Future
Figure 4 for Safe Reinforcement Learning by Imagining the Near Future
Viaarxiv icon

Learning Barrier Certificates: Towards Safe Reinforcement Learning with Zero Training-time Violations

Add code
Bookmark button
Alert button
Aug 04, 2021
Yuping Luo, Tengyu Ma

Figure 1 for Learning Barrier Certificates: Towards Safe Reinforcement Learning with Zero Training-time Violations
Figure 2 for Learning Barrier Certificates: Towards Safe Reinforcement Learning with Zero Training-time Violations
Figure 3 for Learning Barrier Certificates: Towards Safe Reinforcement Learning with Zero Training-time Violations
Viaarxiv icon

Towards Learning to Play Piano with Dexterous Hands and Touch

Add code
Bookmark button
Alert button
Jun 08, 2021
Huazhe Xu, Yuping Luo, Shaoxiong Wang, Trevor Darrell, Roberto Calandra

Figure 1 for Towards Learning to Play Piano with Dexterous Hands and Touch
Figure 2 for Towards Learning to Play Piano with Dexterous Hands and Touch
Figure 3 for Towards Learning to Play Piano with Dexterous Hands and Touch
Figure 4 for Towards Learning to Play Piano with Dexterous Hands and Touch
Viaarxiv icon

Towards Resolving the Implicit Bias of Gradient Descent for Matrix Factorization: Greedy Low-Rank Learning

Add code
Bookmark button
Alert button
Dec 17, 2020
Zhiyuan Li, Yuping Luo, Kaifeng Lyu

Figure 1 for Towards Resolving the Implicit Bias of Gradient Descent for Matrix Factorization: Greedy Low-Rank Learning
Figure 2 for Towards Resolving the Implicit Bias of Gradient Descent for Matrix Factorization: Greedy Low-Rank Learning
Figure 3 for Towards Resolving the Implicit Bias of Gradient Descent for Matrix Factorization: Greedy Low-Rank Learning
Figure 4 for Towards Resolving the Implicit Bias of Gradient Descent for Matrix Factorization: Greedy Low-Rank Learning
Viaarxiv icon

Provable Representation Learning for Imitation Learning via Bi-level Optimization

Add code
Bookmark button
Alert button
Feb 24, 2020
Sanjeev Arora, Simon S. Du, Sham Kakade, Yuping Luo, Nikunj Saunshi

Figure 1 for Provable Representation Learning for Imitation Learning via Bi-level Optimization
Figure 2 for Provable Representation Learning for Imitation Learning via Bi-level Optimization
Figure 3 for Provable Representation Learning for Imitation Learning via Bi-level Optimization
Viaarxiv icon

Bootstrapping the Expressivity with Model-based Planning

Add code
Bookmark button
Alert button
Oct 14, 2019
Kefan Dong, Yuping Luo, Tengyu Ma

Figure 1 for Bootstrapping the Expressivity with Model-based Planning
Figure 2 for Bootstrapping the Expressivity with Model-based Planning
Figure 3 for Bootstrapping the Expressivity with Model-based Planning
Figure 4 for Bootstrapping the Expressivity with Model-based Planning
Viaarxiv icon

Learning Self-Correctable Policies and Value Functions from Demonstrations with Negative Sampling

Add code
Bookmark button
Alert button
Aug 01, 2019
Yuping Luo, Huazhe Xu, Tengyu Ma

Figure 1 for Learning Self-Correctable Policies and Value Functions from Demonstrations with Negative Sampling
Figure 2 for Learning Self-Correctable Policies and Value Functions from Demonstrations with Negative Sampling
Figure 3 for Learning Self-Correctable Policies and Value Functions from Demonstrations with Negative Sampling
Viaarxiv icon

Provably Efficient $Q$-learning with Function Approximation via Distribution Shift Error Checking Oracle

Add code
Bookmark button
Alert button
Jun 14, 2019
Simon S. Du, Yuping Luo, Ruosong Wang, Hanrui Zhang

Viaarxiv icon