Tengyu Xu

Provably Efficient Offline Reinforcement Learning with Trajectory-Wise Reward

Jun 13, 2022
Tengyu Xu, Yingbin Liang

Model-Based Offline Meta-Reinforcement Learning with Regularization

Feb 07, 2022
Sen Lin, Jialin Wan, Tengyu Xu, Yingbin Liang, Junshan Zhang

Faster Algorithm and Sharper Analysis for Constrained Markov Decision Process

Oct 20, 2021
Tianjiao Li, Ziwei Guan, Shaofeng Zou, Tengyu Xu, Yingbin Liang, Guanghui Lan

PER-ETD: A Polynomially Efficient Emphatic Temporal Difference Learning Method

Oct 13, 2021
Ziwei Guan, Tengyu Xu, Yingbin Liang

A Unified Off-Policy Evaluation Approach for General Value Function

Jul 06, 2021
Tengyu Xu, Zhuoran Yang, Zhaoran Wang, Yingbin Liang

Doubly Robust Off-Policy Actor-Critic: Convergence and Optimality

Feb 27, 2021
Tengyu Xu, Zhuoran Yang, Zhaoran Wang, Yingbin Liang

Proximal Gradient Descent-Ascent: Variable Convergence under KŁ Geometry

Feb 17, 2021
Ziyi Chen, Yi Zhou, Tengyu Xu, Yingbin Liang

A Primal Approach to Constrained Policy Optimization: Global Optimality and Finite-Time Analysis

Nov 17, 2020
Tengyu Xu, Yingbin Liang, Guanghui Lan
