Alert button
Picture for Yangchen Pan

Yangchen Pan

Alert button

A Simple Mixture Policy Parameterization for Improving Sample Efficiency of CVaR Optimization

Add code
Bookmark button
Alert button
Mar 20, 2024
Yudong Luo, Yangchen Pan, Han Wang, Philip Torr, Pascal Poupart

Figure 1 for A Simple Mixture Policy Parameterization for Improving Sample Efficiency of CVaR Optimization
Figure 2 for A Simple Mixture Policy Parameterization for Improving Sample Efficiency of CVaR Optimization
Figure 3 for A Simple Mixture Policy Parameterization for Improving Sample Efficiency of CVaR Optimization
Figure 4 for A Simple Mixture Policy Parameterization for Improving Sample Efficiency of CVaR Optimization
Viaarxiv icon

Improving Adversarial Transferability via Model Alignment

Add code
Bookmark button
Alert button
Nov 30, 2023
Avery Ma, Amir-massoud Farahmand, Yangchen Pan, Philip Torr, Jindong Gu

Viaarxiv icon

Understanding the robustness difference between stochastic gradient descent and adaptive gradient methods

Add code
Bookmark button
Alert button
Aug 13, 2023
Avery Ma, Yangchen Pan, Amir-massoud Farahmand

Figure 1 for Understanding the robustness difference between stochastic gradient descent and adaptive gradient methods
Figure 2 for Understanding the robustness difference between stochastic gradient descent and adaptive gradient methods
Figure 3 for Understanding the robustness difference between stochastic gradient descent and adaptive gradient methods
Figure 4 for Understanding the robustness difference between stochastic gradient descent and adaptive gradient methods
Viaarxiv icon

An Alternative to Variance: Gini Deviation for Risk-averse Policy Gradient

Add code
Bookmark button
Alert button
Aug 09, 2023
Yudong Luo, Guiliang Liu, Pascal Poupart, Yangchen Pan

Figure 1 for An Alternative to Variance: Gini Deviation for Risk-averse Policy Gradient
Figure 2 for An Alternative to Variance: Gini Deviation for Risk-averse Policy Gradient
Figure 3 for An Alternative to Variance: Gini Deviation for Risk-averse Policy Gradient
Figure 4 for An Alternative to Variance: Gini Deviation for Risk-averse Policy Gradient
Viaarxiv icon

Conditionally Optimistic Exploration for Cooperative Deep Multi-Agent Reinforcement Learning

Add code
Bookmark button
Alert button
Mar 16, 2023
Xutong Zhao, Yangchen Pan, Chenjun Xiao, Sarath Chandar, Janarthanan Rajendran

Figure 1 for Conditionally Optimistic Exploration for Cooperative Deep Multi-Agent Reinforcement Learning
Figure 2 for Conditionally Optimistic Exploration for Cooperative Deep Multi-Agent Reinforcement Learning
Figure 3 for Conditionally Optimistic Exploration for Cooperative Deep Multi-Agent Reinforcement Learning
Figure 4 for Conditionally Optimistic Exploration for Cooperative Deep Multi-Agent Reinforcement Learning
Viaarxiv icon

The In-Sample Softmax for Offline Reinforcement Learning

Add code
Bookmark button
Alert button
Feb 28, 2023
Chenjun Xiao, Han Wang, Yangchen Pan, Adam White, Martha White

Figure 1 for The In-Sample Softmax for Offline Reinforcement Learning
Figure 2 for The In-Sample Softmax for Offline Reinforcement Learning
Figure 3 for The In-Sample Softmax for Offline Reinforcement Learning
Figure 4 for The In-Sample Softmax for Offline Reinforcement Learning
Viaarxiv icon

Label Alignment Regularization for Distribution Shift

Add code
Bookmark button
Alert button
Nov 27, 2022
Ehsan Imani, Guojun Zhang, Jun Luo, Pascal Poupart, Yangchen Pan

Figure 1 for Label Alignment Regularization for Distribution Shift
Figure 2 for Label Alignment Regularization for Distribution Shift
Figure 3 for Label Alignment Regularization for Distribution Shift
Figure 4 for Label Alignment Regularization for Distribution Shift
Viaarxiv icon

Memory-efficient Reinforcement Learning with Knowledge Consolidation

Add code
Bookmark button
Alert button
May 22, 2022
Qingfeng Lan, Yangchen Pan, Jun Luo, A. Rupam Mahmood

Figure 1 for Memory-efficient Reinforcement Learning with Knowledge Consolidation
Figure 2 for Memory-efficient Reinforcement Learning with Knowledge Consolidation
Figure 3 for Memory-efficient Reinforcement Learning with Knowledge Consolidation
Figure 4 for Memory-efficient Reinforcement Learning with Knowledge Consolidation
Viaarxiv icon

An Alternate Policy Gradient Estimator for Softmax Policies

Add code
Bookmark button
Alert button
Dec 22, 2021
Shivam Garg, Samuele Tosatto, Yangchen Pan, Martha White, A. Rupam Mahmood

Figure 1 for An Alternate Policy Gradient Estimator for Softmax Policies
Figure 2 for An Alternate Policy Gradient Estimator for Softmax Policies
Figure 3 for An Alternate Policy Gradient Estimator for Softmax Policies
Figure 4 for An Alternate Policy Gradient Estimator for Softmax Policies
Viaarxiv icon