Keith W. Ross

Aggressive Q-Learning with Ensembles: Achieving Both High Sample Efficiency and High Asymptotic Performance

Nov 17, 2021
Yanqiu Wu, Xinyue Chen, Che Wang, Yiming Zhang, Zijian Zhou, Keith W. Ross

On-Policy Deep Reinforcement Learning for the Average-Reward Criterion

Jun 14, 2021
Yiming Zhang, Keith W. Ross

First Order Optimization in Policy Space for Constrained Deep Reinforcement Learning

Feb 16, 2020
Yiming Zhang, Quan Vuong, Keith W. Ross

Supervised Policy Update for Deep Reinforcement Learning

Dec 24, 2018
Quan Vuong, Yiming Zhang, Keith W. Ross

Efficient Entropy for Policy Gradient with Multidimensional Action Space

Jun 02, 2018
Yiming Zhang, Quan Ho Vuong, Kenny Song, Xiao-Yue Gong, Keith W. Ross
