Scott M. Jordan

From Past to Future: Rethinking Eligibility Traces

Dec 20, 2023
Dhawal Gupta, Scott M. Jordan, Shreyas Chaudhari, Bo Liu, Philip S. Thomas, Bruno Castro da Silva

Via arXiv

Behavior Alignment via Reward Function Optimization

Oct 31, 2023
Dhawal Gupta, Yash Chandak, Scott M. Jordan, Philip S. Thomas, Bruno Castro da Silva

Via arXiv

Coagent Networks: Generalized and Scaled

May 16, 2023
James E. Kostas, Scott M. Jordan, Yash Chandak, Georgios Theocharous, Dhawal Gupta, Martha White, Bruno Castro da Silva, Philip S. Thomas

Via arXiv

Avoiding Model Estimation in Robust Markov Decision Processes with a Generative Model

Feb 02, 2023
Wenhao Yang, Han Wang, Tadashi Kozuno, Scott M. Jordan, Zhihua Zhang

Via arXiv

Towards Safe Policy Improvement for Non-Stationary MDPs

Oct 23, 2020
Yash Chandak, Scott M. Jordan, Georgios Theocharous, Martha White, Philip S. Thomas

Via arXiv

Evaluating the Performance of Reinforcement Learning Algorithms

Jun 30, 2020
Scott M. Jordan, Yash Chandak, Daniel Cohen, Mengxue Zhang, Philip S. Thomas

Via arXiv

Classical Policy Gradient: Preserving Bellman's Principle of Optimality

Jun 06, 2019
Philip S. Thomas, Scott M. Jordan, Yash Chandak, Chris Nota, James Kostas

Via arXiv