Picture for Michael L. Littman

Michael L. Littman

Rutgers University

Bad-Policy Density: A Measure of Reinforcement Learning Hardness

Add code
Oct 07, 2021
Figure 1 for Bad-Policy Density: A Measure of Reinforcement Learning Hardness
Figure 2 for Bad-Policy Density: A Measure of Reinforcement Learning Hardness
Figure 3 for Bad-Policy Density: A Measure of Reinforcement Learning Hardness
Figure 4 for Bad-Policy Density: A Measure of Reinforcement Learning Hardness
Viaarxiv icon

Convergence of a Human-in-the-Loop Policy-Gradient Algorithm With Eligibility Trace Under Reward, Policy, and Advantage Feedback

Add code
Sep 15, 2021
Figure 1 for Convergence of a Human-in-the-Loop Policy-Gradient Algorithm With Eligibility Trace Under Reward, Policy, and Advantage Feedback
Figure 2 for Convergence of a Human-in-the-Loop Policy-Gradient Algorithm With Eligibility Trace Under Reward, Policy, and Advantage Feedback
Viaarxiv icon

Brittle AI, Causal Confusion, and Bad Mental Models: Challenges and Successes in the XAI Program

Add code
Jun 10, 2021
Figure 1 for Brittle AI, Causal Confusion, and Bad Mental Models: Challenges and Successes in the XAI Program
Figure 2 for Brittle AI, Causal Confusion, and Bad Mental Models: Challenges and Successes in the XAI Program
Figure 3 for Brittle AI, Causal Confusion, and Bad Mental Models: Challenges and Successes in the XAI Program
Figure 4 for Brittle AI, Causal Confusion, and Bad Mental Models: Challenges and Successes in the XAI Program
Viaarxiv icon

Control of mental representations in human planning

Add code
May 14, 2021
Figure 1 for Control of mental representations in human planning
Figure 2 for Control of mental representations in human planning
Figure 3 for Control of mental representations in human planning
Viaarxiv icon

Towards Sample Efficient Agents through Algorithmic Alignment

Add code
Sep 08, 2020
Figure 1 for Towards Sample Efficient Agents through Algorithmic Alignment
Figure 2 for Towards Sample Efficient Agents through Algorithmic Alignment
Figure 3 for Towards Sample Efficient Agents through Algorithmic Alignment
Viaarxiv icon

The Efficiency of Human Cognition Reflects Planned Information Processing

Add code
Feb 13, 2020
Figure 1 for The Efficiency of Human Cognition Reflects Planned Information Processing
Figure 2 for The Efficiency of Human Cognition Reflects Planned Information Processing
Figure 3 for The Efficiency of Human Cognition Reflects Planned Information Processing
Figure 4 for The Efficiency of Human Cognition Reflects Planned Information Processing
Viaarxiv icon

Learning State Abstractions for Transfer in Continuous Control

Add code
Feb 08, 2020
Figure 1 for Learning State Abstractions for Transfer in Continuous Control
Figure 2 for Learning State Abstractions for Transfer in Continuous Control
Figure 3 for Learning State Abstractions for Transfer in Continuous Control
Viaarxiv icon

Deep RBF Value Functions for Continuous Control

Add code
Feb 05, 2020
Figure 1 for Deep RBF Value Functions for Continuous Control
Figure 2 for Deep RBF Value Functions for Continuous Control
Figure 3 for Deep RBF Value Functions for Continuous Control
Figure 4 for Deep RBF Value Functions for Continuous Control
Viaarxiv icon

Lipschitz Lifelong Reinforcement Learning

Add code
Jan 17, 2020
Figure 1 for Lipschitz Lifelong Reinforcement Learning
Figure 2 for Lipschitz Lifelong Reinforcement Learning
Figure 3 for Lipschitz Lifelong Reinforcement Learning
Figure 4 for Lipschitz Lifelong Reinforcement Learning
Viaarxiv icon

Individual predictions matter: Assessing the effect of data ordering in training fine-tuned CNNs for medical imaging

Add code
Dec 08, 2019
Figure 1 for Individual predictions matter: Assessing the effect of data ordering in training fine-tuned CNNs for medical imaging
Figure 2 for Individual predictions matter: Assessing the effect of data ordering in training fine-tuned CNNs for medical imaging
Figure 3 for Individual predictions matter: Assessing the effect of data ordering in training fine-tuned CNNs for medical imaging
Figure 4 for Individual predictions matter: Assessing the effect of data ordering in training fine-tuned CNNs for medical imaging
Viaarxiv icon