Alert button
Picture for Michael L. Littman

Michael L. Littman

Alert button

A Domain-Agnostic Approach for Characterization of Lifelong Learning Systems

Add code
Bookmark button
Alert button
Jan 18, 2023
Megan M. Baker, Alexander New, Mario Aguilar-Simon, Ziad Al-Halah, Sébastien M. R. Arnold, Ese Ben-Iwhiwhu, Andrew P. Brna, Ethan Brooks, Ryan C. Brown, Zachary Daniels, Anurag Daram, Fabien Delattre, Ryan Dellana, Eric Eaton, Haotian Fu, Kristen Grauman, Jesse Hostetler, Shariq Iqbal, Cassandra Kent, Nicholas Ketz, Soheil Kolouri, George Konidaris, Dhireesha Kudithipudi, Erik Learned-Miller, Seungwon Lee, Michael L. Littman, Sandeep Madireddy, Jorge A. Mendez, Eric Q. Nguyen, Christine D. Piatko, Praveen K. Pilly, Aswin Raghavan, Abrar Rahman, Santhosh Kumar Ramakrishnan, Neale Ratzlaff, Andrea Soltoggio, Peter Stone, Indranil Sur, Zhipeng Tang, Saket Tiwari, Kyle Vedder, Felix Wang, Zifan Xu, Angel Yanguas-Gil, Harel Yedidsion, Shangqun Yu, Gautam K. Vallabha

Figure 1 for A Domain-Agnostic Approach for Characterization of Lifelong Learning Systems
Figure 2 for A Domain-Agnostic Approach for Characterization of Lifelong Learning Systems
Figure 3 for A Domain-Agnostic Approach for Characterization of Lifelong Learning Systems
Figure 4 for A Domain-Agnostic Approach for Characterization of Lifelong Learning Systems
Viaarxiv icon

Specifying Behavior Preference with Tiered Reward Functions

Add code
Bookmark button
Alert button
Dec 07, 2022
Zhiyuan Zhou, Henry Sowerby, Michael L. Littman

Figure 1 for Specifying Behavior Preference with Tiered Reward Functions
Figure 2 for Specifying Behavior Preference with Tiered Reward Functions
Figure 3 for Specifying Behavior Preference with Tiered Reward Functions
Figure 4 for Specifying Behavior Preference with Tiered Reward Functions
Viaarxiv icon

Evaluation Beyond Task Performance: Analyzing Concepts in AlphaZero in Hex

Add code
Bookmark button
Alert button
Nov 26, 2022
Charles Lovering, Jessica Zosa Forde, George Konidaris, Ellie Pavlick, Michael L. Littman

Figure 1 for Evaluation Beyond Task Performance: Analyzing Concepts in AlphaZero in Hex
Figure 2 for Evaluation Beyond Task Performance: Analyzing Concepts in AlphaZero in Hex
Figure 3 for Evaluation Beyond Task Performance: Analyzing Concepts in AlphaZero in Hex
Figure 4 for Evaluation Beyond Task Performance: Analyzing Concepts in AlphaZero in Hex
Viaarxiv icon

Reward-Predictive Clustering

Add code
Bookmark button
Alert button
Nov 07, 2022
Lucas Lehnert, Michael J. Frank, Michael L. Littman

Figure 1 for Reward-Predictive Clustering
Figure 2 for Reward-Predictive Clustering
Figure 3 for Reward-Predictive Clustering
Figure 4 for Reward-Predictive Clustering
Viaarxiv icon

Gathering Strength, Gathering Storms: The One Hundred Year Study on Artificial Intelligence (AI100) 2021 Study Panel Report

Add code
Bookmark button
Alert button
Oct 27, 2022
Michael L. Littman, Ifeoma Ajunwa, Guy Berger, Craig Boutilier, Morgan Currie, Finale Doshi-Velez, Gillian Hadfield, Michael C. Horowitz, Charles Isbell, Hiroaki Kitano, Karen Levy, Terah Lyons, Melanie Mitchell, Julie Shah, Steven Sloman, Shannon Vallor, Toby Walsh

Viaarxiv icon

Designing Rewards for Fast Learning

Add code
Bookmark button
Alert button
May 30, 2022
Henry Sowerby, Zhiyuan Zhou, Michael L. Littman

Figure 1 for Designing Rewards for Fast Learning
Figure 2 for Designing Rewards for Fast Learning
Figure 3 for Designing Rewards for Fast Learning
Figure 4 for Designing Rewards for Fast Learning
Viaarxiv icon

Deep Q-Network with Proximal Iteration

Add code
Bookmark button
Alert button
Dec 10, 2021
Kavosh Asadi, Rasool Fakoor, Omer Gottesman, Michael L. Littman, Alexander J. Smola

Figure 1 for Deep Q-Network with Proximal Iteration
Figure 2 for Deep Q-Network with Proximal Iteration
Figure 3 for Deep Q-Network with Proximal Iteration
Figure 4 for Deep Q-Network with Proximal Iteration
Viaarxiv icon

On the Expressivity of Markov Reward

Add code
Bookmark button
Alert button
Nov 01, 2021
David Abel, Will Dabney, Anna Harutyunyan, Mark K. Ho, Michael L. Littman, Doina Precup, Satinder Singh

Figure 1 for On the Expressivity of Markov Reward
Figure 2 for On the Expressivity of Markov Reward
Figure 3 for On the Expressivity of Markov Reward
Figure 4 for On the Expressivity of Markov Reward
Viaarxiv icon

Bad-Policy Density: A Measure of Reinforcement Learning Hardness

Add code
Bookmark button
Alert button
Oct 07, 2021
David Abel, Cameron Allen, Dilip Arumugam, D. Ellis Hershkowitz, Michael L. Littman, Lawson L. S. Wong

Figure 1 for Bad-Policy Density: A Measure of Reinforcement Learning Hardness
Figure 2 for Bad-Policy Density: A Measure of Reinforcement Learning Hardness
Figure 3 for Bad-Policy Density: A Measure of Reinforcement Learning Hardness
Figure 4 for Bad-Policy Density: A Measure of Reinforcement Learning Hardness
Viaarxiv icon

Convergence of a Human-in-the-Loop Policy-Gradient Algorithm With Eligibility Trace Under Reward, Policy, and Advantage Feedback

Add code
Bookmark button
Alert button
Sep 15, 2021
Ishaan Shah, David Halpern, Kavosh Asadi, Michael L. Littman

Figure 1 for Convergence of a Human-in-the-Loop Policy-Gradient Algorithm With Eligibility Trace Under Reward, Policy, and Advantage Feedback
Figure 2 for Convergence of a Human-in-the-Loop Policy-Gradient Algorithm With Eligibility Trace Under Reward, Policy, and Advantage Feedback
Viaarxiv icon