Picture for Michael L. Littman

Michael L. Littman

Rutgers University

Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy

Add code
Jul 10, 2024
Viaarxiv icon

Planetarium: A Rigorous Benchmark for Translating Text to Structured Planning Languages

Add code
Jul 03, 2024
Figure 1 for Planetarium: A Rigorous Benchmark for Translating Text to Structured Planning Languages
Figure 2 for Planetarium: A Rigorous Benchmark for Translating Text to Structured Planning Languages
Figure 3 for Planetarium: A Rigorous Benchmark for Translating Text to Structured Planning Languages
Figure 4 for Planetarium: A Rigorous Benchmark for Translating Text to Structured Planning Languages
Viaarxiv icon

A Domain-Agnostic Approach for Characterization of Lifelong Learning Systems

Add code
Jan 18, 2023
Figure 1 for A Domain-Agnostic Approach for Characterization of Lifelong Learning Systems
Figure 2 for A Domain-Agnostic Approach for Characterization of Lifelong Learning Systems
Figure 3 for A Domain-Agnostic Approach for Characterization of Lifelong Learning Systems
Figure 4 for A Domain-Agnostic Approach for Characterization of Lifelong Learning Systems
Viaarxiv icon

Specifying Behavior Preference with Tiered Reward Functions

Add code
Dec 07, 2022
Figure 1 for Specifying Behavior Preference with Tiered Reward Functions
Figure 2 for Specifying Behavior Preference with Tiered Reward Functions
Figure 3 for Specifying Behavior Preference with Tiered Reward Functions
Figure 4 for Specifying Behavior Preference with Tiered Reward Functions
Viaarxiv icon

Evaluation Beyond Task Performance: Analyzing Concepts in AlphaZero in Hex

Add code
Nov 26, 2022
Figure 1 for Evaluation Beyond Task Performance: Analyzing Concepts in AlphaZero in Hex
Figure 2 for Evaluation Beyond Task Performance: Analyzing Concepts in AlphaZero in Hex
Figure 3 for Evaluation Beyond Task Performance: Analyzing Concepts in AlphaZero in Hex
Figure 4 for Evaluation Beyond Task Performance: Analyzing Concepts in AlphaZero in Hex
Viaarxiv icon

Reward-Predictive Clustering

Add code
Nov 07, 2022
Figure 1 for Reward-Predictive Clustering
Figure 2 for Reward-Predictive Clustering
Figure 3 for Reward-Predictive Clustering
Figure 4 for Reward-Predictive Clustering
Viaarxiv icon

Gathering Strength, Gathering Storms: The One Hundred Year Study on Artificial Intelligence (AI100) 2021 Study Panel Report

Add code
Oct 27, 2022
Viaarxiv icon

Designing Rewards for Fast Learning

Add code
May 30, 2022
Figure 1 for Designing Rewards for Fast Learning
Figure 2 for Designing Rewards for Fast Learning
Figure 3 for Designing Rewards for Fast Learning
Figure 4 for Designing Rewards for Fast Learning
Viaarxiv icon

Deep Q-Network with Proximal Iteration

Add code
Dec 10, 2021
Figure 1 for Deep Q-Network with Proximal Iteration
Figure 2 for Deep Q-Network with Proximal Iteration
Figure 3 for Deep Q-Network with Proximal Iteration
Figure 4 for Deep Q-Network with Proximal Iteration
Viaarxiv icon

On the Expressivity of Markov Reward

Add code
Nov 01, 2021
Figure 1 for On the Expressivity of Markov Reward
Figure 2 for On the Expressivity of Markov Reward
Figure 3 for On the Expressivity of Markov Reward
Figure 4 for On the Expressivity of Markov Reward
Viaarxiv icon