Picture for Stephanie Milani

Stephanie Milani

The PokeAgent Challenge: Competitive and Long-Context Learning at Scale

Add code
Mar 17, 2026
Viaarxiv icon

Concept-Based Interpretable Reinforcement Learning with Limited to No Human Labels

Add code
Jul 22, 2024
Figure 1 for Concept-Based Interpretable Reinforcement Learning with Limited to No Human Labels
Figure 2 for Concept-Based Interpretable Reinforcement Learning with Limited to No Human Labels
Figure 3 for Concept-Based Interpretable Reinforcement Learning with Limited to No Human Labels
Figure 4 for Concept-Based Interpretable Reinforcement Learning with Limited to No Human Labels
Viaarxiv icon

Interpretability in Action: Exploratory Analysis of VPT, a Minecraft Agent

Add code
Jul 16, 2024
Figure 1 for Interpretability in Action: Exploratory Analysis of VPT, a Minecraft Agent
Figure 2 for Interpretability in Action: Exploratory Analysis of VPT, a Minecraft Agent
Figure 3 for Interpretability in Action: Exploratory Analysis of VPT, a Minecraft Agent
Figure 4 for Interpretability in Action: Exploratory Analysis of VPT, a Minecraft Agent
Viaarxiv icon

Unifying Interpretability and Explainability for Alzheimer's Disease Progression Prediction

Add code
Jun 11, 2024
Viaarxiv icon

PATIENT-Ψ: Using Large Language Models to Simulate Patients for Training Mental Health Professionals

Add code
May 30, 2024
Viaarxiv icon

BEDD: The MineRL BASALT Evaluation and Demonstrations Dataset for Training and Benchmarking Agents that Solve Fuzzy Tasks

Add code
Dec 05, 2023
Figure 1 for BEDD: The MineRL BASALT Evaluation and Demonstrations Dataset for Training and Benchmarking Agents that Solve Fuzzy Tasks
Figure 2 for BEDD: The MineRL BASALT Evaluation and Demonstrations Dataset for Training and Benchmarking Agents that Solve Fuzzy Tasks
Figure 3 for BEDD: The MineRL BASALT Evaluation and Demonstrations Dataset for Training and Benchmarking Agents that Solve Fuzzy Tasks
Figure 4 for BEDD: The MineRL BASALT Evaluation and Demonstrations Dataset for Training and Benchmarking Agents that Solve Fuzzy Tasks
Viaarxiv icon

Bi-level Latent Variable Model for Sample-Efficient Multi-Agent Reinforcement Learning

Add code
Apr 12, 2023
Figure 1 for Bi-level Latent Variable Model for Sample-Efficient Multi-Agent Reinforcement Learning
Figure 2 for Bi-level Latent Variable Model for Sample-Efficient Multi-Agent Reinforcement Learning
Figure 3 for Bi-level Latent Variable Model for Sample-Efficient Multi-Agent Reinforcement Learning
Figure 4 for Bi-level Latent Variable Model for Sample-Efficient Multi-Agent Reinforcement Learning
Viaarxiv icon

Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the MineRL BASALT 2022 Competition

Add code
Mar 23, 2023
Figure 1 for Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the MineRL BASALT 2022 Competition
Figure 2 for Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the MineRL BASALT 2022 Competition
Figure 3 for Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the MineRL BASALT 2022 Competition
Figure 4 for Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the MineRL BASALT 2022 Competition
Viaarxiv icon

Navigates Like Me: Understanding How People Evaluate Human-Like AI in Video Games

Add code
Mar 02, 2023
Figure 1 for Navigates Like Me: Understanding How People Evaluate Human-Like AI in Video Games
Figure 2 for Navigates Like Me: Understanding How People Evaluate Human-Like AI in Video Games
Figure 3 for Navigates Like Me: Understanding How People Evaluate Human-Like AI in Video Games
Figure 4 for Navigates Like Me: Understanding How People Evaluate Human-Like AI in Video Games
Viaarxiv icon

UniMASK: Unified Inference in Sequential Decision Problems

Add code
Nov 20, 2022
Figure 1 for UniMASK: Unified Inference in Sequential Decision Problems
Figure 2 for UniMASK: Unified Inference in Sequential Decision Problems
Figure 3 for UniMASK: Unified Inference in Sequential Decision Problems
Figure 4 for UniMASK: Unified Inference in Sequential Decision Problems
Viaarxiv icon