Minigrid


Latent Perspective-Taking via a Schrödinger Bridge in Influence-Augmented Local Models

Feb 02, 2026

Unsupervised Learning of Efficient Exploration: Pre-training Adaptive Policies via Self-Imposed Goals

Jan 27, 2026

Deep Intrinsic Surprise-Regularized Control (DISRC): A Biologically Inspired Mechanism for Efficient Deep Q-Learning in Sparse Environments

Jan 24, 2026

Beyond Fixed Tasks: Unsupervised Environment Design for Task-Level Pairs

Nov 16, 2025

Adaptive Context Length Optimization with Low-Frequency Truncation for Multi-Agent Reinforcement Learning

Oct 30, 2025

Code-Driven Planning in Grid Worlds with Large Language Models

May 15, 2025

DYSTIL: Dynamic Strategy Induction with Large Language Models for Reinforcement Learning

May 06, 2025

D3HRL: A Distributed Hierarchical Reinforcement Learning Approach Based on Causal Discovery and Spurious Correlation Detection

May 04, 2025

LLM-Guided Probabilistic Program Induction for POMDP Model Estimation

May 04, 2025

World Model Agents with Change-Based Intrinsic Motivation

Mar 26, 2025