Picture for Aleksandr I. Panov

Aleksandr I. Panov

Recurrent Memory Decision Transformer

Add code
Jul 05, 2023
Figure 1 for Recurrent Memory Decision Transformer
Figure 2 for Recurrent Memory Decision Transformer
Figure 3 for Recurrent Memory Decision Transformer
Figure 4 for Recurrent Memory Decision Transformer
Viaarxiv icon

Intrinsic Motivation in Model-based Reinforcement Learning: A Brief Review

Add code
Jan 24, 2023
Viaarxiv icon

HPointLoc: Point-based Indoor Place Recognition using Synthetic RGB-D Images

Add code
Dec 30, 2022
Viaarxiv icon

POGEMA: Partially Observable Grid Environment for Multiple Agents

Add code
Jun 22, 2022
Figure 1 for POGEMA: Partially Observable Grid Environment for Multiple Agents
Figure 2 for POGEMA: Partially Observable Grid Environment for Multiple Agents
Figure 3 for POGEMA: Partially Observable Grid Environment for Multiple Agents
Figure 4 for POGEMA: Partially Observable Grid Environment for Multiple Agents
Viaarxiv icon

IGLU Gridworld: Simple and Fast Environment for Embodied Dialog Agents

Add code
May 31, 2022
Figure 1 for IGLU Gridworld: Simple and Fast Environment for Embodied Dialog Agents
Figure 2 for IGLU Gridworld: Simple and Fast Environment for Embodied Dialog Agents
Figure 3 for IGLU Gridworld: Simple and Fast Environment for Embodied Dialog Agents
Viaarxiv icon

Multitask Adaptation by Retrospective Exploration with Learned World Models

Add code
Oct 25, 2021
Figure 1 for Multitask Adaptation by Retrospective Exploration with Learned World Models
Figure 2 for Multitask Adaptation by Retrospective Exploration with Learned World Models
Figure 3 for Multitask Adaptation by Retrospective Exploration with Learned World Models
Figure 4 for Multitask Adaptation by Retrospective Exploration with Learned World Models
Viaarxiv icon

Long-Term Exploration in Persistent MDPs

Add code
Sep 21, 2021
Figure 1 for Long-Term Exploration in Persistent MDPs
Figure 2 for Long-Term Exploration in Persistent MDPs
Figure 3 for Long-Term Exploration in Persistent MDPs
Figure 4 for Long-Term Exploration in Persistent MDPs
Viaarxiv icon

Landmark Policy Optimization for Object Navigation Task

Add code
Sep 17, 2021
Figure 1 for Landmark Policy Optimization for Object Navigation Task
Figure 2 for Landmark Policy Optimization for Object Navigation Task
Figure 3 for Landmark Policy Optimization for Object Navigation Task
Figure 4 for Landmark Policy Optimization for Object Navigation Task
Viaarxiv icon

Q-Mixing Network for Multi-Agent Pathfinding in Partially Observable Grid Environments

Add code
Aug 13, 2021
Figure 1 for Q-Mixing Network for Multi-Agent Pathfinding in Partially Observable Grid Environments
Figure 2 for Q-Mixing Network for Multi-Agent Pathfinding in Partially Observable Grid Environments
Figure 3 for Q-Mixing Network for Multi-Agent Pathfinding in Partially Observable Grid Environments
Figure 4 for Q-Mixing Network for Multi-Agent Pathfinding in Partially Observable Grid Environments
Viaarxiv icon

Delta Schema Network in Model-based Reinforcement Learning

Add code
Jul 08, 2020
Figure 1 for Delta Schema Network in Model-based Reinforcement Learning
Figure 2 for Delta Schema Network in Model-based Reinforcement Learning
Figure 3 for Delta Schema Network in Model-based Reinforcement Learning
Viaarxiv icon