Picture for Alexander Nikulin

Alexander Nikulin

XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning

Add code
Jun 13, 2024
Viaarxiv icon

Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement Learning

Add code
Feb 05, 2024
Viaarxiv icon

In-Context Reinforcement Learning for Variable Action Spaces

Add code
Dec 20, 2023
Viaarxiv icon

Emergence of In-Context Reinforcement Learning from Noise Distillation

Add code
Dec 19, 2023
Viaarxiv icon

XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAX

Add code
Dec 19, 2023
Figure 1 for XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAX
Figure 2 for XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAX
Figure 3 for XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAX
Figure 4 for XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAX
Viaarxiv icon

Katakomba: Tools and Benchmarks for Data-Driven NetHack

Add code
Jun 14, 2023
Figure 1 for Katakomba: Tools and Benchmarks for Data-Driven NetHack
Figure 2 for Katakomba: Tools and Benchmarks for Data-Driven NetHack
Figure 3 for Katakomba: Tools and Benchmarks for Data-Driven NetHack
Figure 4 for Katakomba: Tools and Benchmarks for Data-Driven NetHack
Viaarxiv icon

Revisiting the Minimalist Approach to Offline Reinforcement Learning

Add code
May 16, 2023
Figure 1 for Revisiting the Minimalist Approach to Offline Reinforcement Learning
Figure 2 for Revisiting the Minimalist Approach to Offline Reinforcement Learning
Figure 3 for Revisiting the Minimalist Approach to Offline Reinforcement Learning
Figure 4 for Revisiting the Minimalist Approach to Offline Reinforcement Learning
Viaarxiv icon

Anti-Exploration by Random Network Distillation

Add code
Jan 31, 2023
Figure 1 for Anti-Exploration by Random Network Distillation
Figure 2 for Anti-Exploration by Random Network Distillation
Figure 3 for Anti-Exploration by Random Network Distillation
Figure 4 for Anti-Exploration by Random Network Distillation
Viaarxiv icon

Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size

Add code
Nov 20, 2022
Figure 1 for Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size
Figure 2 for Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size
Figure 3 for Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size
Figure 4 for Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size
Viaarxiv icon

Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows

Add code
Nov 20, 2022
Figure 1 for Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows
Figure 2 for Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows
Figure 3 for Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows
Figure 4 for Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows
Viaarxiv icon