Alexey Skrynnik

Self-Guided Plan Extraction for Instruction-Following Tasks with Goal-Conditional Reinforcement Learning

Apr 22, 2026

MARL-GPT: Foundation Model for Multi-Agent Reinforcement Learning

Apr 07, 2026

Revisiting Tree Search for LLMs: Gumbel and Sequential Halving for Budget-Scalable Reasoning

Mar 22, 2026

CoRL-MPPI: Enhancing MPPI With Learnable Behaviours For Efficient And Provably-Safe Multi-Robot Collision Avoidance

Nov 12, 2025

CrafText Benchmark: Advancing Instruction Following in Complex Multimodal Open-Ended World

May 17, 2025

MAPF-GPT: Imitation Learning for Multi-Agent Pathfinding at Scale

Aug 29, 2024

POGEMA: A Benchmark Platform for Cooperative Multi-Agent Navigation

Jul 20, 2024

Instruction Following with Goal-Conditioned Reinforcement Learning in Virtual Environments

Jul 12, 2024

IDAT: A Multi-Modal Dataset and Toolkit for Building and Evaluating Interactive Task-Solving Agents

Jul 12, 2024

Decentralized Monte Carlo Tree Search for Partially Observable Multi-agent Pathfinding

Dec 26, 2023