Picture for Thomy Phan

Thomy Phan

LMU Munich

Multi-Agent Path Finding Among Dynamic Uncontrollable Agents with Statistical Safety Guarantees

Add code
Jul 29, 2025
Viaarxiv icon

New Mechanisms in Flex Distribution for Bounded Suboptimal Multi-Agent Path Finding

Add code
Jul 22, 2025
Viaarxiv icon

Anytime Multi-Agent Path Finding with an Adaptive Delay-Based Heuristic

Add code
Aug 06, 2024
Viaarxiv icon

Architectural Influence on Variational Quantum Circuits in Multi-Agent Reinforcement Learning: Evolutionary Strategies for Optimization

Add code
Jul 30, 2024
Figure 1 for Architectural Influence on Variational Quantum Circuits in Multi-Agent Reinforcement Learning: Evolutionary Strategies for Optimization
Figure 2 for Architectural Influence on Variational Quantum Circuits in Multi-Agent Reinforcement Learning: Evolutionary Strategies for Optimization
Figure 3 for Architectural Influence on Variational Quantum Circuits in Multi-Agent Reinforcement Learning: Evolutionary Strategies for Optimization
Figure 4 for Architectural Influence on Variational Quantum Circuits in Multi-Agent Reinforcement Learning: Evolutionary Strategies for Optimization
Viaarxiv icon

Aquarium: A Comprehensive Framework for Exploring Predator-Prey Dynamics through Multi-Agent Reinforcement Learning Algorithms

Add code
Jan 13, 2024
Figure 1 for Aquarium: A Comprehensive Framework for Exploring Predator-Prey Dynamics through Multi-Agent Reinforcement Learning Algorithms
Figure 2 for Aquarium: A Comprehensive Framework for Exploring Predator-Prey Dynamics through Multi-Agent Reinforcement Learning Algorithms
Figure 3 for Aquarium: A Comprehensive Framework for Exploring Predator-Prey Dynamics through Multi-Agent Reinforcement Learning Algorithms
Figure 4 for Aquarium: A Comprehensive Framework for Exploring Predator-Prey Dynamics through Multi-Agent Reinforcement Learning Algorithms
Viaarxiv icon

ClusterComm: Discrete Communication in Decentralized MARL using Internal Representation Clustering

Add code
Jan 07, 2024
Viaarxiv icon

Adaptive Anytime Multi-Agent Path Finding Using Bandit-Based Large Neighborhood Search

Add code
Jan 01, 2024
Figure 1 for Adaptive Anytime Multi-Agent Path Finding Using Bandit-Based Large Neighborhood Search
Figure 2 for Adaptive Anytime Multi-Agent Path Finding Using Bandit-Based Large Neighborhood Search
Figure 3 for Adaptive Anytime Multi-Agent Path Finding Using Bandit-Based Large Neighborhood Search
Figure 4 for Adaptive Anytime Multi-Agent Path Finding Using Bandit-Based Large Neighborhood Search
Viaarxiv icon

Challenges for Reinforcement Learning in Quantum Computing

Add code
Dec 18, 2023
Viaarxiv icon

Multi-Agent Quantum Reinforcement Learning using Evolutionary Optimization

Add code
Nov 09, 2023
Viaarxiv icon

CROP: Towards Distributional-Shift Robust Reinforcement Learning using Compact Reshaped Observation Processing

Add code
Apr 26, 2023
Figure 1 for CROP: Towards Distributional-Shift Robust Reinforcement Learning using Compact Reshaped Observation Processing
Figure 2 for CROP: Towards Distributional-Shift Robust Reinforcement Learning using Compact Reshaped Observation Processing
Figure 3 for CROP: Towards Distributional-Shift Robust Reinforcement Learning using Compact Reshaped Observation Processing
Figure 4 for CROP: Towards Distributional-Shift Robust Reinforcement Learning using Compact Reshaped Observation Processing
Viaarxiv icon