Picture for Mehdi Dastani

Mehdi Dastani

Neuro-symbolic Action Masking for Deep Reinforcement Learning

Add code
Feb 11, 2026
Viaarxiv icon

Pushdown Reward Machines for Reinforcement Learning

Add code
Aug 09, 2025
Figure 1 for Pushdown Reward Machines for Reinforcement Learning
Figure 2 for Pushdown Reward Machines for Reinforcement Learning
Figure 3 for Pushdown Reward Machines for Reinforcement Learning
Figure 4 for Pushdown Reward Machines for Reinforcement Learning
Viaarxiv icon

Credit Assignment and Efficient Exploration based on Influence Scope in Multi-agent Reinforcement Learning

Add code
May 13, 2025
Figure 1 for Credit Assignment and Efficient Exploration based on Influence Scope in Multi-agent Reinforcement Learning
Figure 2 for Credit Assignment and Efficient Exploration based on Influence Scope in Multi-agent Reinforcement Learning
Figure 3 for Credit Assignment and Efficient Exploration based on Influence Scope in Multi-agent Reinforcement Learning
Figure 4 for Credit Assignment and Efficient Exploration based on Influence Scope in Multi-agent Reinforcement Learning
Viaarxiv icon

Causes and Strategies in Multiagent Systems

Add code
Feb 19, 2025
Figure 1 for Causes and Strategies in Multiagent Systems
Figure 2 for Causes and Strategies in Multiagent Systems
Figure 3 for Causes and Strategies in Multiagent Systems
Viaarxiv icon

Reducing Variance Caused by Communication in Decentralized Multi-agent Deep Reinforcement Learning

Add code
Feb 10, 2025
Figure 1 for Reducing Variance Caused by Communication in Decentralized Multi-agent Deep Reinforcement Learning
Figure 2 for Reducing Variance Caused by Communication in Decentralized Multi-agent Deep Reinforcement Learning
Figure 3 for Reducing Variance Caused by Communication in Decentralized Multi-agent Deep Reinforcement Learning
Figure 4 for Reducing Variance Caused by Communication in Decentralized Multi-agent Deep Reinforcement Learning
Viaarxiv icon

The Minimal Search Space for Conditional Causal Bandits

Add code
Feb 10, 2025
Viaarxiv icon

Temporal Causal Reasoning with (Non-Recursive) Structural Equation Models

Add code
Jan 17, 2025
Figure 1 for Temporal Causal Reasoning with (Non-Recursive) Structural Equation Models
Figure 2 for Temporal Causal Reasoning with (Non-Recursive) Structural Equation Models
Figure 3 for Temporal Causal Reasoning with (Non-Recursive) Structural Equation Models
Figure 4 for Temporal Causal Reasoning with (Non-Recursive) Structural Equation Models
Viaarxiv icon

Optimal Causal Representations and the Causal Information Bottleneck

Add code
Oct 02, 2024
Figure 1 for Optimal Causal Representations and the Causal Information Bottleneck
Figure 2 for Optimal Causal Representations and the Causal Information Bottleneck
Figure 3 for Optimal Causal Representations and the Causal Information Bottleneck
Figure 4 for Optimal Causal Representations and the Causal Information Bottleneck
Viaarxiv icon

Maximally Permissive Reward Machines

Add code
Aug 15, 2024
Figure 1 for Maximally Permissive Reward Machines
Figure 2 for Maximally Permissive Reward Machines
Figure 3 for Maximally Permissive Reward Machines
Figure 4 for Maximally Permissive Reward Machines
Viaarxiv icon

Cooperative Multi-agent Approach for Automated Computer Game Testing

Add code
May 18, 2024
Figure 1 for Cooperative Multi-agent Approach for Automated Computer Game Testing
Figure 2 for Cooperative Multi-agent Approach for Automated Computer Game Testing
Figure 3 for Cooperative Multi-agent Approach for Automated Computer Game Testing
Figure 4 for Cooperative Multi-agent Approach for Automated Computer Game Testing
Viaarxiv icon