Picture for Bart Selman

Bart Selman

Policy-Value Alignment and Robustness in Search-based Multi-Agent Learning

Add code
Feb 06, 2023
Figure 1 for Policy-Value Alignment and Robustness in Search-based Multi-Agent Learning
Figure 2 for Policy-Value Alignment and Robustness in Search-based Multi-Agent Learning
Figure 3 for Policy-Value Alignment and Robustness in Search-based Multi-Agent Learning
Figure 4 for Policy-Value Alignment and Robustness in Search-based Multi-Agent Learning
Viaarxiv icon

Graph Value Iteration

Add code
Sep 20, 2022
Figure 1 for Graph Value Iteration
Figure 2 for Graph Value Iteration
Figure 3 for Graph Value Iteration
Figure 4 for Graph Value Iteration
Viaarxiv icon

Left Heavy Tails and the Effectiveness of the Policy and Value Networks in DNN-based best-first search for Sokoban Planning

Add code
Jun 28, 2022
Figure 1 for Left Heavy Tails and the Effectiveness of the Policy and Value Networks in DNN-based best-first search for Sokoban Planning
Figure 2 for Left Heavy Tails and the Effectiveness of the Policy and Value Networks in DNN-based best-first search for Sokoban Planning
Figure 3 for Left Heavy Tails and the Effectiveness of the Policy and Value Networks in DNN-based best-first search for Sokoban Planning
Figure 4 for Left Heavy Tails and the Effectiveness of the Policy and Value Networks in DNN-based best-first search for Sokoban Planning
Viaarxiv icon

A Novel Automated Curriculum Strategy to Solve Hard Sokoban Planning Instances

Add code
Oct 03, 2021
Figure 1 for A Novel Automated Curriculum Strategy to Solve Hard Sokoban Planning Instances
Figure 2 for A Novel Automated Curriculum Strategy to Solve Hard Sokoban Planning Instances
Figure 3 for A Novel Automated Curriculum Strategy to Solve Hard Sokoban Planning Instances
Figure 4 for A Novel Automated Curriculum Strategy to Solve Hard Sokoban Planning Instances
Viaarxiv icon

Automating Crystal-Structure Phase Mapping: Combining Deep Learning with Constraint Reasoning

Add code
Aug 21, 2021
Figure 1 for Automating Crystal-Structure Phase Mapping: Combining Deep Learning with Constraint Reasoning
Figure 2 for Automating Crystal-Structure Phase Mapping: Combining Deep Learning with Constraint Reasoning
Figure 3 for Automating Crystal-Structure Phase Mapping: Combining Deep Learning with Constraint Reasoning
Figure 4 for Automating Crystal-Structure Phase Mapping: Combining Deep Learning with Constraint Reasoning
Viaarxiv icon

Structure Amplification on Multi-layer Stochastic Block Models

Add code
Jul 31, 2021
Figure 1 for Structure Amplification on Multi-layer Stochastic Block Models
Figure 2 for Structure Amplification on Multi-layer Stochastic Block Models
Figure 3 for Structure Amplification on Multi-layer Stochastic Block Models
Figure 4 for Structure Amplification on Multi-layer Stochastic Block Models
Viaarxiv icon

Curriculum-Driven Multi-Agent Learning and the Role of Implicit Communication in Teamwork

Add code
Jun 21, 2021
Figure 1 for Curriculum-Driven Multi-Agent Learning and the Role of Implicit Communication in Teamwork
Figure 2 for Curriculum-Driven Multi-Agent Learning and the Role of Implicit Communication in Teamwork
Figure 3 for Curriculum-Driven Multi-Agent Learning and the Role of Implicit Communication in Teamwork
Figure 4 for Curriculum-Driven Multi-Agent Learning and the Role of Implicit Communication in Teamwork
Viaarxiv icon

Fairness for Cooperative Multi-Agent Learning with Equivariant Policies

Add code
Jun 10, 2021
Figure 1 for Fairness for Cooperative Multi-Agent Learning with Equivariant Policies
Figure 2 for Fairness for Cooperative Multi-Agent Learning with Equivariant Policies
Figure 3 for Fairness for Cooperative Multi-Agent Learning with Equivariant Policies
Figure 4 for Fairness for Cooperative Multi-Agent Learning with Equivariant Policies
Viaarxiv icon

Low-Bandwidth Communication Emerges Naturally in Multi-Agent Learning Systems

Add code
Dec 08, 2020
Figure 1 for Low-Bandwidth Communication Emerges Naturally in Multi-Agent Learning Systems
Figure 2 for Low-Bandwidth Communication Emerges Naturally in Multi-Agent Learning Systems
Figure 3 for Low-Bandwidth Communication Emerges Naturally in Multi-Agent Learning Systems
Figure 4 for Low-Bandwidth Communication Emerges Naturally in Multi-Agent Learning Systems
Viaarxiv icon

Solving Hard AI Planning Instances Using Curriculum-Driven Deep Reinforcement Learning

Add code
Jun 04, 2020
Figure 1 for Solving Hard AI Planning Instances Using Curriculum-Driven Deep Reinforcement Learning
Figure 2 for Solving Hard AI Planning Instances Using Curriculum-Driven Deep Reinforcement Learning
Figure 3 for Solving Hard AI Planning Instances Using Curriculum-Driven Deep Reinforcement Learning
Figure 4 for Solving Hard AI Planning Instances Using Curriculum-Driven Deep Reinforcement Learning
Viaarxiv icon