Picture for Xiaotian Hao

Xiaotian Hao

PMIC: Improving Multi-Agent Reinforcement Learning with Progressive Mutual Information Collaboration

Add code
Mar 16, 2022
Figure 1 for PMIC: Improving Multi-Agent Reinforcement Learning with Progressive Mutual Information Collaboration
Figure 2 for PMIC: Improving Multi-Agent Reinforcement Learning with Progressive Mutual Information Collaboration
Figure 3 for PMIC: Improving Multi-Agent Reinforcement Learning with Progressive Mutual Information Collaboration
Figure 4 for PMIC: Improving Multi-Agent Reinforcement Learning with Progressive Mutual Information Collaboration
Viaarxiv icon

API: Boosting Multi-Agent Reinforcement Learning via Agent-Permutation-Invariant Networks

Add code
Mar 10, 2022
Figure 1 for API: Boosting Multi-Agent Reinforcement Learning via Agent-Permutation-Invariant Networks
Figure 2 for API: Boosting Multi-Agent Reinforcement Learning via Agent-Permutation-Invariant Networks
Figure 3 for API: Boosting Multi-Agent Reinforcement Learning via Agent-Permutation-Invariant Networks
Figure 4 for API: Boosting Multi-Agent Reinforcement Learning via Agent-Permutation-Invariant Networks
Viaarxiv icon

SEIHAI: A Sample-efficient Hierarchical AI for the MineRL Competition

Add code
Nov 17, 2021
Figure 1 for SEIHAI: A Sample-efficient Hierarchical AI for the MineRL Competition
Figure 2 for SEIHAI: A Sample-efficient Hierarchical AI for the MineRL Competition
Figure 3 for SEIHAI: A Sample-efficient Hierarchical AI for the MineRL Competition
Figure 4 for SEIHAI: A Sample-efficient Hierarchical AI for the MineRL Competition
Viaarxiv icon

Towards robust and domain agnostic reinforcement learning competitions

Add code
Jun 07, 2021
Figure 1 for Towards robust and domain agnostic reinforcement learning competitions
Figure 2 for Towards robust and domain agnostic reinforcement learning competitions
Figure 3 for Towards robust and domain agnostic reinforcement learning competitions
Figure 4 for Towards robust and domain agnostic reinforcement learning competitions
Viaarxiv icon

Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising

Add code
Jun 29, 2020
Figure 1 for Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising
Figure 2 for Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising
Figure 3 for Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising
Figure 4 for Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising
Viaarxiv icon

Learning to Accelerate Heuristic Searching for Large-Scale Maximum Weighted b-Matching Problems in Online Advertising

Add code
May 12, 2020
Figure 1 for Learning to Accelerate Heuristic Searching for Large-Scale Maximum Weighted b-Matching Problems in Online Advertising
Figure 2 for Learning to Accelerate Heuristic Searching for Large-Scale Maximum Weighted b-Matching Problems in Online Advertising
Figure 3 for Learning to Accelerate Heuristic Searching for Large-Scale Maximum Weighted b-Matching Problems in Online Advertising
Figure 4 for Learning to Accelerate Heuristic Searching for Large-Scale Maximum Weighted b-Matching Problems in Online Advertising
Viaarxiv icon

From Few to More: Large-scale Dynamic Multiagent Curriculum Learning

Add code
Sep 06, 2019
Figure 1 for From Few to More: Large-scale Dynamic Multiagent Curriculum Learning
Figure 2 for From Few to More: Large-scale Dynamic Multiagent Curriculum Learning
Figure 3 for From Few to More: Large-scale Dynamic Multiagent Curriculum Learning
Figure 4 for From Few to More: Large-scale Dynamic Multiagent Curriculum Learning
Viaarxiv icon

Action Semantics Network: Considering the Effects of Actions in Multiagent Systems

Add code
Jul 26, 2019
Figure 1 for Action Semantics Network: Considering the Effects of Actions in Multiagent Systems
Figure 2 for Action Semantics Network: Considering the Effects of Actions in Multiagent Systems
Figure 3 for Action Semantics Network: Considering the Effects of Actions in Multiagent Systems
Figure 4 for Action Semantics Network: Considering the Effects of Actions in Multiagent Systems
Viaarxiv icon