Picture for Marc Lanctot

Marc Lanctot

Variance Reduction in Monte Carlo Counterfactual Regret Minimization (VR-MCCFR) for Extensive Form Games using Baselines

Add code
Sep 09, 2018
Figure 1 for Variance Reduction in Monte Carlo Counterfactual Regret Minimization (VR-MCCFR) for Extensive Form Games using Baselines
Figure 2 for Variance Reduction in Monte Carlo Counterfactual Regret Minimization (VR-MCCFR) for Extensive Form Games using Baselines
Figure 3 for Variance Reduction in Monte Carlo Counterfactual Regret Minimization (VR-MCCFR) for Extensive Form Games using Baselines
Figure 4 for Variance Reduction in Monte Carlo Counterfactual Regret Minimization (VR-MCCFR) for Extensive Form Games using Baselines
Viaarxiv icon

Emergent Communication through Negotiation

Add code
Apr 11, 2018
Figure 1 for Emergent Communication through Negotiation
Figure 2 for Emergent Communication through Negotiation
Figure 3 for Emergent Communication through Negotiation
Figure 4 for Emergent Communication through Negotiation
Viaarxiv icon

Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm

Add code
Dec 05, 2017
Figure 1 for Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm
Figure 2 for Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm
Figure 3 for Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm
Figure 4 for Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm
Viaarxiv icon

Deep Q-learning from Demonstrations

Add code
Nov 22, 2017
Figure 1 for Deep Q-learning from Demonstrations
Figure 2 for Deep Q-learning from Demonstrations
Figure 3 for Deep Q-learning from Demonstrations
Viaarxiv icon

A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning

Add code
Nov 07, 2017
Figure 1 for A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning
Figure 2 for A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning
Figure 3 for A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning
Figure 4 for A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning
Viaarxiv icon

Value-Decomposition Networks For Cooperative Multi-Agent Learning

Add code
Jun 16, 2017
Figure 1 for Value-Decomposition Networks For Cooperative Multi-Agent Learning
Figure 2 for Value-Decomposition Networks For Cooperative Multi-Agent Learning
Viaarxiv icon

Memory-Efficient Backpropagation Through Time

Add code
Jun 10, 2016
Figure 1 for Memory-Efficient Backpropagation Through Time
Figure 2 for Memory-Efficient Backpropagation Through Time
Figure 3 for Memory-Efficient Backpropagation Through Time
Figure 4 for Memory-Efficient Backpropagation Through Time
Viaarxiv icon

Convolution by Evolution: Differentiable Pattern Producing Networks

Add code
Jun 08, 2016
Figure 1 for Convolution by Evolution: Differentiable Pattern Producing Networks
Figure 2 for Convolution by Evolution: Differentiable Pattern Producing Networks
Figure 3 for Convolution by Evolution: Differentiable Pattern Producing Networks
Figure 4 for Convolution by Evolution: Differentiable Pattern Producing Networks
Viaarxiv icon

Dueling Network Architectures for Deep Reinforcement Learning

Add code
Apr 05, 2016
Figure 1 for Dueling Network Architectures for Deep Reinforcement Learning
Figure 2 for Dueling Network Architectures for Deep Reinforcement Learning
Figure 3 for Dueling Network Architectures for Deep Reinforcement Learning
Figure 4 for Dueling Network Architectures for Deep Reinforcement Learning
Viaarxiv icon

Monte Carlo Tree Search with Heuristic Evaluations using Implicit Minimax Backups

Add code
Jun 19, 2014
Figure 1 for Monte Carlo Tree Search with Heuristic Evaluations using Implicit Minimax Backups
Figure 2 for Monte Carlo Tree Search with Heuristic Evaluations using Implicit Minimax Backups
Figure 3 for Monte Carlo Tree Search with Heuristic Evaluations using Implicit Minimax Backups
Figure 4 for Monte Carlo Tree Search with Heuristic Evaluations using Implicit Minimax Backups
Viaarxiv icon