Picture for Noam Brown

Noam Brown

Tony

Learned Belief Search: Efficiently Improving Policies in Partially Observable Settings

Add code
Jun 16, 2021
Figure 1 for Learned Belief Search: Efficiently Improving Policies in Partially Observable Settings
Figure 2 for Learned Belief Search: Efficiently Improving Policies in Partially Observable Settings
Figure 3 for Learned Belief Search: Efficiently Improving Policies in Partially Observable Settings
Figure 4 for Learned Belief Search: Efficiently Improving Policies in Partially Observable Settings
Viaarxiv icon

Off-Belief Learning

Add code
Mar 06, 2021
Figure 1 for Off-Belief Learning
Figure 2 for Off-Belief Learning
Figure 3 for Off-Belief Learning
Figure 4 for Off-Belief Learning
Viaarxiv icon

Safe Search for Stackelberg Equilibria in Extensive-Form Games

Add code
Feb 02, 2021
Figure 1 for Safe Search for Stackelberg Equilibria in Extensive-Form Games
Figure 2 for Safe Search for Stackelberg Equilibria in Extensive-Form Games
Figure 3 for Safe Search for Stackelberg Equilibria in Extensive-Form Games
Figure 4 for Safe Search for Stackelberg Equilibria in Extensive-Form Games
Viaarxiv icon

Human-Level Performance in No-Press Diplomacy via Equilibrium Search

Add code
Oct 06, 2020
Figure 1 for Human-Level Performance in No-Press Diplomacy via Equilibrium Search
Figure 2 for Human-Level Performance in No-Press Diplomacy via Equilibrium Search
Figure 3 for Human-Level Performance in No-Press Diplomacy via Equilibrium Search
Figure 4 for Human-Level Performance in No-Press Diplomacy via Equilibrium Search
Viaarxiv icon

Combining Deep Reinforcement Learning and Search for Imperfect-Information Games

Add code
Jul 27, 2020
Figure 1 for Combining Deep Reinforcement Learning and Search for Imperfect-Information Games
Figure 2 for Combining Deep Reinforcement Learning and Search for Imperfect-Information Games
Figure 3 for Combining Deep Reinforcement Learning and Search for Imperfect-Information Games
Figure 4 for Combining Deep Reinforcement Learning and Search for Imperfect-Information Games
Viaarxiv icon

Unlocking the Potential of Deep Counterfactual Value Networks

Add code
Jul 20, 2020
Figure 1 for Unlocking the Potential of Deep Counterfactual Value Networks
Figure 2 for Unlocking the Potential of Deep Counterfactual Value Networks
Figure 3 for Unlocking the Potential of Deep Counterfactual Value Networks
Figure 4 for Unlocking the Potential of Deep Counterfactual Value Networks
Viaarxiv icon

DREAM: Deep Regret minimization with Advantage baselines and Model-free learning

Add code
Jun 18, 2020
Figure 1 for DREAM: Deep Regret minimization with Advantage baselines and Model-free learning
Figure 2 for DREAM: Deep Regret minimization with Advantage baselines and Model-free learning
Figure 3 for DREAM: Deep Regret minimization with Advantage baselines and Model-free learning
Figure 4 for DREAM: Deep Regret minimization with Advantage baselines and Model-free learning
Viaarxiv icon

Improving Policies via Search in Cooperative Partially Observable Games

Add code
Dec 05, 2019
Figure 1 for Improving Policies via Search in Cooperative Partially Observable Games
Figure 2 for Improving Policies via Search in Cooperative Partially Observable Games
Figure 3 for Improving Policies via Search in Cooperative Partially Observable Games
Figure 4 for Improving Policies via Search in Cooperative Partially Observable Games
Viaarxiv icon

Stable-Predictive Optimistic Counterfactual Regret Minimization

Add code
Feb 13, 2019
Figure 1 for Stable-Predictive Optimistic Counterfactual Regret Minimization
Figure 2 for Stable-Predictive Optimistic Counterfactual Regret Minimization
Viaarxiv icon

Deep Counterfactual Regret Minimization

Add code
Nov 01, 2018
Figure 1 for Deep Counterfactual Regret Minimization
Figure 2 for Deep Counterfactual Regret Minimization
Figure 3 for Deep Counterfactual Regret Minimization
Figure 4 for Deep Counterfactual Regret Minimization
Viaarxiv icon