
Anton Bakhtin

Towards Measuring the Representation of Subjective Global Opinions in Language Models

Jun 28, 2023
Esin Durmus, Karina Nguyen, Thomas I. Liao, Nicholas Schiefer, Amanda Askell, Anton Bakhtin, Carol Chen, Zac Hatfield-Dodds, Danny Hernandez, Nicholas Joseph, Liane Lovitt, Sam McCandlish, Orowa Sikder, Alex Tamkin, Janel Thamkul, Jared Kaplan, Jack Clark, Deep Ganguli

Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning

Oct 11, 2022
Anton Bakhtin, David J Wu, Adam Lerer, Jonathan Gray, Athul Paul Jacob, Gabriele Farina, Alexander H Miller, Noam Brown

Modeling Strong and Human-Like Gameplay with KL-Regularized Search

Dec 14, 2021
Athul Paul Jacob, David J. Wu, Gabriele Farina, Adam Lerer, Anton Bakhtin, Jacob Andreas, Noam Brown

No-Press Diplomacy from Scratch

Oct 06, 2021
Anton Bakhtin, David Wu, Adam Lerer, Noam Brown

Physical Reasoning Using Dynamics-Aware Models

Feb 20, 2021
Eltayeb Ahmed, Anton Bakhtin, Laurens van der Maaten, Rohit Girdhar

Human-Level Performance in No-Press Diplomacy via Equilibrium Search

Oct 06, 2020
Jonathan Gray, Adam Lerer, Anton Bakhtin, Noam Brown

Combining Deep Reinforcement Learning and Search for Imperfect-Information Games

Jul 27, 2020
Noam Brown, Anton Bakhtin, Adam Lerer, Qucheng Gong

Residual Energy-Based Models for Text Generation

Apr 22, 2020
Yuntian Deng, Anton Bakhtin, Myle Ott, Arthur Szlam, Marc'Aurelio Ranzato

Energy-Based Models for Text

Apr 06, 2020
Anton Bakhtin, Yuntian Deng, Sam Gross, Myle Ott, Marc'Aurelio Ranzato, Arthur Szlam

Language Models as Knowledge Bases?

Sep 04, 2019
Fabio Petroni, Tim Rocktäschel, Patrick Lewis, Anton Bakhtin, Yuxiang Wu, Alexander H. Miller, Sebastian Riedel
