Alert button
Picture for Bowen Baker

Bowen Baker

Alert button

Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision

Add code
Bookmark button
Alert button
Dec 14, 2023
Collin Burns, Pavel Izmailov, Jan Hendrik Kirchner, Bowen Baker, Leo Gao, Leopold Aschenbrenner, Yining Chen, Adrien Ecoffet, Manas Joglekar, Jan Leike, Ilya Sutskever, Jeff Wu

Viaarxiv icon

Let's Verify Step by Step

Add code
Bookmark button
Alert button
May 31, 2023
Hunter Lightman, Vineet Kosaraju, Yura Burda, Harri Edwards, Bowen Baker, Teddy Lee, Jan Leike, John Schulman, Ilya Sutskever, Karl Cobbe

Figure 1 for Let's Verify Step by Step
Figure 2 for Let's Verify Step by Step
Figure 3 for Let's Verify Step by Step
Figure 4 for Let's Verify Step by Step
Viaarxiv icon

Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos

Add code
Bookmark button
Alert button
Jun 23, 2022
Bowen Baker, Ilge Akkaya, Peter Zhokhov, Joost Huizinga, Jie Tang, Adrien Ecoffet, Brandon Houghton, Raul Sampedro, Jeff Clune

Figure 1 for Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos
Figure 2 for Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos
Figure 3 for Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos
Figure 4 for Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos
Viaarxiv icon

Multi-task curriculum learning in a complex, visual, hard-exploration domain: Minecraft

Add code
Bookmark button
Alert button
Jun 28, 2021
Ingmar Kanitscheider, Joost Huizinga, David Farhi, William Hebgen Guss, Brandon Houghton, Raul Sampedro, Peter Zhokhov, Bowen Baker, Adrien Ecoffet, Jie Tang, Oleg Klimov, Jeff Clune

Figure 1 for Multi-task curriculum learning in a complex, visual, hard-exploration domain: Minecraft
Figure 2 for Multi-task curriculum learning in a complex, visual, hard-exploration domain: Minecraft
Figure 3 for Multi-task curriculum learning in a complex, visual, hard-exploration domain: Minecraft
Figure 4 for Multi-task curriculum learning in a complex, visual, hard-exploration domain: Minecraft
Viaarxiv icon

Emergent Reciprocity and Team Formation from Randomized Uncertain Social Preferences

Add code
Bookmark button
Alert button
Nov 10, 2020
Bowen Baker

Figure 1 for Emergent Reciprocity and Team Formation from Randomized Uncertain Social Preferences
Figure 2 for Emergent Reciprocity and Team Formation from Randomized Uncertain Social Preferences
Figure 3 for Emergent Reciprocity and Team Formation from Randomized Uncertain Social Preferences
Figure 4 for Emergent Reciprocity and Team Formation from Randomized Uncertain Social Preferences
Viaarxiv icon

Emergent Tool Use From Multi-Agent Autocurricula

Add code
Bookmark button
Alert button
Sep 17, 2019
Bowen Baker, Ingmar Kanitscheider, Todor Markov, Yi Wu, Glenn Powell, Bob McGrew, Igor Mordatch

Figure 1 for Emergent Tool Use From Multi-Agent Autocurricula
Figure 2 for Emergent Tool Use From Multi-Agent Autocurricula
Figure 3 for Emergent Tool Use From Multi-Agent Autocurricula
Figure 4 for Emergent Tool Use From Multi-Agent Autocurricula
Viaarxiv icon

Learning Dexterous In-Hand Manipulation

Add code
Bookmark button
Alert button
Jan 18, 2019
OpenAI, Marcin Andrychowicz, Bowen Baker, Maciek Chociej, Rafal Jozefowicz, Bob McGrew, Jakub Pachocki, Arthur Petron, Matthias Plappert, Glenn Powell, Alex Ray, Jonas Schneider, Szymon Sidor, Josh Tobin, Peter Welinder, Lilian Weng, Wojciech Zaremba

Figure 1 for Learning Dexterous In-Hand Manipulation
Figure 2 for Learning Dexterous In-Hand Manipulation
Figure 3 for Learning Dexterous In-Hand Manipulation
Figure 4 for Learning Dexterous In-Hand Manipulation
Viaarxiv icon

Multi-Goal Reinforcement Learning: Challenging Robotics Environments and Request for Research

Add code
Bookmark button
Alert button
Mar 10, 2018
Matthias Plappert, Marcin Andrychowicz, Alex Ray, Bob McGrew, Bowen Baker, Glenn Powell, Jonas Schneider, Josh Tobin, Maciek Chociej, Peter Welinder, Vikash Kumar, Wojciech Zaremba

Figure 1 for Multi-Goal Reinforcement Learning: Challenging Robotics Environments and Request for Research
Figure 2 for Multi-Goal Reinforcement Learning: Challenging Robotics Environments and Request for Research
Figure 3 for Multi-Goal Reinforcement Learning: Challenging Robotics Environments and Request for Research
Figure 4 for Multi-Goal Reinforcement Learning: Challenging Robotics Environments and Request for Research
Viaarxiv icon

Accelerating Neural Architecture Search using Performance Prediction

Add code
Bookmark button
Alert button
Nov 08, 2017
Bowen Baker, Otkrist Gupta, Ramesh Raskar, Nikhil Naik

Figure 1 for Accelerating Neural Architecture Search using Performance Prediction
Figure 2 for Accelerating Neural Architecture Search using Performance Prediction
Figure 3 for Accelerating Neural Architecture Search using Performance Prediction
Figure 4 for Accelerating Neural Architecture Search using Performance Prediction
Viaarxiv icon