Alert button
Picture for David Krueger

David Krueger

Alert button

Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback

Add code
Bookmark button
Alert button
Jul 27, 2023
Stephen Casper, Xander Davies, Claudia Shi, Thomas Krendl Gilbert, Jérémy Scheurer, Javier Rando, Rachel Freedman, Tomasz Korbak, David Lindner, Pedro Freire, Tony Wang, Samuel Marks, Charbel-Raphaël Segerie, Micah Carroll, Andi Peng, Phillip Christoffersen, Mehul Damani, Stewart Slocum, Usman Anwar, Anand Siththaranjan, Max Nadeau, Eric J. Michaud, Jacob Pfau, Dmitrii Krasheninnikov, Xin Chen, Lauro Langosco, Peter Hase, Erdem Bıyık, Anca Dragan, David Krueger, Dorsa Sadigh, Dylan Hadfield-Menell

Figure 1 for Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Figure 2 for Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Figure 3 for Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Figure 4 for Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Viaarxiv icon

Thinker: Learning to Plan and Act

Add code
Bookmark button
Alert button
Jul 27, 2023
Stephen Chung, Ivan Anokhin, David Krueger

Figure 1 for Thinker: Learning to Plan and Act
Figure 2 for Thinker: Learning to Plan and Act
Figure 3 for Thinker: Learning to Plan and Act
Figure 4 for Thinker: Learning to Plan and Act
Viaarxiv icon

Investigating the Nature of 3D Generalization in Deep Neural Networks

Add code
Bookmark button
Alert button
Apr 19, 2023
Shoaib Ahmed Siddiqui, David Krueger, Thomas Breuel

Figure 1 for Investigating the Nature of 3D Generalization in Deep Neural Networks
Figure 2 for Investigating the Nature of 3D Generalization in Deep Neural Networks
Figure 3 for Investigating the Nature of 3D Generalization in Deep Neural Networks
Figure 4 for Investigating the Nature of 3D Generalization in Deep Neural Networks
Viaarxiv icon

Unifying Grokking and Double Descent

Add code
Bookmark button
Alert button
Mar 10, 2023
Xander Davies, Lauro Langosco, David Krueger

Figure 1 for Unifying Grokking and Double Descent
Figure 2 for Unifying Grokking and Double Descent
Figure 3 for Unifying Grokking and Double Descent
Figure 4 for Unifying Grokking and Double Descent
Viaarxiv icon

Blockwise Self-Supervised Learning at Scale

Add code
Bookmark button
Alert button
Feb 03, 2023
Shoaib Ahmed Siddiqui, David Krueger, Yann LeCun, Stéphane Deny

Figure 1 for Blockwise Self-Supervised Learning at Scale
Figure 2 for Blockwise Self-Supervised Learning at Scale
Figure 3 for Blockwise Self-Supervised Learning at Scale
Figure 4 for Blockwise Self-Supervised Learning at Scale
Viaarxiv icon

On The Fragility of Learned Reward Functions

Add code
Bookmark button
Alert button
Jan 09, 2023
Lev McKinney, Yawen Duan, David Krueger, Adam Gleave

Figure 1 for On The Fragility of Learned Reward Functions
Figure 2 for On The Fragility of Learned Reward Functions
Figure 3 for On The Fragility of Learned Reward Functions
Figure 4 for On The Fragility of Learned Reward Functions
Viaarxiv icon

Domain Generalization for Robust Model-Based Offline Reinforcement Learning

Add code
Bookmark button
Alert button
Nov 27, 2022
Alan Clark, Shoaib Ahmed Siddiqui, Robert Kirk, Usman Anwar, Stephen Chung, David Krueger

Figure 1 for Domain Generalization for Robust Model-Based Offline Reinforcement Learning
Figure 2 for Domain Generalization for Robust Model-Based Offline Reinforcement Learning
Figure 3 for Domain Generalization for Robust Model-Based Offline Reinforcement Learning
Figure 4 for Domain Generalization for Robust Model-Based Offline Reinforcement Learning
Viaarxiv icon

Mechanistic Mode Connectivity

Add code
Bookmark button
Alert button
Nov 15, 2022
Ekdeep Singh Lubana, Eric J. Bigelow, Robert P. Dick, David Krueger, Hidenori Tanaka

Figure 1 for Mechanistic Mode Connectivity
Figure 2 for Mechanistic Mode Connectivity
Figure 3 for Mechanistic Mode Connectivity
Figure 4 for Mechanistic Mode Connectivity
Viaarxiv icon

Broken Neural Scaling Laws

Add code
Bookmark button
Alert button
Nov 10, 2022
Ethan Caballero, Kshitij Gupta, Irina Rish, David Krueger

Figure 1 for Broken Neural Scaling Laws
Figure 2 for Broken Neural Scaling Laws
Figure 3 for Broken Neural Scaling Laws
Figure 4 for Broken Neural Scaling Laws
Viaarxiv icon