Alert button
Picture for Jacob Pfau

Jacob Pfau

Alert button

Self-Consistency of Large Language Models under Ambiguity

Add code
Bookmark button
Alert button
Oct 20, 2023
Henning Bartsch, Ole Jorgensen, Domenic Rosati, Jason Hoelscher-Obermaier, Jacob Pfau

Viaarxiv icon

Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback

Add code
Bookmark button
Alert button
Jul 27, 2023
Stephen Casper, Xander Davies, Claudia Shi, Thomas Krendl Gilbert, Jérémy Scheurer, Javier Rando, Rachel Freedman, Tomasz Korbak, David Lindner, Pedro Freire, Tony Wang, Samuel Marks, Charbel-Raphaël Segerie, Micah Carroll, Andi Peng, Phillip Christoffersen, Mehul Damani, Stewart Slocum, Usman Anwar, Anand Siththaranjan, Max Nadeau, Eric J. Michaud, Jacob Pfau, Dmitrii Krasheninnikov, Xin Chen, Lauro Langosco, Peter Hase, Erdem Bıyık, Anca Dragan, David Krueger, Dorsa Sadigh, Dylan Hadfield-Menell

Figure 1 for Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Figure 2 for Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Figure 3 for Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Figure 4 for Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Viaarxiv icon

Objective Robustness in Deep Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 08, 2021
Jack Koch, Lauro Langosco, Jacob Pfau, James Le, Lee Sharkey

Figure 1 for Objective Robustness in Deep Reinforcement Learning
Figure 2 for Objective Robustness in Deep Reinforcement Learning
Figure 3 for Objective Robustness in Deep Reinforcement Learning
Figure 4 for Objective Robustness in Deep Reinforcement Learning
Viaarxiv icon

Robust Semantic Interpretability: Revisiting Concept Activation Vectors

Add code
Bookmark button
Alert button
Apr 06, 2021
Jacob Pfau, Albert T. Young, Jerome Wei, Maria L. Wei, Michael J. Keiser

Figure 1 for Robust Semantic Interpretability: Revisiting Concept Activation Vectors
Figure 2 for Robust Semantic Interpretability: Revisiting Concept Activation Vectors
Figure 3 for Robust Semantic Interpretability: Revisiting Concept Activation Vectors
Figure 4 for Robust Semantic Interpretability: Revisiting Concept Activation Vectors
Viaarxiv icon

Global Saliency: Aggregating Saliency Maps to Assess Dataset Artefact Bias

Add code
Bookmark button
Alert button
Oct 16, 2019
Jacob Pfau, Albert T. Young, Maria L. Wei, Michael J. Keiser

Figure 1 for Global Saliency: Aggregating Saliency Maps to Assess Dataset Artefact Bias
Figure 2 for Global Saliency: Aggregating Saliency Maps to Assess Dataset Artefact Bias
Figure 3 for Global Saliency: Aggregating Saliency Maps to Assess Dataset Artefact Bias
Figure 4 for Global Saliency: Aggregating Saliency Maps to Assess Dataset Artefact Bias
Viaarxiv icon