Picture for Jérémy Scheurer

Jérémy Scheurer

Black-Box Access is Insufficient for Rigorous AI Audits

Add code
Jan 25, 2024
Figure 1 for Black-Box Access is Insufficient for Rigorous AI Audits
Figure 2 for Black-Box Access is Insufficient for Rigorous AI Audits
Figure 3 for Black-Box Access is Insufficient for Rigorous AI Audits
Viaarxiv icon

Technical Report: Large Language Models can Strategically Deceive their Users when Put Under Pressure

Add code
Nov 27, 2023
Figure 1 for Technical Report: Large Language Models can Strategically Deceive their Users when Put Under Pressure
Figure 2 for Technical Report: Large Language Models can Strategically Deceive their Users when Put Under Pressure
Figure 3 for Technical Report: Large Language Models can Strategically Deceive their Users when Put Under Pressure
Figure 4 for Technical Report: Large Language Models can Strategically Deceive their Users when Put Under Pressure
Viaarxiv icon

Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback

Add code
Jul 27, 2023
Figure 1 for Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Figure 2 for Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Figure 3 for Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Figure 4 for Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Viaarxiv icon

Training Language Models with Language Feedback at Scale

Add code
Apr 09, 2023
Figure 1 for Training Language Models with Language Feedback at Scale
Figure 2 for Training Language Models with Language Feedback at Scale
Figure 3 for Training Language Models with Language Feedback at Scale
Figure 4 for Training Language Models with Language Feedback at Scale
Viaarxiv icon

Improving Code Generation by Training with Natural Language Feedback

Add code
Mar 28, 2023
Figure 1 for Improving Code Generation by Training with Natural Language Feedback
Figure 2 for Improving Code Generation by Training with Natural Language Feedback
Figure 3 for Improving Code Generation by Training with Natural Language Feedback
Figure 4 for Improving Code Generation by Training with Natural Language Feedback
Viaarxiv icon

Few-shot Adaptation Works with UnpredicTable Data

Add code
Aug 08, 2022
Figure 1 for Few-shot Adaptation Works with UnpredicTable Data
Figure 2 for Few-shot Adaptation Works with UnpredicTable Data
Figure 3 for Few-shot Adaptation Works with UnpredicTable Data
Figure 4 for Few-shot Adaptation Works with UnpredicTable Data
Viaarxiv icon

Training Language Models with Natural Language Feedback

Add code
May 02, 2022
Figure 1 for Training Language Models with Natural Language Feedback
Figure 2 for Training Language Models with Natural Language Feedback
Figure 3 for Training Language Models with Natural Language Feedback
Figure 4 for Training Language Models with Natural Language Feedback
Viaarxiv icon

Instance-wise algorithm configuration with graph neural networks

Add code
Feb 10, 2022
Figure 1 for Instance-wise algorithm configuration with graph neural networks
Figure 2 for Instance-wise algorithm configuration with graph neural networks
Figure 3 for Instance-wise algorithm configuration with graph neural networks
Figure 4 for Instance-wise algorithm configuration with graph neural networks
Viaarxiv icon

Semantic Segmentation of Histopathological Slides for the Classification of Cutaneous Lymphoma and Eczema

Add code
Sep 10, 2020
Figure 1 for Semantic Segmentation of Histopathological Slides for the Classification of Cutaneous Lymphoma and Eczema
Figure 2 for Semantic Segmentation of Histopathological Slides for the Classification of Cutaneous Lymphoma and Eczema
Figure 3 for Semantic Segmentation of Histopathological Slides for the Classification of Cutaneous Lymphoma and Eczema
Figure 4 for Semantic Segmentation of Histopathological Slides for the Classification of Cutaneous Lymphoma and Eczema
Viaarxiv icon