Picture for Collin Burns

Collin Burns

Unsupervised Elicitation of Language Models

Add code
Jun 11, 2025
Figure 1 for Unsupervised Elicitation of Language Models
Figure 2 for Unsupervised Elicitation of Language Models
Figure 3 for Unsupervised Elicitation of Language Models
Figure 4 for Unsupervised Elicitation of Language Models
Viaarxiv icon

Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision

Add code
Dec 14, 2023
Figure 1 for Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision
Figure 2 for Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision
Figure 3 for Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision
Figure 4 for Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision
Viaarxiv icon

Discovering Latent Knowledge in Language Models Without Supervision

Add code
Dec 07, 2022
Figure 1 for Discovering Latent Knowledge in Language Models Without Supervision
Figure 2 for Discovering Latent Knowledge in Language Models Without Supervision
Figure 3 for Discovering Latent Knowledge in Language Models Without Supervision
Figure 4 for Discovering Latent Knowledge in Language Models Without Supervision
Viaarxiv icon

Measuring Coding Challenge Competence With APPS

Add code
May 27, 2021
Figure 1 for Measuring Coding Challenge Competence With APPS
Figure 2 for Measuring Coding Challenge Competence With APPS
Figure 3 for Measuring Coding Challenge Competence With APPS
Figure 4 for Measuring Coding Challenge Competence With APPS
Viaarxiv icon

CUAD: An Expert-Annotated NLP Dataset for Legal Contract Review

Add code
Mar 10, 2021
Figure 1 for CUAD: An Expert-Annotated NLP Dataset for Legal Contract Review
Figure 2 for CUAD: An Expert-Annotated NLP Dataset for Legal Contract Review
Figure 3 for CUAD: An Expert-Annotated NLP Dataset for Legal Contract Review
Figure 4 for CUAD: An Expert-Annotated NLP Dataset for Legal Contract Review
Viaarxiv icon

Limitations of Post-Hoc Feature Alignment for Robustness

Add code
Mar 10, 2021
Figure 1 for Limitations of Post-Hoc Feature Alignment for Robustness
Figure 2 for Limitations of Post-Hoc Feature Alignment for Robustness
Figure 3 for Limitations of Post-Hoc Feature Alignment for Robustness
Figure 4 for Limitations of Post-Hoc Feature Alignment for Robustness
Viaarxiv icon

Measuring Mathematical Problem Solving With the MATH Dataset

Add code
Mar 05, 2021
Figure 1 for Measuring Mathematical Problem Solving With the MATH Dataset
Figure 2 for Measuring Mathematical Problem Solving With the MATH Dataset
Figure 3 for Measuring Mathematical Problem Solving With the MATH Dataset
Figure 4 for Measuring Mathematical Problem Solving With the MATH Dataset
Viaarxiv icon

Measuring Massive Multitask Language Understanding

Add code
Sep 21, 2020
Figure 1 for Measuring Massive Multitask Language Understanding
Figure 2 for Measuring Massive Multitask Language Understanding
Figure 3 for Measuring Massive Multitask Language Understanding
Figure 4 for Measuring Massive Multitask Language Understanding
Viaarxiv icon

Aligning AI With Shared Human Values

Add code
Aug 05, 2020
Figure 1 for Aligning AI With Shared Human Values
Figure 2 for Aligning AI With Shared Human Values
Figure 3 for Aligning AI With Shared Human Values
Figure 4 for Aligning AI With Shared Human Values
Viaarxiv icon

Streaming Complexity of SVMs

Add code
Jul 07, 2020
Viaarxiv icon