Picture for Collin Burns

Collin Burns

Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision

Add code
Dec 14, 2023
Viaarxiv icon

Discovering Latent Knowledge in Language Models Without Supervision

Add code
Dec 07, 2022
Figure 1 for Discovering Latent Knowledge in Language Models Without Supervision
Figure 2 for Discovering Latent Knowledge in Language Models Without Supervision
Figure 3 for Discovering Latent Knowledge in Language Models Without Supervision
Figure 4 for Discovering Latent Knowledge in Language Models Without Supervision
Viaarxiv icon

Measuring Coding Challenge Competence With APPS

Add code
May 27, 2021
Figure 1 for Measuring Coding Challenge Competence With APPS
Figure 2 for Measuring Coding Challenge Competence With APPS
Figure 3 for Measuring Coding Challenge Competence With APPS
Figure 4 for Measuring Coding Challenge Competence With APPS
Viaarxiv icon

CUAD: An Expert-Annotated NLP Dataset for Legal Contract Review

Add code
Mar 10, 2021
Figure 1 for CUAD: An Expert-Annotated NLP Dataset for Legal Contract Review
Figure 2 for CUAD: An Expert-Annotated NLP Dataset for Legal Contract Review
Figure 3 for CUAD: An Expert-Annotated NLP Dataset for Legal Contract Review
Figure 4 for CUAD: An Expert-Annotated NLP Dataset for Legal Contract Review
Viaarxiv icon

Limitations of Post-Hoc Feature Alignment for Robustness

Add code
Mar 10, 2021
Figure 1 for Limitations of Post-Hoc Feature Alignment for Robustness
Figure 2 for Limitations of Post-Hoc Feature Alignment for Robustness
Figure 3 for Limitations of Post-Hoc Feature Alignment for Robustness
Figure 4 for Limitations of Post-Hoc Feature Alignment for Robustness
Viaarxiv icon

Measuring Mathematical Problem Solving With the MATH Dataset

Add code
Mar 05, 2021
Figure 1 for Measuring Mathematical Problem Solving With the MATH Dataset
Figure 2 for Measuring Mathematical Problem Solving With the MATH Dataset
Figure 3 for Measuring Mathematical Problem Solving With the MATH Dataset
Figure 4 for Measuring Mathematical Problem Solving With the MATH Dataset
Viaarxiv icon

Measuring Massive Multitask Language Understanding

Add code
Sep 21, 2020
Figure 1 for Measuring Massive Multitask Language Understanding
Figure 2 for Measuring Massive Multitask Language Understanding
Figure 3 for Measuring Massive Multitask Language Understanding
Figure 4 for Measuring Massive Multitask Language Understanding
Viaarxiv icon

Aligning AI With Shared Human Values

Add code
Aug 05, 2020
Figure 1 for Aligning AI With Shared Human Values
Figure 2 for Aligning AI With Shared Human Values
Figure 3 for Aligning AI With Shared Human Values
Figure 4 for Aligning AI With Shared Human Values
Viaarxiv icon

Streaming Complexity of SVMs

Add code
Jul 07, 2020
Viaarxiv icon

Interpreting Black Box Models with Statistical Guarantees

Add code
Mar 29, 2019
Figure 1 for Interpreting Black Box Models with Statistical Guarantees
Figure 2 for Interpreting Black Box Models with Statistical Guarantees
Figure 3 for Interpreting Black Box Models with Statistical Guarantees
Figure 4 for Interpreting Black Box Models with Statistical Guarantees
Viaarxiv icon