Picture for Owain Evans

Owain Evans

Shammie

Me, Myself, and AI: The Situational Awareness Dataset (SAD) for LLMs

Add code
Jul 05, 2024
Viaarxiv icon

Connecting the Dots: LLMs can Infer and Verbalize Latent Structure from Disparate Training Data

Add code
Jun 20, 2024
Viaarxiv icon

Can Language Models Explain Their Own Classification Behavior?

Add code
May 13, 2024
Viaarxiv icon

Tell, don't show: Declarative facts influence how LLMs generalize

Add code
Dec 12, 2023
Viaarxiv icon

How to Catch an AI Liar: Lie Detection in Black-Box LLMs by Asking Unrelated Questions

Add code
Sep 26, 2023
Viaarxiv icon

The Reversal Curse: LLMs trained on "A is B" fail to learn "B is A"

Add code
Sep 22, 2023
Figure 1 for The Reversal Curse: LLMs trained on "A is B" fail to learn "B is A"
Figure 2 for The Reversal Curse: LLMs trained on "A is B" fail to learn "B is A"
Figure 3 for The Reversal Curse: LLMs trained on "A is B" fail to learn "B is A"
Figure 4 for The Reversal Curse: LLMs trained on "A is B" fail to learn "B is A"
Viaarxiv icon

Taken out of context: On measuring situational awareness in LLMs

Add code
Sep 01, 2023
Figure 1 for Taken out of context: On measuring situational awareness in LLMs
Figure 2 for Taken out of context: On measuring situational awareness in LLMs
Figure 3 for Taken out of context: On measuring situational awareness in LLMs
Figure 4 for Taken out of context: On measuring situational awareness in LLMs
Viaarxiv icon

Forecasting Future World Events with Neural Networks

Add code
Jun 30, 2022
Figure 1 for Forecasting Future World Events with Neural Networks
Figure 2 for Forecasting Future World Events with Neural Networks
Figure 3 for Forecasting Future World Events with Neural Networks
Figure 4 for Forecasting Future World Events with Neural Networks
Viaarxiv icon

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

Add code
Jun 10, 2022
Viaarxiv icon

Teaching Models to Express Their Uncertainty in Words

Add code
May 28, 2022
Figure 1 for Teaching Models to Express Their Uncertainty in Words
Figure 2 for Teaching Models to Express Their Uncertainty in Words
Figure 3 for Teaching Models to Express Their Uncertainty in Words
Figure 4 for Teaching Models to Express Their Uncertainty in Words
Viaarxiv icon