Alert button
Picture for Allyson Ettinger

Allyson Ettinger

Alert button

When Hindsight is Not 20/20: Testing Limits on Reflective Thinking in Large Language Models

Add code
Bookmark button
Alert button
Apr 14, 2024
Yanhong Li, Chenghao Yang, Allyson Ettinger

Viaarxiv icon

Experimental Contexts Can Facilitate Robust Semantic Property Inference in Language Models, but Inconsistently

Add code
Bookmark button
Alert button
Jan 12, 2024
Kanishka Misra, Allyson Ettinger, Kyle Mahowald

Viaarxiv icon

The Generative AI Paradox: "What It Can Create, It May Not Understand"

Add code
Bookmark button
Alert button
Oct 31, 2023
Peter West, Ximing Lu, Nouha Dziri, Faeze Brahman, Linjie Li, Jena D. Hwang, Liwei Jiang, Jillian Fisher, Abhilasha Ravichander, Khyathi Chandu, Benjamin Newman, Pang Wei Koh, Allyson Ettinger, Yejin Choi

Figure 1 for The Generative AI Paradox: "What It Can Create, It May Not Understand"
Figure 2 for The Generative AI Paradox: "What It Can Create, It May Not Understand"
Figure 3 for The Generative AI Paradox: "What It Can Create, It May Not Understand"
Figure 4 for The Generative AI Paradox: "What It Can Create, It May Not Understand"
Viaarxiv icon

"You Are An Expert Linguistic Annotator": Limits of LLMs as Analyzers of Abstract Meaning Representation

Add code
Bookmark button
Alert button
Oct 26, 2023
Allyson Ettinger, Jena D. Hwang, Valentina Pyatkin, Chandra Bhagavatula, Yejin Choi

Figure 1 for "You Are An Expert Linguistic Annotator": Limits of LLMs as Analyzers of Abstract Meaning Representation
Figure 2 for "You Are An Expert Linguistic Annotator": Limits of LLMs as Analyzers of Abstract Meaning Representation
Figure 3 for "You Are An Expert Linguistic Annotator": Limits of LLMs as Analyzers of Abstract Meaning Representation
Figure 4 for "You Are An Expert Linguistic Annotator": Limits of LLMs as Analyzers of Abstract Meaning Representation
Viaarxiv icon

Can You Follow Me? Testing Situational Understanding in ChatGPT

Add code
Bookmark button
Alert button
Oct 24, 2023
Chenghao Yang, Allyson Ettinger

Viaarxiv icon

Faith and Fate: Limits of Transformers on Compositionality

Add code
Bookmark button
Alert button
Jun 01, 2023
Nouha Dziri, Ximing Lu, Melanie Sclar, Xiang Lorraine Li, Liwei Jiang, Bill Yuchen Lin, Peter West, Chandra Bhagavatula, Ronan Le Bras, Jena D. Hwang, Soumya Sanyal, Sean Welleck, Xiang Ren, Allyson Ettinger, Zaid Harchaoui, Yejin Choi

Figure 1 for Faith and Fate: Limits of Transformers on Compositionality
Figure 2 for Faith and Fate: Limits of Transformers on Compositionality
Figure 3 for Faith and Fate: Limits of Transformers on Compositionality
Figure 4 for Faith and Fate: Limits of Transformers on Compositionality
Viaarxiv icon

Counterfactual reasoning: Testing language models' understanding of hypothetical scenarios

Add code
Bookmark button
Alert button
May 26, 2023
Jiaxuan Li, Lang Yu, Allyson Ettinger

Figure 1 for Counterfactual reasoning: Testing language models' understanding of hypothetical scenarios
Figure 2 for Counterfactual reasoning: Testing language models' understanding of hypothetical scenarios
Figure 3 for Counterfactual reasoning: Testing language models' understanding of hypothetical scenarios
Figure 4 for Counterfactual reasoning: Testing language models' understanding of hypothetical scenarios
Viaarxiv icon

Counterfactual reasoning: Do language models need world knowledge for causal understanding?

Add code
Bookmark button
Alert button
Dec 06, 2022
Jiaxuan Li, Lang Yu, Allyson Ettinger

Figure 1 for Counterfactual reasoning: Do language models need world knowledge for causal understanding?
Figure 2 for Counterfactual reasoning: Do language models need world knowledge for causal understanding?
Figure 3 for Counterfactual reasoning: Do language models need world knowledge for causal understanding?
Figure 4 for Counterfactual reasoning: Do language models need world knowledge for causal understanding?
Viaarxiv icon

COMPS: Conceptual Minimal Pair Sentences for testing Property Knowledge and Inheritance in Pre-trained Language Models

Add code
Bookmark button
Alert button
Oct 06, 2022
Kanishka Misra, Julia Taylor Rayz, Allyson Ettinger

Figure 1 for COMPS: Conceptual Minimal Pair Sentences for testing Property Knowledge and Inheritance in Pre-trained Language Models
Figure 2 for COMPS: Conceptual Minimal Pair Sentences for testing Property Knowledge and Inheritance in Pre-trained Language Models
Figure 3 for COMPS: Conceptual Minimal Pair Sentences for testing Property Knowledge and Inheritance in Pre-trained Language Models
Figure 4 for COMPS: Conceptual Minimal Pair Sentences for testing Property Knowledge and Inheritance in Pre-trained Language Models
Viaarxiv icon

"No, they did not": Dialogue response dynamics in pre-trained language models

Add code
Bookmark button
Alert button
Oct 05, 2022
Sanghee J. Kim, Lang Yu, Allyson Ettinger

Figure 1 for "No, they did not": Dialogue response dynamics in pre-trained language models
Figure 2 for "No, they did not": Dialogue response dynamics in pre-trained language models
Figure 3 for "No, they did not": Dialogue response dynamics in pre-trained language models
Figure 4 for "No, they did not": Dialogue response dynamics in pre-trained language models
Viaarxiv icon