Alert button
Picture for Maarten Sap

Maarten Sap

Alert button

Relying on the Unreliable: The Impact of Language Models' Reluctance to Express Uncertainty

Jan 12, 2024
Kaitlyn Zhou, Jena D. Hwang, Xiang Ren, Maarten Sap

Viaarxiv icon

Riveter: Measuring Power and Social Dynamics Between Entities

Dec 15, 2023
Maria Antoniak, Anjalie Field, Jimin Mun, Melanie Walsh, Lauren F. Klein, Maarten Sap

Viaarxiv icon

Where Do People Tell Stories Online? Story Detection Across Online Communities

Nov 16, 2023
Maria Antoniak, Joel Mire, Maarten Sap, Elliott Ash, Andrew Piper

Figure 1 for Where Do People Tell Stories Online? Story Detection Across Online Communities
Figure 2 for Where Do People Tell Stories Online? Story Detection Across Online Communities
Figure 3 for Where Do People Tell Stories Online? Story Detection Across Online Communities
Figure 4 for Where Do People Tell Stories Online? Story Detection Across Online Communities
Viaarxiv icon

Beyond Denouncing Hate: Strategies for Countering Implied Biases and Stereotypes in Language

Oct 31, 2023
Jimin Mun, Emily Allaway, Akhila Yerukola, Laura Vianna, Sarah-Jane Leslie, Maarten Sap

Viaarxiv icon

FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions

Oct 31, 2023
Hyunwoo Kim, Melanie Sclar, Xuhui Zhou, Ronan Le Bras, Gunhee Kim, Yejin Choi, Maarten Sap

Figure 1 for FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions
Figure 2 for FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions
Figure 3 for FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions
Figure 4 for FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions
Viaarxiv icon

Can LLMs Keep a Secret? Testing Privacy Implications of Language Models via Contextual Integrity Theory

Oct 27, 2023
Niloofar Mireshghallah, Hyunwoo Kim, Xuhui Zhou, Yulia Tsvetkov, Maarten Sap, Reza Shokri, Yejin Choi

Figure 1 for Can LLMs Keep a Secret? Testing Privacy Implications of Language Models via Contextual Integrity Theory
Figure 2 for Can LLMs Keep a Secret? Testing Privacy Implications of Language Models via Contextual Integrity Theory
Figure 3 for Can LLMs Keep a Secret? Testing Privacy Implications of Language Models via Contextual Integrity Theory
Figure 4 for Can LLMs Keep a Secret? Testing Privacy Implications of Language Models via Contextual Integrity Theory
Viaarxiv icon

SOTOPIA: Interactive Evaluation for Social Intelligence in Language Agents

Oct 18, 2023
Xuhui Zhou, Hao Zhu, Leena Mathur, Ruohong Zhang, Haofei Yu, Zhengyang Qi, Louis-Philippe Morency, Yonatan Bisk, Daniel Fried, Graham Neubig, Maarten Sap

Figure 1 for SOTOPIA: Interactive Evaluation for Social Intelligence in Language Agents
Figure 2 for SOTOPIA: Interactive Evaluation for Social Intelligence in Language Agents
Figure 3 for SOTOPIA: Interactive Evaluation for Social Intelligence in Language Agents
Figure 4 for SOTOPIA: Interactive Evaluation for Social Intelligence in Language Agents
Viaarxiv icon

Value Kaleidoscope: Engaging AI with Pluralistic Human Values, Rights, and Duties

Sep 02, 2023
Taylor Sorensen, Liwei Jiang, Jena Hwang, Sydney Levine, Valentina Pyatkin, Peter West, Nouha Dziri, Ximing Lu, Kavel Rao, Chandra Bhagavatula, Maarten Sap, John Tasioulas, Yejin Choi

Figure 1 for Value Kaleidoscope: Engaging AI with Pluralistic Human Values, Rights, and Duties
Figure 2 for Value Kaleidoscope: Engaging AI with Pluralistic Human Values, Rights, and Duties
Figure 3 for Value Kaleidoscope: Engaging AI with Pluralistic Human Values, Rights, and Duties
Figure 4 for Value Kaleidoscope: Engaging AI with Pluralistic Human Values, Rights, and Duties
Viaarxiv icon

COBRA Frames: Contextual Reasoning about Effects and Harms of Offensive Statements

Jun 09, 2023
Xuhui Zhou, Hao Zhu, Akhila Yerukola, Thomas Davidson, Jena D. Hwang, Swabha Swayamdipta, Maarten Sap

Figure 1 for COBRA Frames: Contextual Reasoning about Effects and Harms of Offensive Statements
Figure 2 for COBRA Frames: Contextual Reasoning about Effects and Harms of Offensive Statements
Figure 3 for COBRA Frames: Contextual Reasoning about Effects and Harms of Offensive Statements
Figure 4 for COBRA Frames: Contextual Reasoning about Effects and Harms of Offensive Statements
Viaarxiv icon