
Maarten Sap


FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions

Oct 25, 2023
Hyunwoo Kim, Melanie Sclar, Xuhui Zhou, Ronan Le Bras, Gunhee Kim, Yejin Choi, Maarten Sap

Via arXiv

SOTOPIA: Interactive Evaluation for Social Intelligence in Language Agents

Oct 18, 2023
Xuhui Zhou, Hao Zhu, Leena Mathur, Ruohong Zhang, Haofei Yu, Zhengyang Qi, Louis-Philippe Morency, Yonatan Bisk, Daniel Fried, Graham Neubig, Maarten Sap

Via arXiv

Value Kaleidoscope: Engaging AI with Pluralistic Human Values, Rights, and Duties

Sep 02, 2023
Taylor Sorensen, Liwei Jiang, Jena Hwang, Sydney Levine, Valentina Pyatkin, Peter West, Nouha Dziri, Ximing Lu, Kavel Rao, Chandra Bhagavatula, Maarten Sap, John Tasioulas, Yejin Choi

Via arXiv

COBRA Frames: Contextual Reasoning about Effects and Harms of Offensive Statements

Jun 09, 2023
Xuhui Zhou, Hao Zhu, Akhila Yerukola, Thomas Davidson, Jena D. Hwang, Swabha Swayamdipta, Maarten Sap

Via arXiv

NLPositionality: Characterizing Design Biases of Datasets and Models

Jun 02, 2023
Sebastin Santy, Jenny T. Liang, Ronan Le Bras, Katharina Reinecke, Maarten Sap

Via arXiv

From Dogwhistles to Bullhorns: Unveiling Coded Rhetoric with Language Models

May 26, 2023
Julia Mendelsohn, Ronan Le Bras, Yejin Choi, Maarten Sap

Via arXiv

Clever Hans or Neural Theory of Mind? Stress Testing Social Reasoning in Large Language Models

May 24, 2023
Natalie Shapira, Mosh Levy, Seyed Hossein Alavi, Xuhui Zhou, Yejin Choi, Yoav Goldberg, Maarten Sap, Vered Shwartz

Via arXiv

Don't Take This Out of Context! On the Need for Contextual Models and Evaluations for Stylistic Rewriting

May 24, 2023
Akhila Yerukola, Xuhui Zhou, Maarten Sap

Via arXiv

Improving Language Models with Advantage-based Offline Policy Gradients

May 24, 2023
Ashutosh Baheti, Ximing Lu, Faeze Brahman, Ronan Le Bras, Maarten Sap, Mark Riedl

Via arXiv

Modeling Empathic Similarity in Personal Narratives

May 23, 2023
Jocelyn Shen, Maarten Sap, Pedro Colon-Hernandez, Hae Won Park, Cynthia Breazeal

Via arXiv