Picture for Maarten Sap

Maarten Sap

Shammie

Rel-A.I.: An Interaction-Centered Approach To Measuring Human-LM Reliance

Add code
Jul 10, 2024
Viaarxiv icon

WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models

Add code
Jun 26, 2024
Viaarxiv icon

HEART-felt Narratives: Tracing Empathy and Narrative Style in Personal Stories with LLMs

Add code
May 27, 2024
Viaarxiv icon

PolygloToxicityPrompts: Multilingual Evaluation of Neural Toxic Degeneration in Large Language Models

Add code
May 15, 2024
Figure 1 for PolygloToxicityPrompts: Multilingual Evaluation of Neural Toxic Degeneration in Large Language Models
Figure 2 for PolygloToxicityPrompts: Multilingual Evaluation of Neural Toxic Degeneration in Large Language Models
Figure 3 for PolygloToxicityPrompts: Multilingual Evaluation of Neural Toxic Degeneration in Large Language Models
Figure 4 for PolygloToxicityPrompts: Multilingual Evaluation of Neural Toxic Degeneration in Large Language Models
Viaarxiv icon

Is the Pope Catholic? Yes, the Pope is Catholic. Generative Evaluation of Intent Resolution in LLMs

Add code
May 14, 2024
Viaarxiv icon

NORMAD: A Benchmark for Measuring the Cultural Adaptability of Large Language Models

Add code
Apr 18, 2024
Figure 1 for NORMAD: A Benchmark for Measuring the Cultural Adaptability of Large Language Models
Figure 2 for NORMAD: A Benchmark for Measuring the Cultural Adaptability of Large Language Models
Figure 3 for NORMAD: A Benchmark for Measuring the Cultural Adaptability of Large Language Models
Figure 4 for NORMAD: A Benchmark for Measuring the Cultural Adaptability of Large Language Models
Viaarxiv icon

Particip-AI: A Democratic Surveying Framework for Anticipating Future AI Use Cases, Harms and Benefits

Add code
Mar 21, 2024
Figure 1 for Particip-AI: A Democratic Surveying Framework for Anticipating Future AI Use Cases, Harms and Benefits
Figure 2 for Particip-AI: A Democratic Surveying Framework for Anticipating Future AI Use Cases, Harms and Benefits
Figure 3 for Particip-AI: A Democratic Surveying Framework for Anticipating Future AI Use Cases, Harms and Benefits
Figure 4 for Particip-AI: A Democratic Surveying Framework for Anticipating Future AI Use Cases, Harms and Benefits
Viaarxiv icon

SOTOPIA-$π$: Interactive Learning of Socially Intelligent Language Agents

Add code
Mar 14, 2024
Figure 1 for SOTOPIA-$π$: Interactive Learning of Socially Intelligent Language Agents
Figure 2 for SOTOPIA-$π$: Interactive Learning of Socially Intelligent Language Agents
Figure 3 for SOTOPIA-$π$: Interactive Learning of Socially Intelligent Language Agents
Figure 4 for SOTOPIA-$π$: Interactive Learning of Socially Intelligent Language Agents
Viaarxiv icon

Is this the real life? Is this just fantasy? The Misleading Success of Simulating Social Interactions With LLMs

Add code
Mar 08, 2024
Figure 1 for Is this the real life? Is this just fantasy? The Misleading Success of Simulating Social Interactions With LLMs
Figure 2 for Is this the real life? Is this just fantasy? The Misleading Success of Simulating Social Interactions With LLMs
Figure 3 for Is this the real life? Is this just fantasy? The Misleading Success of Simulating Social Interactions With LLMs
Figure 4 for Is this the real life? Is this just fantasy? The Misleading Success of Simulating Social Interactions With LLMs
Viaarxiv icon

Relying on the Unreliable: The Impact of Language Models' Reluctance to Express Uncertainty

Add code
Jan 12, 2024
Figure 1 for Relying on the Unreliable: The Impact of Language Models' Reluctance to Express Uncertainty
Figure 2 for Relying on the Unreliable: The Impact of Language Models' Reluctance to Express Uncertainty
Figure 3 for Relying on the Unreliable: The Impact of Language Models' Reluctance to Express Uncertainty
Figure 4 for Relying on the Unreliable: The Impact of Language Models' Reluctance to Express Uncertainty
Viaarxiv icon