Yejin Choi

Trust No Bot: Discovering Personal Disclosures in Human-LLM Conversations in the Wild

Jul 16, 2024
Via arXiv

CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation

Jul 09, 2024
Via arXiv

Perceptions to Beliefs: Exploring Precursory Inferences for Theory of Mind in Large Language Models

Jul 09, 2024
Via arXiv

Certainly Uncertain: A Benchmark and Metric for Multimodal Epistemic and Aleatoric Awareness

Jul 02, 2024
Via arXiv

Multilingual Trolley Problems for Language Models

Jul 02, 2024
Via arXiv

How to Train Your Fact Verifier: Knowledge Transfer with Multimodal Open Models

Jun 29, 2024
Via arXiv

WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models

Jun 26, 2024
Via arXiv

WildGuard: Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs

Jun 26, 2024
Via arXiv

Modular Pluralism: Pluralistic Alignment via Multi-LLM Collaboration

Jun 22, 2024
Via arXiv

MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens

Jun 17, 2024
Via arXiv