Alert button
Picture for Yejin Choi

Yejin Choi

Alert button

CULTURE-GEN: Revealing Global Cultural Perception in Language Models through Natural Language Prompting

Add code
Bookmark button
Alert button
Apr 16, 2024
Huihan Li, Liwei Jiang, Nouha Dziri, Xiang Ren, Yejin Choi

Viaarxiv icon

Foundational Challenges in Assuring Alignment and Safety of Large Language Models

Add code
Bookmark button
Alert button
Apr 15, 2024
Usman Anwar, Abulhair Saparov, Javier Rando, Daniel Paleka, Miles Turpin, Peter Hase, Ekdeep Singh Lubana, Erik Jenner, Stephen Casper, Oliver Sourbut, Benjamin L. Edelman, Zhaowei Zhang, Mario Günther, Anton Korinek, Jose Hernandez-Orallo, Lewis Hammond, Eric Bigelow, Alexander Pan, Lauro Langosco, Tomasz Korbak, Heidi Zhang, Ruiqi Zhong, Seán Ó hÉigeartaigh, Gabriel Recchia, Giulio Corsi, Alan Chan, Markus Anderljung, Lilian Edwards, Yoshua Bengio, Danqi Chen, Samuel Albanie, Tegan Maharaj, Jakob Foerster, Florian Tramer, He He, Atoosa Kasirzadeh, Yejin Choi, David Krueger

Viaarxiv icon

CulturalTeaming: AI-Assisted Interactive Red-Teaming for Challenging LLMs' (Lack of) Multicultural Knowledge

Add code
Bookmark button
Alert button
Apr 10, 2024
Yu Ying Chiu, Liwei Jiang, Maria Antoniak, Chan Young Park, Shuyue Stella Li, Mehar Bhatia, Sahithya Ravi, Yulia Tsvetkov, Vered Shwartz, Yejin Choi

Viaarxiv icon

Particip-AI: A Democratic Surveying Framework for Anticipating Future AI Use Cases, Harms and Benefits

Add code
Bookmark button
Alert button
Mar 21, 2024
Jimin Mun, Liwei Jiang, Jenny Liang, Inyoung Cheong, Nicole DeCario, Yejin Choi, Tadayoshi Kohno, Maarten Sap

Viaarxiv icon

RewardBench: Evaluating Reward Models for Language Modeling

Add code
Bookmark button
Alert button
Mar 20, 2024
Nathan Lambert, Valentina Pyatkin, Jacob Morrison, LJ Miranda, Bill Yuchen Lin, Khyathi Chandu, Nouha Dziri, Sachin Kumar, Tom Zick, Yejin Choi, Noah A. Smith, Hannaneh Hajishirzi

Figure 1 for RewardBench: Evaluating Reward Models for Language Modeling
Figure 2 for RewardBench: Evaluating Reward Models for Language Modeling
Figure 3 for RewardBench: Evaluating Reward Models for Language Modeling
Figure 4 for RewardBench: Evaluating Reward Models for Language Modeling
Viaarxiv icon

Information-Theoretic Distillation for Reference-less Summarization

Add code
Bookmark button
Alert button
Mar 20, 2024
Jaehun Jung, Ximing Lu, Liwei Jiang, Faeze Brahman, Peter West, Pang Wei Koh, Yejin Choi

Figure 1 for Information-Theoretic Distillation for Reference-less Summarization
Figure 2 for Information-Theoretic Distillation for Reference-less Summarization
Figure 3 for Information-Theoretic Distillation for Reference-less Summarization
Figure 4 for Information-Theoretic Distillation for Reference-less Summarization
Viaarxiv icon

Alpaca against Vicuna: Using LLMs to Uncover Memorization of LLMs

Add code
Bookmark button
Alert button
Mar 05, 2024
Aly M. Kassem, Omar Mahmoud, Niloofar Mireshghallah, Hyunwoo Kim, Yulia Tsvetkov, Yejin Choi, Sherif Saad, Santu Rana

Figure 1 for Alpaca against Vicuna: Using LLMs to Uncover Memorization of LLMs
Figure 2 for Alpaca against Vicuna: Using LLMs to Uncover Memorization of LLMs
Figure 3 for Alpaca against Vicuna: Using LLMs to Uncover Memorization of LLMs
Figure 4 for Alpaca against Vicuna: Using LLMs to Uncover Memorization of LLMs
Viaarxiv icon

Selective "Selective Prediction": Reducing Unnecessary Abstention in Vision-Language Reasoning

Add code
Bookmark button
Alert button
Feb 23, 2024
Tejas Srinivasan, Jack Hessel, Tanmay Gupta, Bill Yuchen Lin, Yejin Choi, Jesse Thomason, Khyathi Raghavi Chandu

Viaarxiv icon

Can LLMs Reason with Rules? Logic Scaffolding for Stress-Testing and Improving LLMs

Add code
Bookmark button
Alert button
Feb 18, 2024
Siyuan Wang, Zhongyu Wei, Yejin Choi, Xiang Ren

Viaarxiv icon

L3GO: Language Agents with Chain-of-3D-Thoughts for Generating Unconventional Objects

Add code
Bookmark button
Alert button
Feb 14, 2024
Yutaro Yamada, Khyathi Chandu, Yuchen Lin, Jack Hessel, Ilker Yildirim, Yejin Choi

Viaarxiv icon