Diyi Yang

How Johnny Can Persuade LLMs to Jailbreak Them: Rethinking Persuasion to Challenge AI Safety by Humanizing LLMs

Jan 23, 2024
Yi Zeng, Hongpeng Lin, Jingwen Zhang, Diyi Yang, Ruoxi Jia, Weiyan Shi

From Scroll to Misbelief: Modeling the Unobservable Susceptibility to Misinformation on Social Media

Nov 16, 2023
Yanchen Liu, Mingyu Derek Ma, Wenna Qin, Azure Zhou, Jiaao Chen, Weiyan Shi, Wei Wang, Diyi Yang

Grounding or Guesswork? Large Language Models are Presumptive Grounders

Nov 15, 2023
Omar Shaikh, Kristina Gligorić, Ashna Khetan, Matthias Gerstgrasser, Diyi Yang, Dan Jurafsky

A Material Lens on Coloniality in NLP

Nov 14, 2023
William Held, Camille Harris, Michael Best, Diyi Yang

Task-Agnostic Low-Rank Adapters for Unseen English Dialects

Nov 02, 2023
Zedian Xiao, William Held, Yanchen Liu, Diyi Yang

Unlearn What You Want to Forget: Efficient Unlearning for LLMs

Oct 31, 2023
Jiaao Chen, Diyi Yang

Impressions: Understanding Visual Semiotics and Aesthetic Impact

Oct 27, 2023
Julia Kruk, Caleb Ziems, Diyi Yang

CoAnnotating: Uncertainty-Guided Work Allocation between Human and Large Language Models for Data Annotation

Oct 24, 2023
Minzhi Li, Taiwei Shi, Caleb Ziems, Min-Yen Kan, Nancy F. Chen, Zhengyuan Liu, Diyi Yang

CoMPosT: Characterizing and Evaluating Caricature in LLM Simulations

Oct 17, 2023
Myra Cheng, Tiziano Piccardi, Diyi Yang
