Alert button
Picture for Zhiwen Gui

Zhiwen Gui

Alert button

Foot In The Door: Understanding Large Language Model Jailbreaking via Cognitive Psychology

Add code
Bookmark button
Alert button
Feb 24, 2024
Zhenhua Wang, Wei Xie, Baosheng Wang, Enze Wang, Zhiwen Gui, Shuoyoucheng Ma, Kai Chen

Viaarxiv icon

Self-Deception: Reverse Penetrating the Semantic Firewall of Large Language Models

Add code
Bookmark button
Alert button
Aug 25, 2023
Zhenhua Wang, Wei Xie, Kai Chen, Baosheng Wang, Zhiwen Gui, Enze Wang

Figure 1 for Self-Deception: Reverse Penetrating the Semantic Firewall of Large Language Models
Figure 2 for Self-Deception: Reverse Penetrating the Semantic Firewall of Large Language Models
Figure 3 for Self-Deception: Reverse Penetrating the Semantic Firewall of Large Language Models
Figure 4 for Self-Deception: Reverse Penetrating the Semantic Firewall of Large Language Models
Viaarxiv icon