Picture for Chao Du

Chao Du

Revisiting Backdoor Attacks against Large Vision-Language Models

Add code
Jun 27, 2024
Viaarxiv icon

Bootstrapping Language Models with DPO Implicit Rewards

Add code
Jun 14, 2024
Viaarxiv icon

Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs

Add code
Jun 13, 2024
Viaarxiv icon

Improved Few-Shot Jailbreaking Can Circumvent Aligned Language Models and Their Defenses

Add code
Jun 03, 2024
Viaarxiv icon

Improved Techniques for Optimization-Based Jailbreaking on Large Language Models

Add code
May 31, 2024
Figure 1 for Improved Techniques for Optimization-Based Jailbreaking on Large Language Models
Figure 2 for Improved Techniques for Optimization-Based Jailbreaking on Large Language Models
Figure 3 for Improved Techniques for Optimization-Based Jailbreaking on Large Language Models
Figure 4 for Improved Techniques for Optimization-Based Jailbreaking on Large Language Models
Viaarxiv icon

Graph Diffusion Policy Optimization

Add code
Feb 26, 2024
Figure 1 for Graph Diffusion Policy Optimization
Figure 2 for Graph Diffusion Policy Optimization
Figure 3 for Graph Diffusion Policy Optimization
Figure 4 for Graph Diffusion Policy Optimization
Viaarxiv icon

Your Large Language Model is Secretly a Fairness Proponent and You Should Prompt it Like One

Add code
Feb 19, 2024
Figure 1 for Your Large Language Model is Secretly a Fairness Proponent and You Should Prompt it Like One
Figure 2 for Your Large Language Model is Secretly a Fairness Proponent and You Should Prompt it Like One
Figure 3 for Your Large Language Model is Secretly a Fairness Proponent and You Should Prompt it Like One
Figure 4 for Your Large Language Model is Secretly a Fairness Proponent and You Should Prompt it Like One
Viaarxiv icon

Purifying Large Language Models by Ensembling a Small Language Model

Add code
Feb 19, 2024
Figure 1 for Purifying Large Language Models by Ensembling a Small Language Model
Figure 2 for Purifying Large Language Models by Ensembling a Small Language Model
Figure 3 for Purifying Large Language Models by Ensembling a Small Language Model
Figure 4 for Purifying Large Language Models by Ensembling a Small Language Model
Viaarxiv icon

Test-Time Backdoor Attacks on Multimodal Large Language Models

Add code
Feb 13, 2024
Figure 1 for Test-Time Backdoor Attacks on Multimodal Large Language Models
Figure 2 for Test-Time Backdoor Attacks on Multimodal Large Language Models
Figure 3 for Test-Time Backdoor Attacks on Multimodal Large Language Models
Figure 4 for Test-Time Backdoor Attacks on Multimodal Large Language Models
Viaarxiv icon

Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast

Add code
Feb 13, 2024
Figure 1 for Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast
Figure 2 for Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast
Figure 3 for Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast
Figure 4 for Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast
Viaarxiv icon