
Muhao Chen

University of California, Davis

Visual-RolePlay: Universal Jailbreak Attack on MultiModal Large Language Models via Role-playing Image Character

May 25, 2024

Red Teaming Language Models for Contradictory Dialogues

May 17, 2024

Offset Unlearning for Large Language Models

Apr 17, 2024

Planning and Editing What You Retrieve for Enhanced Tool Learning

Apr 04, 2024

Two Heads are Better than One: Nested PoE for Robust Defense Against Multi-Backdoors

Apr 02, 2024

Monotonic Paraphrasing Improves Generalization of Language Model Prompting

Mar 24, 2024

Training Small Multimodal Models to Bridge Biomedical Competency Gap: A Case Study in Radiology Imaging

Mar 20, 2024

AdaShield: Safeguarding Multimodal Large Language Models from Structure-based Attack via Adaptive Shield Prompting

Mar 14, 2024

X-Shot: A Unified System to Handle Frequent, Few-shot and Zero-shot Learning Simultaneously in Classification

Mar 06, 2024

Mitigating Fine-tuning Jailbreak Attack with Backdoor Enhanced Alignment

Feb 27, 2024