Picture for Muhao Chen

Muhao Chen

Are Large Language Models Capable of Generating Human-Level Narratives?

Add code
Jul 18, 2024
Viaarxiv icon

CLIMB: A Benchmark of Clinical Bias in Large Language Models

Add code
Jul 07, 2024
Viaarxiv icon

Securing Multi-turn Conversational Language Models Against Distributed Backdoor Triggers

Add code
Jul 04, 2024
Viaarxiv icon

From Introspection to Best Practices: Principled Analysis of Demonstrations in Multimodal In-Context Learning

Add code
Jul 01, 2024
Viaarxiv icon

FamiCom: Further Demystifying Prompts for Language Models with Task-Agnostic Performance Estimation

Add code
Jun 17, 2024
Figure 1 for FamiCom: Further Demystifying Prompts for Language Models with Task-Agnostic Performance Estimation
Figure 2 for FamiCom: Further Demystifying Prompts for Language Models with Task-Agnostic Performance Estimation
Figure 3 for FamiCom: Further Demystifying Prompts for Language Models with Task-Agnostic Performance Estimation
Figure 4 for FamiCom: Further Demystifying Prompts for Language Models with Task-Agnostic Performance Estimation
Viaarxiv icon

mDPO: Conditional Preference Optimization for Multimodal Large Language Models

Add code
Jun 17, 2024
Figure 1 for mDPO: Conditional Preference Optimization for Multimodal Large Language Models
Figure 2 for mDPO: Conditional Preference Optimization for Multimodal Large Language Models
Figure 3 for mDPO: Conditional Preference Optimization for Multimodal Large Language Models
Figure 4 for mDPO: Conditional Preference Optimization for Multimodal Large Language Models
Viaarxiv icon

MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding

Add code
Jun 13, 2024
Figure 1 for MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding
Figure 2 for MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding
Figure 3 for MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding
Figure 4 for MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding
Viaarxiv icon

Assessing LLMs for Zero-shot Abstractive Summarization Through the Lens of Relevance Paraphrasing

Add code
Jun 06, 2024
Viaarxiv icon

Visual-RolePlay: Universal Jailbreak Attack on MultiModal Large Language Models via Role-playing Image Characte

Add code
May 25, 2024
Viaarxiv icon

Red Teaming Language Models for Contradictory Dialogues

Add code
May 17, 2024
Viaarxiv icon