Zekun Li

Ehsan

Survey on AI-Generated Media Detection: From Non-MLLM to MLLM

Feb 07, 2025
Via arXiv

PersonalVideo: High ID-Fidelity Video Customization without Dynamic and Semantic Degradation

Nov 26, 2024
Via arXiv

Jailbreak Attacks and Defenses against Multimodal Generative Models: A Survey

Nov 14, 2024
Via arXiv

Correlation-Aware Select and Merge Attention for Efficient Fine-Tuning and Context Length Extension

Oct 05, 2024
Via arXiv

Can Editing LLMs Inject Harm?

Jul 29, 2024
Via arXiv

MMSci: A Multimodal Multi-Discipline Dataset for PhD-Level Scientific Comprehension

Jul 06, 2024
Via arXiv

MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding

Jun 13, 2024
Via arXiv

MMFakeBench: A Mixed-Source Multimodal Misinformation Detection Benchmark for LVLMs

Jun 13, 2024
Via arXiv

Localize, Understand, Collaborate: Semantic-Aware Dragging via Intention Reasoner

Jun 01, 2024
Via arXiv

FakeNewsGPT4: Advancing Multimodal Fake News Detection through Knowledge-Augmented LVLMs

Mar 04, 2024
Via arXiv