Xilin Chen

T2VAttack: Adversarial Attack on Text-to-Video Diffusion Models

Dec 30, 2025
Via arXiv

Towards Transferable Defense Against Malicious Image Edits

Dec 16, 2025
Via arXiv

Semantic Mismatch and Perceptual Degradation: A New Perspective on Image Editing Immunity

Dec 16, 2025
Via arXiv

Dual Attention Guided Defense Against Malicious Edits

Dec 16, 2025
Via arXiv

VisKnow: Constructing Visual Knowledge Base for Object Understanding

Dec 09, 2025
Via arXiv

VOPE: Revisiting Hallucination of Vision-Language Models in Voluntary Imagination Task

Nov 17, 2025
Via arXiv

GLip: A Global-Local Integrated Progressive Framework for Robust Visual Speech Recognition

Sep 19, 2025
Via arXiv

MoTE: Mixture of Ternary Experts for Memory-efficient Large Multimodal Models

Jun 17, 2025
Via arXiv

BitVLA: 1-bit Vision-Language-Action Models for Robotics Manipulation

Jun 09, 2025
Via arXiv

un$^2$CLIP: Improving CLIP's Visual Detail Capturing Ability via Inverting unCLIP

May 30, 2025
Via arXiv