Picture for Yifan Xu

Yifan Xu

SyncBreaker:Stage-Aware Multimodal Adversarial Attacks on Audio-Driven Talking Head Generation

Add code
Apr 09, 2026
Viaarxiv icon

Dialogue based Interactive Explanations for Safety Decisions in Human Robot Collaboration

Add code
Apr 07, 2026
Viaarxiv icon

Planning to Explore: Curiosity-Driven Planning for LLM Test Generation

Add code
Apr 06, 2026
Viaarxiv icon

Towards Domain-Generalized Open-Vocabulary Object Detection: A Progressive Domain-invariant Cross-modal Alignment Method

Add code
Mar 29, 2026
Viaarxiv icon

A Step Toward Federated Pretraining of Multimodal Large Language Models

Add code
Mar 25, 2026
Viaarxiv icon

KUKAloha: A General, Low-Cost, and Shared-Control based Teleoperation Framework for Construction Robot Arm

Add code
Mar 20, 2026
Viaarxiv icon

SafeCRS: Personalized Safety Alignment for LLM-Based Conversational Recommender Systems

Add code
Mar 03, 2026
Viaarxiv icon

WISER: Wider Search, Deeper Thinking, and Adaptive Fusion for Training-Free Zero-Shot Composed Image Retrieval

Add code
Feb 26, 2026
Viaarxiv icon

Tele-Omni: a Unified Multimodal Framework for Video Generation and Editing

Add code
Feb 10, 2026
Viaarxiv icon

Toward Cognitive Supersensing in Multimodal Large Language Model

Add code
Feb 02, 2026
Viaarxiv icon