Picture for Xiaobo Xia

Xiaobo Xia

AUHead: Realistic Emotional Talking Head Generation via Action Units Control

Add code
Feb 10, 2026
Viaarxiv icon

SPD-Faith Bench: Diagnosing and Improving Faithfulness in Chain-of-Thought for Multimodal Large Language Models

Add code
Feb 08, 2026
Viaarxiv icon

Do All Individual Layers Help? An Empirical Study of Task-Interfering Layers in Vision-Language Models

Add code
Feb 01, 2026
Viaarxiv icon

APEX: A Decoupled Memory-based Explorer for Asynchronous Aerial Object Goal Navigation

Add code
Jan 31, 2026
Viaarxiv icon

Inject Once Survive Later: Backdooring Vision-Language-Action Models to Persist Through Downstream Fine-tuning

Add code
Jan 31, 2026
Viaarxiv icon

Generalizable Multimodal Large Language Model Editing via Invariant Trajectory Learning

Add code
Jan 30, 2026
Viaarxiv icon

Lingua-SafetyBench: A Benchmark for Safety Evaluation of Multilingual Vision-Language Models

Add code
Jan 30, 2026
Viaarxiv icon

Positive-Unlabeled Reinforcement Learning Distillation for On-Premise Small Models

Add code
Jan 28, 2026
Viaarxiv icon

What Makes Low-Bit Quantization-Aware Training Work for Reasoning LLMs? A Systematic Study

Add code
Jan 21, 2026
Viaarxiv icon

Calibrated Multimodal Representation Learning with Missing Modalities

Add code
Nov 15, 2025
Figure 1 for Calibrated Multimodal Representation Learning with Missing Modalities
Figure 2 for Calibrated Multimodal Representation Learning with Missing Modalities
Figure 3 for Calibrated Multimodal Representation Learning with Missing Modalities
Figure 4 for Calibrated Multimodal Representation Learning with Missing Modalities
Viaarxiv icon