Picture for Shiguang Shan

Shiguang Shan

Component-Based Out-of-Distribution Detection

Add code
Apr 23, 2026
Viaarxiv icon

EgoMotion: Hierarchical Reasoning and Diffusion for Egocentric Vision-Language Motion Generation

Add code
Apr 21, 2026
Viaarxiv icon

Bidirectional Learning of Facial Action Units and Expressions via Structured Semantic Mapping across Heterogeneous Datasets

Add code
Apr 12, 2026
Viaarxiv icon

ACT Now: Preempting LVLM Hallucinations via Adaptive Context Integration

Add code
Apr 01, 2026
Viaarxiv icon

LensWalk: Agentic Video Understanding by Planning How You See in Videos

Add code
Mar 25, 2026
Viaarxiv icon

Neural Gate: Mitigating Privacy Risks in LVLMs via Neuron-Level Gradient Gating

Add code
Mar 13, 2026
Viaarxiv icon

What Makes VLMs Robust? Towards Reconciling Robustness and Accuracy in Vision-Language Models

Add code
Mar 13, 2026
Viaarxiv icon

INFACT: A Diagnostic Benchmark for Induced Faithfulness and Factuality Hallucinations in Video-LLMs

Add code
Mar 12, 2026
Viaarxiv icon

OSI: One-step Inversion Excels in Extracting Diffusion Watermarks

Add code
Feb 10, 2026
Viaarxiv icon

Contrastive Spectral Rectification: Test-Time Defense towards Zero-shot Adversarial Robustness of CLIP

Add code
Jan 27, 2026
Viaarxiv icon