Picture for Xinchao Wang

Xinchao Wang

ViMU: Benchmarking Video Metaphorical Understanding

Add code
May 14, 2026
Viaarxiv icon

On-Policy Self-Evolution via Failure Trajectories for Agentic Safety Alignment

Add code
May 12, 2026
Viaarxiv icon

Vision-Language-Action Safety: Threats, Challenges, Evaluations, and Mechanisms

Add code
Apr 26, 2026
Viaarxiv icon

Hypergraph-State Collaborative Reasoning for Multi-Object Tracking

Add code
Apr 14, 2026
Viaarxiv icon

DMax: Aggressive Parallel Decoding for dLLMs

Add code
Apr 09, 2026
Viaarxiv icon

AutoMIA: Improved Baselines for Membership Inference Attack via Agentic Self-Exploration

Add code
Apr 01, 2026
Viaarxiv icon

Gated Condition Injection without Multimodal Attention: Towards Controllable Linear-Attention Transformers

Add code
Mar 29, 2026
Viaarxiv icon

Make Geometry Matter for Spatial Reasoning

Add code
Mar 27, 2026
Viaarxiv icon

Rethinking Token Reduction for Large Vision-Language Models

Add code
Mar 23, 2026
Viaarxiv icon

Anatomy of a Lie: A Multi-Stage Diagnostic Framework for Tracing Hallucinations in Vision-Language Models

Add code
Mar 16, 2026
Viaarxiv icon