Yunhong Wang

Collaborative Multi-Mode Pruning for Vision-Language Models

Apr 03, 2026
Via arXiv

Reasoning-Driven Anomaly Detection and Localization with Image-Level Supervision

Mar 28, 2026
Via arXiv

Uni-MDTrack: Learning Decoupled Memory and Dynamic States for Parameter-Efficient Visual Tracking in All Modality

Mar 15, 2026
Via arXiv

Memory-Guided View Refinement for Dynamic Human-in-the-loop EQA

Mar 10, 2026
Via arXiv

Xiaomi-Robotics-0: An Open-Sourced Vision-Language-Action Model with Real-Time Execution

Feb 13, 2026
Via arXiv

ResWorld: Temporal Residual World Model for End-to-End Autonomous Driving

Feb 11, 2026
Via arXiv

Beyond Open Vocabulary: Multimodal Prompting for Object Detection in Remote Sensing Images

Feb 02, 2026
Via arXiv

EntroCut: Entropy-Guided Adaptive Truncation for Efficient Chain-of-Thought Reasoning in Small-scale Large Reasoning Models

Jan 30, 2026
Via arXiv

Probe and Skip: Self-Predictive Token Skipping for Efficient Long-Context LLM Inference

Jan 19, 2026
Via arXiv

WorldPlay: Towards Long-Term Geometric Consistency for Real-Time Interactive World Modeling

Dec 16, 2025
Via arXiv