Picture for Weiming Hu

Weiming Hu

OneDrive: Unified Multi-Paradigm Driving with Vision-Language-Action Models

Add code
Apr 20, 2026
Viaarxiv icon

SEATrack: Simple, Efficient, and Adaptive Multimodal Tracker

Add code
Apr 14, 2026
Viaarxiv icon

NTIRE 2026 The 3rd Restore Any Image Model (RAIM) Challenge: Professional Image Quality Assessment (Track 1)

Add code
Apr 14, 2026
Viaarxiv icon

Low-Data Supervised Adaptation Outperforms Prompting for Cloud Segmentation Under Domain Shift

Add code
Apr 10, 2026
Viaarxiv icon

Making MLLMs Blind: Adversarial Smuggling Attacks in MLLM Content Moderation

Add code
Apr 09, 2026
Viaarxiv icon

Beyond Semantic Search: Towards Referential Anchoring in Composed Image Retrieval

Add code
Apr 07, 2026
Viaarxiv icon

MMPhysVideo: Scaling Physical Plausibility in Video Generation via Joint Multimodal Modeling

Add code
Apr 03, 2026
Viaarxiv icon

Beyond Dataset Distillation: Lossless Dataset Concentration via Diffusion-Assisted Distribution Alignment

Add code
Mar 30, 2026
Viaarxiv icon

MI-DETR: A Strong Baseline for Moving Infrared Small Target Detection with Bio-Inspired Motion Integration

Add code
Mar 05, 2026
Viaarxiv icon

Arbitrary Ratio Feature Compression via Next Token Prediction

Add code
Feb 12, 2026
Viaarxiv icon