Tao Wu

Free Lunch for Unified Multimodal Models: Enhancing Generation via Reflective Rectification with Inherent Understanding

Apr 15, 2026
Via arXiv

The Second Challenge on Cross-Domain Few-Shot Object Detection at NTIRE 2026: Methods and Results

Apr 13, 2026
Via arXiv

System Design for Maintaining Internal State Consistency in Long-Horizon Robotic Tabletop Games

Mar 26, 2026
Via arXiv

MVPBench: A Multi-Video Perception Evaluation Benchmark for Multi-Modal Video Understanding

Mar 24, 2026
Via arXiv

SinGeo: Unlock Single Model's Potential for Robust Cross-View Geo-Localization

Mar 10, 2026
Via arXiv

GeodesicNVS: Probability Density Geodesic Flow Matching for Novel View Synthesis

Mar 01, 2026
Via arXiv

LLaDA2.1: Speeding Up Text Diffusion via Token Editing

Feb 09, 2026
Via arXiv

Spatially Generalizable Mobile Manipulation via Adaptive Experience Selection and Dynamic Imagination

Jan 21, 2026
Via arXiv

RoLID-11K: A Dashcam Dataset for Small-Object Roadside Litter Detection

Jan 01, 2026
Via arXiv

Seeing the Whole Picture: Distribution-Guided Data-Free Distillation for Semantic Segmentation

Dec 15, 2025
Via arXiv