Yufan Chen

Rethinking Video Human-Object Interaction: Set Prediction over Time for Unified Detection and Anticipation

Apr 12, 2026, via arXiv

IMPACT: A Dataset for Multi-Granularity Human Procedural Action Understanding in Industrial Assembly

Apr 12, 2026, via arXiv

Towards Multi-Source Domain Generalization for Sleep Staging with Noisy Labels

Apr 11, 2026, via arXiv

RHO: Robust Holistic OSM-Based Metric Cross-View Geo-Localization

Mar 29, 2026, via arXiv

Not an Obstacle for Dog, but a Hazard for Human: A Co-Ego Navigation System for Guide Dog Robots

Mar 20, 2026, via arXiv

InterEdit: Navigating Text-Guided Multi-Human 3D Motion Editing

Mar 13, 2026, via arXiv

DriveXQA: Cross-modal Visual Question Answering for Adverse Driving Scene Understanding

Mar 11, 2026, via arXiv

More than the Sum: Panorama-Language Models for Adverse Omni-Scenes

Mar 10, 2026, via arXiv

$M^2$-Occ: Resilient 3D Semantic Occupancy Prediction for Autonomous Driving with Incomplete Camera Inputs

Mar 10, 2026, via arXiv

SGR3 Model: Scene Graph Retrieval-Reasoning Model in 3D

Mar 04, 2026, via arXiv