Kanghao Chen

Panoramic Affordance Prediction

Mar 16, 2026

DVD: Deterministic Video Depth Estimation with Generative Priors

Mar 12, 2026

EventVGGT: Exploring Cross-Modal Distillation for Consistent Event-based Depth Estimation

Mar 10, 2026

A4-Agent: An Agentic Framework for Zero-Shot Affordance Reasoning

Dec 16, 2025

TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models

Nov 17, 2025

T-Rex-Omni: Integrating Negative Visual Prompt in Generic Object Detection

Nov 12, 2025

PhysToolBench: Benchmarking Physical Tool Understanding for MLLMs

Oct 10, 2025

Hierarchical Vision-Language Learning for Medical Out-of-Distribution Detection

Aug 25, 2025

DiMeR: Disentangled Mesh Reconstruction Model

Apr 24, 2025

Path-adaptive Spatio-Temporal State Space Model for Event-based Recognition with Arbitrary Duration

Sep 25, 2024