Picture for Haozhe Qi

Haozhe Qi

AdaptToken: Entropy-based Adaptive Token Selection for MLLM Long Video Understanding

Add code
Mar 30, 2026
Viaarxiv icon

Loc3R-VLM: Language-based Localization and 3D Reasoning with Vision-Language Models

Add code
Mar 18, 2026
Viaarxiv icon

LLaVAction: evaluating and training multi-modal large language models for action recognition

Add code
Mar 24, 2025
Viaarxiv icon

HOISDF: Constraining 3D Hand-Object Pose Estimation with Global Signed Distance Fields

Add code
Feb 26, 2024
Figure 1 for HOISDF: Constraining 3D Hand-Object Pose Estimation with Global Signed Distance Fields
Figure 2 for HOISDF: Constraining 3D Hand-Object Pose Estimation with Global Signed Distance Fields
Figure 3 for HOISDF: Constraining 3D Hand-Object Pose Estimation with Global Signed Distance Fields
Figure 4 for HOISDF: Constraining 3D Hand-Object Pose Estimation with Global Signed Distance Fields
Viaarxiv icon

P2B: Point-to-Box Network for 3D Object Tracking in Point Clouds

Add code
May 28, 2020
Figure 1 for P2B: Point-to-Box Network for 3D Object Tracking in Point Clouds
Figure 2 for P2B: Point-to-Box Network for 3D Object Tracking in Point Clouds
Figure 3 for P2B: Point-to-Box Network for 3D Object Tracking in Point Clouds
Figure 4 for P2B: Point-to-Box Network for 3D Object Tracking in Point Clouds
Viaarxiv icon