Hamid Rezatofighi

Marginalized Generalized IoU (MGIoU): A Unified Objective Function for Optimizing Any Convex Parametric Shapes

Apr 24, 2025
Via arXiv

AerOSeg: Harnessing SAM for Open-Vocabulary Segmentation in Remote Sensing Images

Apr 12, 2025
Via arXiv

Mobile-VideoGPT: Fast and Accurate Video Understanding Language Model

Mar 27, 2025
Via arXiv

DWIM: Towards Tool-aware Visual Reasoning via Discrepancy-aware Workflow Generation & Instruct-Masking Tuning

Mar 25, 2025
Via arXiv

Hier-SLAM++: Neuro-Symbolic Semantic SLAM with a Hierarchically Categorical Gaussian Splatting

Feb 20, 2025
Via arXiv

Normal-GS: 3D Gaussian Splatting with Normal-Involved Rendering

Oct 27, 2024
Via arXiv

TFS-NeRF: Template-Free NeRF for Semantic 3D Reconstruction of Dynamic Scene

Sep 26, 2024
Via arXiv

NEUSIS: A Compositional Neuro-Symbolic Framework for Autonomous Perception, Reasoning, and Planning in Complex UAV Search Missions

Sep 16, 2024
Via arXiv

How Well Can Vision Language Models See Image Details?

Aug 07, 2024
Via arXiv

DrVideo: Document Retrieval Based Long Video Understanding

Jun 18, 2024
Via arXiv