Hantao Zhou

RoboGolf: Mastering Real-World Minigolf with a Reflective Multi-Modality Vision-Language Model

Jun 14, 2024
Via arXiv

UniQA: Unified Vision-Language Pre-training for Image Quality and Aesthetic Assessment

Jun 03, 2024
Via arXiv

Video Object Segmentation with Dynamic Query Modulation

Mar 18, 2024
Via arXiv

UniHead: Unifying Multi-Perception for Detection Heads

Sep 23, 2023
Via arXiv

SemanticAC: Semantics-Assisted Framework for Audio Classification

Feb 12, 2023
Via arXiv