Picture for Hanjun Li

Hanjun Li

VISA: Group-wise Visual Token Selection and Aggregation via Graph Summarization for Efficient MLLMs Inference

Add code
Aug 25, 2025
Viaarxiv icon

Multimodal Label Relevance Ranking via Reinforcement Learning

Add code
Jul 18, 2024
Viaarxiv icon

AdaFocus: Towards End-to-end Weakly Supervised Learning for Long-Video Action Understanding

Add code
Nov 28, 2023
Viaarxiv icon

Unified and Dynamic Graph for Temporal Character Grouping in Long Videos

Add code
Aug 29, 2023
Viaarxiv icon

D3G: Exploring Gaussian Prior for Temporal Sentence Grounding with Glance Annotation

Add code
Aug 08, 2023
Figure 1 for D3G: Exploring Gaussian Prior for Temporal Sentence Grounding with Glance Annotation
Figure 2 for D3G: Exploring Gaussian Prior for Temporal Sentence Grounding with Glance Annotation
Figure 3 for D3G: Exploring Gaussian Prior for Temporal Sentence Grounding with Glance Annotation
Figure 4 for D3G: Exploring Gaussian Prior for Temporal Sentence Grounding with Glance Annotation
Viaarxiv icon

Collaborative Noisy Label Cleaner: Learning Scene-aware Trailers for Multi-modal Highlight Detection in Movies

Add code
Mar 26, 2023
Figure 1 for Collaborative Noisy Label Cleaner: Learning Scene-aware Trailers for Multi-modal Highlight Detection in Movies
Figure 2 for Collaborative Noisy Label Cleaner: Learning Scene-aware Trailers for Multi-modal Highlight Detection in Movies
Figure 3 for Collaborative Noisy Label Cleaner: Learning Scene-aware Trailers for Multi-modal Highlight Detection in Movies
Figure 4 for Collaborative Noisy Label Cleaner: Learning Scene-aware Trailers for Multi-modal Highlight Detection in Movies
Viaarxiv icon

SIOD: Single Instance Annotated Per Category Per Image for Object Detection

Add code
Mar 30, 2022
Figure 1 for SIOD: Single Instance Annotated Per Category Per Image for Object Detection
Figure 2 for SIOD: Single Instance Annotated Per Category Per Image for Object Detection
Figure 3 for SIOD: Single Instance Annotated Per Category Per Image for Object Detection
Figure 4 for SIOD: Single Instance Annotated Per Category Per Image for Object Detection
Viaarxiv icon

Combined Depth Space based Architecture Search For Person Re-identification

Add code
Apr 09, 2021
Figure 1 for Combined Depth Space based Architecture Search For Person Re-identification
Figure 2 for Combined Depth Space based Architecture Search For Person Re-identification
Figure 3 for Combined Depth Space based Architecture Search For Person Re-identification
Figure 4 for Combined Depth Space based Architecture Search For Person Re-identification
Viaarxiv icon