Picture for Weiming Hu

Weiming Hu

Temporal Correlation Meets Embedding: Towards a 2nd Generation of JDE-based Real-Time Multi-Object Tracking

Add code
Jul 19, 2024
Viaarxiv icon

Animate3D: Animating Any 3D Model with Multi-view Video Diffusion

Add code
Jul 16, 2024
Viaarxiv icon

EA-VTR: Event-Aware Video-Text Retrieval

Add code
Jul 10, 2024
Viaarxiv icon

How to Make Cross Encoder a Good Teacher for Efficient Image-Text Retrieval?

Add code
Jul 10, 2024
Figure 1 for How to Make Cross Encoder a Good Teacher for Efficient Image-Text Retrieval?
Figure 2 for How to Make Cross Encoder a Good Teacher for Efficient Image-Text Retrieval?
Figure 3 for How to Make Cross Encoder a Good Teacher for Efficient Image-Text Retrieval?
Figure 4 for How to Make Cross Encoder a Good Teacher for Efficient Image-Text Retrieval?
Viaarxiv icon

Self-Training with Pseudo-Label Scorer for Aspect Sentiment Quad Prediction

Add code
Jun 26, 2024
Viaarxiv icon

Observation, Analysis, and Solution: Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-Training

Add code
Apr 18, 2024
Figure 1 for Observation, Analysis, and Solution: Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-Training
Figure 2 for Observation, Analysis, and Solution: Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-Training
Figure 3 for Observation, Analysis, and Solution: Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-Training
Figure 4 for Observation, Analysis, and Solution: Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-Training
Viaarxiv icon

STAG4D: Spatial-Temporal Anchored Generative 4D Gaussians

Add code
Mar 22, 2024
Figure 1 for STAG4D: Spatial-Temporal Anchored Generative 4D Gaussians
Figure 2 for STAG4D: Spatial-Temporal Anchored Generative 4D Gaussians
Figure 3 for STAG4D: Spatial-Temporal Anchored Generative 4D Gaussians
Figure 4 for STAG4D: Spatial-Temporal Anchored Generative 4D Gaussians
Viaarxiv icon

BEV2PR: BEV-Enhanced Visual Place Recognition with Structural Cues

Add code
Mar 11, 2024
Figure 1 for BEV2PR: BEV-Enhanced Visual Place Recognition with Structural Cues
Figure 2 for BEV2PR: BEV-Enhanced Visual Place Recognition with Structural Cues
Figure 3 for BEV2PR: BEV-Enhanced Visual Place Recognition with Structural Cues
Figure 4 for BEV2PR: BEV-Enhanced Visual Place Recognition with Structural Cues
Viaarxiv icon

PromptIQA: Boosting the Performance and Generalization for No-Reference Image Quality Assessment via Prompts

Add code
Mar 08, 2024
Figure 1 for PromptIQA: Boosting the Performance and Generalization for No-Reference Image Quality Assessment via Prompts
Figure 2 for PromptIQA: Boosting the Performance and Generalization for No-Reference Image Quality Assessment via Prompts
Figure 3 for PromptIQA: Boosting the Performance and Generalization for No-Reference Image Quality Assessment via Prompts
Figure 4 for PromptIQA: Boosting the Performance and Generalization for No-Reference Image Quality Assessment via Prompts
Viaarxiv icon

Semantics-enhanced Cross-modal Masked Image Modeling for Vision-Language Pre-training

Add code
Mar 01, 2024
Figure 1 for Semantics-enhanced Cross-modal Masked Image Modeling for Vision-Language Pre-training
Figure 2 for Semantics-enhanced Cross-modal Masked Image Modeling for Vision-Language Pre-training
Figure 3 for Semantics-enhanced Cross-modal Masked Image Modeling for Vision-Language Pre-training
Figure 4 for Semantics-enhanced Cross-modal Masked Image Modeling for Vision-Language Pre-training
Viaarxiv icon