Picture for Hongfa Wang

Hongfa Wang

Refer to the report for detailed contributions

MAP: Modality-Agnostic Uncertainty-Aware Vision-Language Pre-training Model

Add code
Oct 11, 2022
Figure 1 for MAP: Modality-Agnostic Uncertainty-Aware Vision-Language Pre-training Model
Figure 2 for MAP: Modality-Agnostic Uncertainty-Aware Vision-Language Pre-training Model
Figure 3 for MAP: Modality-Agnostic Uncertainty-Aware Vision-Language Pre-training Model
Figure 4 for MAP: Modality-Agnostic Uncertainty-Aware Vision-Language Pre-training Model
Viaarxiv icon

Unsupervised Hashing with Semantic Concept Mining

Add code
Sep 23, 2022
Figure 1 for Unsupervised Hashing with Semantic Concept Mining
Figure 2 for Unsupervised Hashing with Semantic Concept Mining
Figure 3 for Unsupervised Hashing with Semantic Concept Mining
Figure 4 for Unsupervised Hashing with Semantic Concept Mining
Viaarxiv icon

Adaptive Perception Transformer for Temporal Action Localization

Add code
Aug 25, 2022
Figure 1 for Adaptive Perception Transformer for Temporal Action Localization
Figure 2 for Adaptive Perception Transformer for Temporal Action Localization
Figure 3 for Adaptive Perception Transformer for Temporal Action Localization
Figure 4 for Adaptive Perception Transformer for Temporal Action Localization
Viaarxiv icon

DPTNet: A Dual-Path Transformer Architecture for Scene Text Detection

Add code
Aug 21, 2022
Figure 1 for DPTNet: A Dual-Path Transformer Architecture for Scene Text Detection
Figure 2 for DPTNet: A Dual-Path Transformer Architecture for Scene Text Detection
Figure 3 for DPTNet: A Dual-Path Transformer Architecture for Scene Text Detection
Figure 4 for DPTNet: A Dual-Path Transformer Architecture for Scene Text Detection
Viaarxiv icon

Boosting Multi-Modal E-commerce Attribute Value Extraction via Unified Learning Scheme and Dynamic Range Minimization

Add code
Jul 15, 2022
Figure 1 for Boosting Multi-Modal E-commerce Attribute Value Extraction via Unified Learning Scheme and Dynamic Range Minimization
Figure 2 for Boosting Multi-Modal E-commerce Attribute Value Extraction via Unified Learning Scheme and Dynamic Range Minimization
Figure 3 for Boosting Multi-Modal E-commerce Attribute Value Extraction via Unified Learning Scheme and Dynamic Range Minimization
Figure 4 for Boosting Multi-Modal E-commerce Attribute Value Extraction via Unified Learning Scheme and Dynamic Range Minimization
Viaarxiv icon

Egocentric Video-Language Pretraining @ Ego4D Challenge 2022

Add code
Jul 04, 2022
Figure 1 for Egocentric Video-Language Pretraining @ Ego4D Challenge 2022
Figure 2 for Egocentric Video-Language Pretraining @ Ego4D Challenge 2022
Figure 3 for Egocentric Video-Language Pretraining @ Ego4D Challenge 2022
Figure 4 for Egocentric Video-Language Pretraining @ Ego4D Challenge 2022
Viaarxiv icon

Egocentric Video-Language Pretraining @ EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2022

Add code
Jul 04, 2022
Figure 1 for Egocentric Video-Language Pretraining @ EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2022
Figure 2 for Egocentric Video-Language Pretraining @ EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2022
Figure 3 for Egocentric Video-Language Pretraining @ EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2022
Figure 4 for Egocentric Video-Language Pretraining @ EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2022
Viaarxiv icon

Egocentric Video-Language Pretraining

Add code
Jun 03, 2022
Figure 1 for Egocentric Video-Language Pretraining
Figure 2 for Egocentric Video-Language Pretraining
Figure 3 for Egocentric Video-Language Pretraining
Figure 4 for Egocentric Video-Language Pretraining
Viaarxiv icon

HunYuan_tvr for Text-Video Retrivial

Add code
Apr 14, 2022
Figure 1 for HunYuan_tvr for Text-Video Retrivial
Figure 2 for HunYuan_tvr for Text-Video Retrivial
Figure 3 for HunYuan_tvr for Text-Video Retrivial
Figure 4 for HunYuan_tvr for Text-Video Retrivial
Viaarxiv icon

Deep Unsupervised Hashing with Latent Semantic Components

Add code
Mar 17, 2022
Figure 1 for Deep Unsupervised Hashing with Latent Semantic Components
Figure 2 for Deep Unsupervised Hashing with Latent Semantic Components
Figure 3 for Deep Unsupervised Hashing with Latent Semantic Components
Figure 4 for Deep Unsupervised Hashing with Latent Semantic Components
Viaarxiv icon