Alert button
Picture for Hongfa Wang

Hongfa Wang

Alert button

Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts

Mar 13, 2024
Yue Ma, Yingqing He, Hongfa Wang, Andong Wang, Chenyang Qi, Chengfei Cai, Xiu Li, Zhifeng Li, Heung-Yeung Shum, Wei Liu, Qifeng Chen

Viaarxiv icon

Inverse-like Antagonistic Scene Text Spotting via Reading-Order Estimation and Dynamic Sampling

Jan 08, 2024
Shi-Xue Zhang, Chun Yang, Xiaobin Zhu, Hongyang Zhou, Hongfa Wang, Xu-Cheng Yin

Viaarxiv icon

Global and Local Semantic Completion Learning for Vision-Language Pre-training

Jun 12, 2023
Rong-Cheng Tu, Yatai Ji, Jie Jiang, Weijie Kong, Chengfei Cai, Wenzhe Zhao, Hongfa Wang, Yujiu Yang, Wei Liu

Figure 1 for Global and Local Semantic Completion Learning for Vision-Language Pre-training
Figure 2 for Global and Local Semantic Completion Learning for Vision-Language Pre-training
Figure 3 for Global and Local Semantic Completion Learning for Vision-Language Pre-training
Figure 4 for Global and Local Semantic Completion Learning for Vision-Language Pre-training
Viaarxiv icon

Img2Vec: A Teacher of High Token-Diversity Helps Masked AutoEncoders

Apr 25, 2023
Heng Pan, Chenyang Liu, Wenxiao Wang, Li Yuan, Hongfa Wang, Zhifeng Li, Wei Liu

Figure 1 for Img2Vec: A Teacher of High Token-Diversity Helps Masked AutoEncoders
Figure 2 for Img2Vec: A Teacher of High Token-Diversity Helps Masked AutoEncoders
Figure 3 for Img2Vec: A Teacher of High Token-Diversity Helps Masked AutoEncoders
Figure 4 for Img2Vec: A Teacher of High Token-Diversity Helps Masked AutoEncoders
Viaarxiv icon

Seeing What You Miss: Vision-Language Pre-training with Semantic Completion Learning

Nov 24, 2022
Yatai Ji, Rongcheng Tu, Jie Jiang, Weijie Kong, Chengfei Cai, Wenzhe Zhao, Hongfa Wang, Yujiu Yang, Wei Liu

Figure 1 for Seeing What You Miss: Vision-Language Pre-training with Semantic Completion Learning
Figure 2 for Seeing What You Miss: Vision-Language Pre-training with Semantic Completion Learning
Figure 3 for Seeing What You Miss: Vision-Language Pre-training with Semantic Completion Learning
Figure 4 for Seeing What You Miss: Vision-Language Pre-training with Semantic Completion Learning
Viaarxiv icon

MAP: Modality-Agnostic Uncertainty-Aware Vision-Language Pre-training Model

Oct 11, 2022
Yatai Ji, Junjie Wang, Yuan Gong, Lin Zhang, Yanru Zhu, Hongfa Wang, Jiaxing Zhang, Tetsuya Sakai, Yujiu Yang

Figure 1 for MAP: Modality-Agnostic Uncertainty-Aware Vision-Language Pre-training Model
Figure 2 for MAP: Modality-Agnostic Uncertainty-Aware Vision-Language Pre-training Model
Figure 3 for MAP: Modality-Agnostic Uncertainty-Aware Vision-Language Pre-training Model
Figure 4 for MAP: Modality-Agnostic Uncertainty-Aware Vision-Language Pre-training Model
Viaarxiv icon

Unsupervised Hashing with Semantic Concept Mining

Sep 23, 2022
Rong-Cheng Tu, Xian-Ling Mao, Kevin Qinghong Lin, Chengfei Cai, Weize Qin, Hongfa Wang, Wei Wei, Heyan Huang

Figure 1 for Unsupervised Hashing with Semantic Concept Mining
Figure 2 for Unsupervised Hashing with Semantic Concept Mining
Figure 3 for Unsupervised Hashing with Semantic Concept Mining
Figure 4 for Unsupervised Hashing with Semantic Concept Mining
Viaarxiv icon

Adaptive Perception Transformer for Temporal Action Localization

Aug 25, 2022
Yizheng Ouyang, Tianjin Zhang, Weibo Gu, Hongfa Wang, Liming Wang, Xiaojie Guo

Figure 1 for Adaptive Perception Transformer for Temporal Action Localization
Figure 2 for Adaptive Perception Transformer for Temporal Action Localization
Figure 3 for Adaptive Perception Transformer for Temporal Action Localization
Figure 4 for Adaptive Perception Transformer for Temporal Action Localization
Viaarxiv icon

DPTNet: A Dual-Path Transformer Architecture for Scene Text Detection

Aug 21, 2022
Jingyu Lin, Jie Jiang, Yan Yan, Chunchao Guo, Hongfa Wang, Wei Liu, Hanzi Wang

Figure 1 for DPTNet: A Dual-Path Transformer Architecture for Scene Text Detection
Figure 2 for DPTNet: A Dual-Path Transformer Architecture for Scene Text Detection
Figure 3 for DPTNet: A Dual-Path Transformer Architecture for Scene Text Detection
Figure 4 for DPTNet: A Dual-Path Transformer Architecture for Scene Text Detection
Viaarxiv icon

Boosting Multi-Modal E-commerce Attribute Value Extraction via Unified Learning Scheme and Dynamic Range Minimization

Jul 15, 2022
Mengyin Liu, Chao Zhu, Hongyu Gao, Weibo Gu, Hongfa Wang, Wei Liu, Xu-cheng Yin

Figure 1 for Boosting Multi-Modal E-commerce Attribute Value Extraction via Unified Learning Scheme and Dynamic Range Minimization
Figure 2 for Boosting Multi-Modal E-commerce Attribute Value Extraction via Unified Learning Scheme and Dynamic Range Minimization
Figure 3 for Boosting Multi-Modal E-commerce Attribute Value Extraction via Unified Learning Scheme and Dynamic Range Minimization
Figure 4 for Boosting Multi-Modal E-commerce Attribute Value Extraction via Unified Learning Scheme and Dynamic Range Minimization
Viaarxiv icon