Alert button
Picture for Deqiang Jiang

Deqiang Jiang

Alert button

HRVDA: High-Resolution Visual Document Assistant

Add code
Bookmark button
Alert button
Apr 10, 2024
Chaohu Liu, Kun Yin, Haoyu Cao, Xinghua Jiang, Xin Li, Yinsong Liu, Deqiang Jiang, Xing Sun, Linli Xu

Viaarxiv icon

Enhancing Visual Document Understanding with Contrastive Learning in Large Visual-Language Models

Add code
Bookmark button
Alert button
Feb 29, 2024
Xin Li, Yunfei Wu, Xinghua Jiang, Zhihao Guo, Mingming Gong, Haoyu Cao, Yinsong Liu, Deqiang Jiang, Xing Sun

Viaarxiv icon

A Challenger to GPT-4V? Early Explorations of Gemini in Visual Expertise

Add code
Bookmark button
Alert button
Dec 20, 2023
Chaoyou Fu, Renrui Zhang, Zihan Wang, Yubo Huang, Zhengye Zhang, Longtian Qiu, Gaoxiang Ye, Yunhang Shen, Mengdan Zhang, Peixian Chen, Sirui Zhao, Shaohui Lin, Deqiang Jiang, Di Yin, Peng Gao, Ke Li, Hongsheng Li, Xing Sun

Viaarxiv icon

Attention Where It Matters: Rethinking Visual Document Understanding with Selective Region Concentration

Add code
Bookmark button
Alert button
Sep 03, 2023
Haoyu Cao, Changcun Bao, Chaohu Liu, Huang Chen, Kun Yin, Hao Liu, Yinsong Liu, Deqiang Jiang, Xing Sun

Figure 1 for Attention Where It Matters: Rethinking Visual Document Understanding with Selective Region Concentration
Figure 2 for Attention Where It Matters: Rethinking Visual Document Understanding with Selective Region Concentration
Figure 3 for Attention Where It Matters: Rethinking Visual Document Understanding with Selective Region Concentration
Figure 4 for Attention Where It Matters: Rethinking Visual Document Understanding with Selective Region Concentration
Viaarxiv icon

Looking and Listening: Audio Guided Text Recognition

Add code
Bookmark button
Alert button
Jun 06, 2023
Wenwen Yu, Mingyu Liu, Biao Yang, Enming Zhang, Deqiang Jiang, Xing Sun, Yuliang Liu, Xiang Bai

Figure 1 for Looking and Listening: Audio Guided Text Recognition
Figure 2 for Looking and Listening: Audio Guided Text Recognition
Figure 3 for Looking and Listening: Audio Guided Text Recognition
Figure 4 for Looking and Listening: Audio Guided Text Recognition
Viaarxiv icon

Visual Information Extraction in the Wild: Practical Dataset and End-to-end Solution

Add code
Bookmark button
Alert button
May 12, 2023
Jianfeng Kuang, Wei Hua, Dingkang Liang, Mingkun Yang, Deqiang Jiang, Bo Ren, Yu Zhou, Xiang Bai

Figure 1 for Visual Information Extraction in the Wild: Practical Dataset and End-to-end Solution
Figure 2 for Visual Information Extraction in the Wild: Practical Dataset and End-to-end Solution
Figure 3 for Visual Information Extraction in the Wild: Practical Dataset and End-to-end Solution
Figure 4 for Visual Information Extraction in the Wild: Practical Dataset and End-to-end Solution
Viaarxiv icon

Grab What You Need: Rethinking Complex Table Structure Recognition with Flexible Components Deliberation

Add code
Bookmark button
Alert button
Mar 16, 2023
Hao Liu, Xin Li, Mingming Gong, Bing Liu, Yunfei Wu, Deqiang Jiang, Yinsong Liu, Xing Sun

Figure 1 for Grab What You Need: Rethinking Complex Table Structure Recognition with Flexible Components Deliberation
Figure 2 for Grab What You Need: Rethinking Complex Table Structure Recognition with Flexible Components Deliberation
Figure 3 for Grab What You Need: Rethinking Complex Table Structure Recognition with Flexible Components Deliberation
Figure 4 for Grab What You Need: Rethinking Complex Table Structure Recognition with Flexible Components Deliberation
Viaarxiv icon

Turning a CLIP Model into a Scene Text Detector

Add code
Bookmark button
Alert button
Mar 01, 2023
Wenwen Yu, Yuliang Liu, Wei Hua, Deqiang Jiang, Bo Ren, Xiang Bai

Figure 1 for Turning a CLIP Model into a Scene Text Detector
Figure 2 for Turning a CLIP Model into a Scene Text Detector
Figure 3 for Turning a CLIP Model into a Scene Text Detector
Figure 4 for Turning a CLIP Model into a Scene Text Detector
Viaarxiv icon

TaCo: Textual Attribute Recognition via Contrastive Learning

Add code
Bookmark button
Alert button
Aug 22, 2022
Chang Nie, Yiqing Hu, Yanqiu Qu, Hao Liu, Deqiang Jiang, Bo Ren

Figure 1 for TaCo: Textual Attribute Recognition via Contrastive Learning
Figure 2 for TaCo: Textual Attribute Recognition via Contrastive Learning
Figure 3 for TaCo: Textual Attribute Recognition via Contrastive Learning
Figure 4 for TaCo: Textual Attribute Recognition via Contrastive Learning
Viaarxiv icon

GMN: Generative Multi-modal Network for Practical Document Information Extraction

Add code
Bookmark button
Alert button
Jul 11, 2022
Haoyu Cao, Jiefeng Ma, Antai Guo, Yiqing Hu, Hao Liu, Deqiang Jiang, Yinsong Liu, Bo Ren

Figure 1 for GMN: Generative Multi-modal Network for Practical Document Information Extraction
Figure 2 for GMN: Generative Multi-modal Network for Practical Document Information Extraction
Figure 3 for GMN: Generative Multi-modal Network for Practical Document Information Extraction
Figure 4 for GMN: Generative Multi-modal Network for Practical Document Information Extraction
Viaarxiv icon