Picture for Deqiang Jiang

Deqiang Jiang

Talk With Human-like Agents: Empathetic Dialogue Through Perceptible Acoustic Reception and Reaction

Add code
Jun 18, 2024
Figure 1 for Talk With Human-like Agents: Empathetic Dialogue Through Perceptible Acoustic Reception and Reaction
Figure 2 for Talk With Human-like Agents: Empathetic Dialogue Through Perceptible Acoustic Reception and Reaction
Figure 3 for Talk With Human-like Agents: Empathetic Dialogue Through Perceptible Acoustic Reception and Reaction
Figure 4 for Talk With Human-like Agents: Empathetic Dialogue Through Perceptible Acoustic Reception and Reaction
Viaarxiv icon

HRVDA: High-Resolution Visual Document Assistant

Add code
Apr 10, 2024
Figure 1 for HRVDA: High-Resolution Visual Document Assistant
Figure 2 for HRVDA: High-Resolution Visual Document Assistant
Figure 3 for HRVDA: High-Resolution Visual Document Assistant
Figure 4 for HRVDA: High-Resolution Visual Document Assistant
Viaarxiv icon

Enhancing Visual Document Understanding with Contrastive Learning in Large Visual-Language Models

Add code
Feb 29, 2024
Figure 1 for Enhancing Visual Document Understanding with Contrastive Learning in Large Visual-Language Models
Figure 2 for Enhancing Visual Document Understanding with Contrastive Learning in Large Visual-Language Models
Figure 3 for Enhancing Visual Document Understanding with Contrastive Learning in Large Visual-Language Models
Figure 4 for Enhancing Visual Document Understanding with Contrastive Learning in Large Visual-Language Models
Viaarxiv icon

A Challenger to GPT-4V? Early Explorations of Gemini in Visual Expertise

Add code
Dec 20, 2023
Viaarxiv icon

Attention Where It Matters: Rethinking Visual Document Understanding with Selective Region Concentration

Add code
Sep 03, 2023
Figure 1 for Attention Where It Matters: Rethinking Visual Document Understanding with Selective Region Concentration
Figure 2 for Attention Where It Matters: Rethinking Visual Document Understanding with Selective Region Concentration
Figure 3 for Attention Where It Matters: Rethinking Visual Document Understanding with Selective Region Concentration
Figure 4 for Attention Where It Matters: Rethinking Visual Document Understanding with Selective Region Concentration
Viaarxiv icon

Looking and Listening: Audio Guided Text Recognition

Add code
Jun 06, 2023
Figure 1 for Looking and Listening: Audio Guided Text Recognition
Figure 2 for Looking and Listening: Audio Guided Text Recognition
Figure 3 for Looking and Listening: Audio Guided Text Recognition
Figure 4 for Looking and Listening: Audio Guided Text Recognition
Viaarxiv icon

Visual Information Extraction in the Wild: Practical Dataset and End-to-end Solution

Add code
May 12, 2023
Figure 1 for Visual Information Extraction in the Wild: Practical Dataset and End-to-end Solution
Figure 2 for Visual Information Extraction in the Wild: Practical Dataset and End-to-end Solution
Figure 3 for Visual Information Extraction in the Wild: Practical Dataset and End-to-end Solution
Figure 4 for Visual Information Extraction in the Wild: Practical Dataset and End-to-end Solution
Viaarxiv icon

Grab What You Need: Rethinking Complex Table Structure Recognition with Flexible Components Deliberation

Add code
Mar 16, 2023
Figure 1 for Grab What You Need: Rethinking Complex Table Structure Recognition with Flexible Components Deliberation
Figure 2 for Grab What You Need: Rethinking Complex Table Structure Recognition with Flexible Components Deliberation
Figure 3 for Grab What You Need: Rethinking Complex Table Structure Recognition with Flexible Components Deliberation
Figure 4 for Grab What You Need: Rethinking Complex Table Structure Recognition with Flexible Components Deliberation
Viaarxiv icon

Turning a CLIP Model into a Scene Text Detector

Add code
Mar 01, 2023
Figure 1 for Turning a CLIP Model into a Scene Text Detector
Figure 2 for Turning a CLIP Model into a Scene Text Detector
Figure 3 for Turning a CLIP Model into a Scene Text Detector
Figure 4 for Turning a CLIP Model into a Scene Text Detector
Viaarxiv icon

TaCo: Textual Attribute Recognition via Contrastive Learning

Add code
Aug 22, 2022
Figure 1 for TaCo: Textual Attribute Recognition via Contrastive Learning
Figure 2 for TaCo: Textual Attribute Recognition via Contrastive Learning
Figure 3 for TaCo: Textual Attribute Recognition via Contrastive Learning
Figure 4 for TaCo: Textual Attribute Recognition via Contrastive Learning
Viaarxiv icon