Xiang Bai

Huazhong University of Science and Technology

ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer

Aug 20, 2023

SparseTrack: Multi-Object Tracking by Performing Scene Decomposition based on Pseudo-Depth

Jun 08, 2023

Looking and Listening: Audio Guided Text Recognition

Jun 06, 2023

ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images

Jun 05, 2023

SAM3D: Zero-Shot 3D Object Detection via Segment Anything Model

Jun 04, 2023

On the Hidden Mystery of OCR in Large Multimodal Models

May 13, 2023

Visual Information Extraction in the Wild: Practical Dataset and End-to-end Solution

May 12, 2023

Multi-Modal 3D Object Detection by Box Matching

May 12, 2023

A Large Cross-Modal Video Retrieval Dataset with Reading Comprehension

May 05, 2023

ICDAR 2023 Competition on Reading the Seal Title

Apr 24, 2023