Picture for Lianwen Jin

Lianwen Jin

DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks

Add code
May 07, 2024
Figure 1 for DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks
Figure 2 for DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks
Figure 3 for DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks
Figure 4 for DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks
Viaarxiv icon

VimTS: A Unified Video and Image Text Spotter for Enhancing the Cross-domain Generalization

Add code
Apr 30, 2024
Figure 1 for VimTS: A Unified Video and Image Text Spotter for Enhancing the Cross-domain Generalization
Figure 2 for VimTS: A Unified Video and Image Text Spotter for Enhancing the Cross-domain Generalization
Figure 3 for VimTS: A Unified Video and Image Text Spotter for Enhancing the Cross-domain Generalization
Figure 4 for VimTS: A Unified Video and Image Text Spotter for Enhancing the Cross-domain Generalization
Viaarxiv icon

Bridging the Gap Between End-to-End and Two-Step Text Spotting

Add code
Apr 06, 2024
Viaarxiv icon

HierCode: A Lightweight Hierarchical Codebook for Zero-shot Chinese Text Recognition

Add code
Mar 20, 2024
Figure 1 for HierCode: A Lightweight Hierarchical Codebook for Zero-shot Chinese Text Recognition
Figure 2 for HierCode: A Lightweight Hierarchical Codebook for Zero-shot Chinese Text Recognition
Figure 3 for HierCode: A Lightweight Hierarchical Codebook for Zero-shot Chinese Text Recognition
Figure 4 for HierCode: A Lightweight Hierarchical Codebook for Zero-shot Chinese Text Recognition
Viaarxiv icon

DiffChat: Learning to Chat with Text-to-Image Synthesis Models for Interactive Image Creation

Add code
Mar 08, 2024
Figure 1 for DiffChat: Learning to Chat with Text-to-Image Synthesis Models for Interactive Image Creation
Figure 2 for DiffChat: Learning to Chat with Text-to-Image Synthesis Models for Interactive Image Creation
Figure 3 for DiffChat: Learning to Chat with Text-to-Image Synthesis Models for Interactive Image Creation
Figure 4 for DiffChat: Learning to Chat with Text-to-Image Synthesis Models for Interactive Image Creation
Viaarxiv icon

Datasets for Large Language Models: A Comprehensive Survey

Add code
Feb 28, 2024
Figure 1 for Datasets for Large Language Models: A Comprehensive Survey
Figure 2 for Datasets for Large Language Models: A Comprehensive Survey
Figure 3 for Datasets for Large Language Models: A Comprehensive Survey
Figure 4 for Datasets for Large Language Models: A Comprehensive Survey
Viaarxiv icon

An open dataset for oracle bone script recognition and decipherment

Add code
Jan 27, 2024
Viaarxiv icon

An open dataset for the evolution of oracle bone characters: EVOBC

Add code
Jan 23, 2024
Viaarxiv icon

SwinTextSpotter v2: Towards Better Synergy for Scene Text Spotting

Add code
Jan 15, 2024
Viaarxiv icon

PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction

Add code
Jan 07, 2024
Figure 1 for PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction
Figure 2 for PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction
Figure 3 for PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction
Figure 4 for PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction
Viaarxiv icon