Cong Yao

HierCode: A Lightweight Hierarchical Codebook for Zero-shot Chinese Text Recognition

Mar 20, 2024

LORE++: Logical Location Regression Network for Table Structure Recognition with Pre-training

Jan 03, 2024

FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning

Dec 19, 2023

DocXChain: A Powerful Open-Source Toolchain for Document Parsing and Beyond

Oct 19, 2023

Vision Grid Transformer for Document Layout Analysis

Aug 29, 2023

LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition

Aug 24, 2023

Multi-Granularity Prediction with Learnable Fusion for Scene Text Recognition

Jul 25, 2023

Conditional Text Image Generation with Diffusion Models

Jun 19, 2023

GeoLayoutLM: Geometric Pre-training for Visual Information Extraction

Apr 21, 2023

Modeling Entities as Semantic Points for Visual Information Extraction in the Wild

Mar 29, 2023