Picture for Cong Yao

Cong Yao

LORE++: Logical Location Regression Network for Table Structure Recognition with Pre-training

Add code
Jan 03, 2024
Viaarxiv icon

FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning

Add code
Dec 19, 2023
Viaarxiv icon

DocXChain: A Powerful Open-Source Toolchain for Document Parsing and Beyond

Add code
Oct 19, 2023
Figure 1 for DocXChain: A Powerful Open-Source Toolchain for Document Parsing and Beyond
Figure 2 for DocXChain: A Powerful Open-Source Toolchain for Document Parsing and Beyond
Figure 3 for DocXChain: A Powerful Open-Source Toolchain for Document Parsing and Beyond
Figure 4 for DocXChain: A Powerful Open-Source Toolchain for Document Parsing and Beyond
Viaarxiv icon

Vision Grid Transformer for Document Layout Analysis

Add code
Aug 29, 2023
Figure 1 for Vision Grid Transformer for Document Layout Analysis
Figure 2 for Vision Grid Transformer for Document Layout Analysis
Figure 3 for Vision Grid Transformer for Document Layout Analysis
Figure 4 for Vision Grid Transformer for Document Layout Analysis
Viaarxiv icon

LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition

Add code
Aug 24, 2023
Figure 1 for LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition
Figure 2 for LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition
Figure 3 for LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition
Figure 4 for LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition
Viaarxiv icon

Multi-Granularity Prediction with Learnable Fusion for Scene Text Recognition

Add code
Jul 25, 2023
Figure 1 for Multi-Granularity Prediction with Learnable Fusion for Scene Text Recognition
Figure 2 for Multi-Granularity Prediction with Learnable Fusion for Scene Text Recognition
Figure 3 for Multi-Granularity Prediction with Learnable Fusion for Scene Text Recognition
Figure 4 for Multi-Granularity Prediction with Learnable Fusion for Scene Text Recognition
Viaarxiv icon

Conditional Text Image Generation with Diffusion Models

Add code
Jun 19, 2023
Viaarxiv icon

GeoLayoutLM: Geometric Pre-training for Visual Information Extraction

Add code
Apr 21, 2023
Viaarxiv icon

Modeling Entities as Semantic Points for Visual Information Extraction in the Wild

Add code
Mar 29, 2023
Figure 1 for Modeling Entities as Semantic Points for Visual Information Extraction in the Wild
Figure 2 for Modeling Entities as Semantic Points for Visual Information Extraction in the Wild
Figure 3 for Modeling Entities as Semantic Points for Visual Information Extraction in the Wild
Figure 4 for Modeling Entities as Semantic Points for Visual Information Extraction in the Wild
Viaarxiv icon

LORE: Logical Location Regression Network for Table Structure Recognition

Add code
Mar 07, 2023
Figure 1 for LORE: Logical Location Regression Network for Table Structure Recognition
Figure 2 for LORE: Logical Location Regression Network for Table Structure Recognition
Figure 3 for LORE: Logical Location Regression Network for Table Structure Recognition
Figure 4 for LORE: Logical Location Regression Network for Table Structure Recognition
Viaarxiv icon