Cong Yao

HierCode: A Lightweight Hierarchical Codebook for Zero-shot Chinese Text Recognition

Mar 20, 2024
Yuyi Zhang, Yuanzhi Zhu, Dezhi Peng, Peirong Zhang, Zhenhua Yang, Zhibo Yang, Cong Yao, Lianwen Jin

LORE++: Logical Location Regression Network for Table Structure Recognition with Pre-training

Jan 03, 2024
Rujiao Long, Hangdi Xing, Zhibo Yang, Qi Zheng, Zhi Yu, Cong Yao, Fei Huang

FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning

Dec 19, 2023
Zhenhua Yang, Dezhi Peng, Yuxin Kong, Yuyi Zhang, Cong Yao, Lianwen Jin

DocXChain: A Powerful Open-Source Toolchain for Document Parsing and Beyond

Oct 19, 2023
Cong Yao

Vision Grid Transformer for Document Layout Analysis

Aug 29, 2023
Cheng Da, Chuwei Luo, Qi Zheng, Cong Yao

LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition

Aug 24, 2023
Changxu Cheng, Peng Wang, Cheng Da, Qi Zheng, Cong Yao

Multi-Granularity Prediction with Learnable Fusion for Scene Text Recognition

Jul 25, 2023
Cheng Da, Peng Wang, Cong Yao

Conditional Text Image Generation with Diffusion Models

Jun 19, 2023
Yuanzhi Zhu, Zhaohai Li, Tianwei Wang, Mengchao He, Cong Yao

GeoLayoutLM: Geometric Pre-training for Visual Information Extraction

Apr 21, 2023
Chuwei Luo, Changxu Cheng, Qi Zheng, Cong Yao

Modeling Entities as Semantic Points for Visual Information Extraction in the Wild

Mar 29, 2023
Zhibo Yang, Rujiao Long, Pengfei Wang, Sibo Song, Humen Zhong, Wenqing Cheng, Xiang Bai, Cong Yao
