Alert button
Picture for Cong Yao

Cong Yao

Alert button

LayoutLLM: Layout Instruction Tuning with Large Language Models for Document Understanding

Add code
Bookmark button
Alert button
Apr 08, 2024
Chuwei Luo, Yufan Shen, Zhaoqing Zhu, Qi Zheng, Zhi Yu, Cong Yao

Viaarxiv icon

OmniParser: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition

Add code
Bookmark button
Alert button
Mar 28, 2024
Jianqiang Wan, Sibo Song, Wenwen Yu, Yuliang Liu, Wenqing Cheng, Fei Huang, Xiang Bai, Cong Yao, Zhibo Yang

Viaarxiv icon

HierCode: A Lightweight Hierarchical Codebook for Zero-shot Chinese Text Recognition

Add code
Bookmark button
Alert button
Mar 20, 2024
Yuyi Zhang, Yuanzhi Zhu, Dezhi Peng, Peirong Zhang, Zhenhua Yang, Zhibo Yang, Cong Yao, Lianwen Jin

Figure 1 for HierCode: A Lightweight Hierarchical Codebook for Zero-shot Chinese Text Recognition
Figure 2 for HierCode: A Lightweight Hierarchical Codebook for Zero-shot Chinese Text Recognition
Figure 3 for HierCode: A Lightweight Hierarchical Codebook for Zero-shot Chinese Text Recognition
Figure 4 for HierCode: A Lightweight Hierarchical Codebook for Zero-shot Chinese Text Recognition
Viaarxiv icon

LORE++: Logical Location Regression Network for Table Structure Recognition with Pre-training

Add code
Bookmark button
Alert button
Jan 03, 2024
Rujiao Long, Hangdi Xing, Zhibo Yang, Qi Zheng, Zhi Yu, Cong Yao, Fei Huang

Viaarxiv icon

FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning

Add code
Bookmark button
Alert button
Dec 19, 2023
Zhenhua Yang, Dezhi Peng, Yuxin Kong, Yuyi Zhang, Cong Yao, Lianwen Jin

Viaarxiv icon

DocXChain: A Powerful Open-Source Toolchain for Document Parsing and Beyond

Add code
Bookmark button
Alert button
Oct 19, 2023
Cong Yao

Viaarxiv icon

Vision Grid Transformer for Document Layout Analysis

Add code
Bookmark button
Alert button
Aug 29, 2023
Cheng Da, Chuwei Luo, Qi Zheng, Cong Yao

Figure 1 for Vision Grid Transformer for Document Layout Analysis
Figure 2 for Vision Grid Transformer for Document Layout Analysis
Figure 3 for Vision Grid Transformer for Document Layout Analysis
Figure 4 for Vision Grid Transformer for Document Layout Analysis
Viaarxiv icon

LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition

Add code
Bookmark button
Alert button
Aug 24, 2023
Changxu Cheng, Peng Wang, Cheng Da, Qi Zheng, Cong Yao

Figure 1 for LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition
Figure 2 for LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition
Figure 3 for LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition
Figure 4 for LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition
Viaarxiv icon

Multi-Granularity Prediction with Learnable Fusion for Scene Text Recognition

Add code
Bookmark button
Alert button
Jul 25, 2023
Cheng Da, Peng Wang, Cong Yao

Figure 1 for Multi-Granularity Prediction with Learnable Fusion for Scene Text Recognition
Figure 2 for Multi-Granularity Prediction with Learnable Fusion for Scene Text Recognition
Figure 3 for Multi-Granularity Prediction with Learnable Fusion for Scene Text Recognition
Figure 4 for Multi-Granularity Prediction with Learnable Fusion for Scene Text Recognition
Viaarxiv icon

Conditional Text Image Generation with Diffusion Models

Add code
Bookmark button
Alert button
Jun 19, 2023
Yuanzhi Zhu, Zhaohai Li, Tianwei Wang, Mengchao He, Cong Yao

Figure 1 for Conditional Text Image Generation with Diffusion Models
Figure 2 for Conditional Text Image Generation with Diffusion Models
Figure 3 for Conditional Text Image Generation with Diffusion Models
Figure 4 for Conditional Text Image Generation with Diffusion Models
Viaarxiv icon