Qi Zheng

ProcTag: Process Tagging for Assessing the Efficacy of Document Instruction Data

Jul 17, 2024

Advanced Payment Security System: XGBoost, CatBoost and SMOTE Integrated

Jun 07, 2024

AIS 2024 Challenge on Video Quality Assessment of User-Generated Content: Methods and Results

Apr 24, 2024

LayoutLLM: Layout Instruction Tuning with Large Language Models for Document Understanding

Apr 08, 2024

LORE++: Logical Location Regression Network for Table Structure Recognition with Pre-training

Jan 03, 2024

Vision Grid Transformer for Document Layout Analysis

Aug 29, 2023

LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition

Aug 24, 2023

GeoLayoutLM: Geometric Pre-training for Visual Information Extraction

Apr 21, 2023

ESceme: Vision-and-Language Navigation with Episodic Scene Memory

Mar 07, 2023

LORE: Logical Location Regression Network for Table Structure Recognition

Mar 07, 2023