Picture for Jiefeng Ma

Jiefeng Ma

SRFUND: A Multi-Granularity Hierarchical Structure Reconstruction Benchmark in Form Understanding

Add code
Jun 13, 2024
Figure 1 for SRFUND: A Multi-Granularity Hierarchical Structure Reconstruction Benchmark in Form Understanding
Figure 2 for SRFUND: A Multi-Granularity Hierarchical Structure Reconstruction Benchmark in Form Understanding
Figure 3 for SRFUND: A Multi-Granularity Hierarchical Structure Reconstruction Benchmark in Form Understanding
Figure 4 for SRFUND: A Multi-Granularity Hierarchical Structure Reconstruction Benchmark in Form Understanding
Viaarxiv icon

SEMv3: A Fast and Robust Approach to Table Separation Line Detection

Add code
May 20, 2024
Viaarxiv icon

A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition

Add code
Mar 07, 2024
Figure 1 for A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition
Figure 2 for A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition
Figure 3 for A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition
Figure 4 for A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition
Viaarxiv icon

Bidirectional Trained Tree-Structured Decoder for Handwritten Mathematical Expression Recognition

Add code
Dec 31, 2023
Viaarxiv icon

Hierarchical Audio-Visual Information Fusion with Multi-label Joint Decoding for MER 2023

Add code
Sep 11, 2023
Figure 1 for Hierarchical Audio-Visual Information Fusion with Multi-label Joint Decoding for MER 2023
Figure 2 for Hierarchical Audio-Visual Information Fusion with Multi-label Joint Decoding for MER 2023
Figure 3 for Hierarchical Audio-Visual Information Fusion with Multi-label Joint Decoding for MER 2023
Figure 4 for Hierarchical Audio-Visual Information Fusion with Multi-label Joint Decoding for MER 2023
Viaarxiv icon

Count, Decode and Fetch: A New Approach to Handwritten Chinese Character Error Correction

Add code
Jul 30, 2023
Figure 1 for Count, Decode and Fetch: A New Approach to Handwritten Chinese Character Error Correction
Figure 2 for Count, Decode and Fetch: A New Approach to Handwritten Chinese Character Error Correction
Figure 3 for Count, Decode and Fetch: A New Approach to Handwritten Chinese Character Error Correction
Figure 4 for Count, Decode and Fetch: A New Approach to Handwritten Chinese Character Error Correction
Viaarxiv icon

HRDoc: Dataset and Baseline Method Toward Hierarchical Reconstruction of Document Structures

Add code
Mar 24, 2023
Figure 1 for HRDoc: Dataset and Baseline Method Toward Hierarchical Reconstruction of Document Structures
Figure 2 for HRDoc: Dataset and Baseline Method Toward Hierarchical Reconstruction of Document Structures
Figure 3 for HRDoc: Dataset and Baseline Method Toward Hierarchical Reconstruction of Document Structures
Figure 4 for HRDoc: Dataset and Baseline Method Toward Hierarchical Reconstruction of Document Structures
Viaarxiv icon

SEMv2: Table Separation Line Detection Based on Conditional Convolution

Add code
Mar 08, 2023
Figure 1 for SEMv2: Table Separation Line Detection Based on Conditional Convolution
Figure 2 for SEMv2: Table Separation Line Detection Based on Conditional Convolution
Figure 3 for SEMv2: Table Separation Line Detection Based on Conditional Convolution
Figure 4 for SEMv2: Table Separation Line Detection Based on Conditional Convolution
Viaarxiv icon

GMN: Generative Multi-modal Network for Practical Document Information Extraction

Add code
Jul 11, 2022
Figure 1 for GMN: Generative Multi-modal Network for Practical Document Information Extraction
Figure 2 for GMN: Generative Multi-modal Network for Practical Document Information Extraction
Figure 3 for GMN: Generative Multi-modal Network for Practical Document Information Extraction
Figure 4 for GMN: Generative Multi-modal Network for Practical Document Information Extraction
Viaarxiv icon

Multimodal Pre-training Based on Graph Attention Network for Document Understanding

Add code
Mar 25, 2022
Figure 1 for Multimodal Pre-training Based on Graph Attention Network for Document Understanding
Figure 2 for Multimodal Pre-training Based on Graph Attention Network for Document Understanding
Figure 3 for Multimodal Pre-training Based on Graph Attention Network for Document Understanding
Figure 4 for Multimodal Pre-training Based on Graph Attention Network for Document Understanding
Viaarxiv icon