Jiefeng Ma

A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition

Mar 07, 2024
Yusheng Dai, Hang Chen, Jun Du, Ruoyu Wang, Shihao Chen, Jiefeng Ma, Haotian Wang, Chin-Hui Lee

Bidirectional Trained Tree-Structured Decoder for Handwritten Mathematical Expression Recognition

Dec 31, 2023
Hanbo Cheng, Chenyu Liu, Pengfei Hu, Zhenrong Zhang, Jiefeng Ma, Jun Du

Hierarchical Audio-Visual Information Fusion with Multi-label Joint Decoding for MER 2023

Sep 11, 2023
Haotian Wang, Yuxuan Xi, Hang Chen, Jun Du, Yan Song, Qing Wang, Hengshun Zhou, Chenxi Wang, Jiefeng Ma, Pengfei Hu, Ya Jiang, Shi Cheng, Jie Zhang, Yuzhe Weng

Count, Decode and Fetch: A New Approach to Handwritten Chinese Character Error Correction

Jul 30, 2023
Pengfei Hu, Jiefeng Ma, Zhenrong Zhang, Jun Du, Jianshu Zhang

HRDoc: Dataset and Baseline Method Toward Hierarchical Reconstruction of Document Structures

Mar 24, 2023
Jiefeng Ma, Jun Du, Pengfei Hu, Zhenrong Zhang, Jianshu Zhang, Huihui Zhu, Cong Liu

SEMv2: Table Separation Line Detection Based on Conditional Convolution

Mar 08, 2023
Zhenrong Zhang, Pengfei Hu, Jiefeng Ma, Jun Du, Jianshu Zhang, Huihui Zhu, Baocai Yin, Bing Yin, Cong Liu

GMN: Generative Multi-modal Network for Practical Document Information Extraction

Jul 11, 2022
Haoyu Cao, Jiefeng Ma, Antai Guo, Yiqing Hu, Hao Liu, Deqiang Jiang, Yinsong Liu, Bo Ren

Multimodal Pre-training Based on Graph Attention Network for Document Understanding

Mar 25, 2022
Zhenrong Zhang, Jiefeng Ma, Jun Du, Licheng Wang, Jianshu Zhang
