Picture for Weihong Ma

Weihong Ma

StrucTexTv3: An Efficient Vision-Language Model for Text-rich Image Perception, Comprehension, and Beyond

Add code
Jun 04, 2024
Figure 1 for StrucTexTv3: An Efficient Vision-Language Model for Text-rich Image Perception, Comprehension, and Beyond
Figure 2 for StrucTexTv3: An Efficient Vision-Language Model for Text-rich Image Perception, Comprehension, and Beyond
Figure 3 for StrucTexTv3: An Efficient Vision-Language Model for Text-rich Image Perception, Comprehension, and Beyond
Figure 4 for StrucTexTv3: An Efficient Vision-Language Model for Text-rich Image Perception, Comprehension, and Beyond
Viaarxiv icon

GridFormer: Towards Accurate Table Structure Recognition via Grid Prediction

Add code
Sep 26, 2023
Figure 1 for GridFormer: Towards Accurate Table Structure Recognition via Grid Prediction
Figure 2 for GridFormer: Towards Accurate Table Structure Recognition via Grid Prediction
Figure 3 for GridFormer: Towards Accurate Table Structure Recognition via Grid Prediction
Figure 4 for GridFormer: Towards Accurate Table Structure Recognition via Grid Prediction
Viaarxiv icon

Recognition of Handwritten Chinese Text by Segmentation: A Segment-annotation-free Approach

Add code
Jul 29, 2022
Figure 1 for Recognition of Handwritten Chinese Text by Segmentation: A Segment-annotation-free Approach
Figure 2 for Recognition of Handwritten Chinese Text by Segmentation: A Segment-annotation-free Approach
Figure 3 for Recognition of Handwritten Chinese Text by Segmentation: A Segment-annotation-free Approach
Figure 4 for Recognition of Handwritten Chinese Text by Segmentation: A Segment-annotation-free Approach
Viaarxiv icon

Look Closer to Supervise Better: One-Shot Font Generation via Component-Based Discriminator

Add code
Apr 30, 2022
Figure 1 for Look Closer to Supervise Better: One-Shot Font Generation via Component-Based Discriminator
Figure 2 for Look Closer to Supervise Better: One-Shot Font Generation via Component-Based Discriminator
Figure 3 for Look Closer to Supervise Better: One-Shot Font Generation via Component-Based Discriminator
Figure 4 for Look Closer to Supervise Better: One-Shot Font Generation via Component-Based Discriminator
Viaarxiv icon

Tag, Copy or Predict: A Unified Weakly-Supervised Learning Framework for Visual Information Extraction using Sequences

Add code
Jun 20, 2021
Figure 1 for Tag, Copy or Predict: A Unified Weakly-Supervised Learning Framework for Visual Information Extraction using Sequences
Figure 2 for Tag, Copy or Predict: A Unified Weakly-Supervised Learning Framework for Visual Information Extraction using Sequences
Figure 3 for Tag, Copy or Predict: A Unified Weakly-Supervised Learning Framework for Visual Information Extraction using Sequences
Figure 4 for Tag, Copy or Predict: A Unified Weakly-Supervised Learning Framework for Visual Information Extraction using Sequences
Viaarxiv icon

Towards an efficient framework for Data Extraction from Chart Images

Add code
May 05, 2021
Figure 1 for Towards an efficient framework for Data Extraction from Chart Images
Figure 2 for Towards an efficient framework for Data Extraction from Chart Images
Figure 3 for Towards an efficient framework for Data Extraction from Chart Images
Figure 4 for Towards an efficient framework for Data Extraction from Chart Images
Viaarxiv icon

Joint Layout Analysis, Character Detection and Recognition for Historical Document Digitization

Add code
Jul 14, 2020
Figure 1 for Joint Layout Analysis, Character Detection and Recognition for Historical Document Digitization
Figure 2 for Joint Layout Analysis, Character Detection and Recognition for Historical Document Digitization
Figure 3 for Joint Layout Analysis, Character Detection and Recognition for Historical Document Digitization
Figure 4 for Joint Layout Analysis, Character Detection and Recognition for Historical Document Digitization
Viaarxiv icon