Pengyuan Lyu

StrucTexTv3: An Efficient Vision-Language Model for Text-rich Image Perception, Comprehension, and Beyond

Jun 04, 2024

Towards Unified Multi-granularity Text Detection with Interactive Attention

May 30, 2024

GridFormer: Towards Accurate Table Structure Recognition via Grid Prediction

Sep 26, 2023

Towards Robust Real-Time Scene Text Detection: From Semantic to Instance Representation Learning

Aug 14, 2023

ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images

Jun 05, 2023

Decoupling Recognition from Detection: Single Shot Self-Reliant Scene Text Spotter

Jul 18, 2022

MaskOCR: Text Recognition with Masked Encoder-Decoder Pretraining

Jun 01, 2022

PGNet: Real-time Arbitrarily-Shaped Text Spotting with Point Gathering Network

Apr 12, 2021

Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes

Aug 22, 2019

2D Attentional Irregular Scene Text Recognizer

Jun 13, 2019