Picture for Renshen Wang

Renshen Wang

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Add code
Mar 08, 2024
Viaarxiv icon

Text Reading Order in Uncontrolled Conditions by Sparse Graph Segmentation

Add code
May 04, 2023
Figure 1 for Text Reading Order in Uncontrolled Conditions by Sparse Graph Segmentation
Figure 2 for Text Reading Order in Uncontrolled Conditions by Sparse Graph Segmentation
Figure 3 for Text Reading Order in Uncontrolled Conditions by Sparse Graph Segmentation
Figure 4 for Text Reading Order in Uncontrolled Conditions by Sparse Graph Segmentation
Viaarxiv icon

FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction

Add code
May 04, 2023
Figure 1 for FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction
Figure 2 for FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction
Figure 3 for FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction
Figure 4 for FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction
Viaarxiv icon

FormNet: Structural Encoding beyond Sequential Modeling in Form Document Information Extraction

Add code
Mar 24, 2022
Figure 1 for FormNet: Structural Encoding beyond Sequential Modeling in Form Document Information Extraction
Figure 2 for FormNet: Structural Encoding beyond Sequential Modeling in Form Document Information Extraction
Figure 3 for FormNet: Structural Encoding beyond Sequential Modeling in Form Document Information Extraction
Figure 4 for FormNet: Structural Encoding beyond Sequential Modeling in Form Document Information Extraction
Viaarxiv icon

Unified Line and Paragraph Detection by Graph Convolutional Networks

Add code
Mar 17, 2022
Figure 1 for Unified Line and Paragraph Detection by Graph Convolutional Networks
Figure 2 for Unified Line and Paragraph Detection by Graph Convolutional Networks
Figure 3 for Unified Line and Paragraph Detection by Graph Convolutional Networks
Figure 4 for Unified Line and Paragraph Detection by Graph Convolutional Networks
Viaarxiv icon

ROPE: Reading Order Equivariant Positional Encoding for Graph-based Document Information Extraction

Add code
Jun 21, 2021
Figure 1 for ROPE: Reading Order Equivariant Positional Encoding for Graph-based Document Information Extraction
Figure 2 for ROPE: Reading Order Equivariant Positional Encoding for Graph-based Document Information Extraction
Figure 3 for ROPE: Reading Order Equivariant Positional Encoding for Graph-based Document Information Extraction
Figure 4 for ROPE: Reading Order Equivariant Positional Encoding for Graph-based Document Information Extraction
Viaarxiv icon

General-Purpose OCR Paragraph Identification by Graph Convolutional Neural Networks

Add code
Feb 01, 2021
Figure 1 for General-Purpose OCR Paragraph Identification by Graph Convolutional Neural Networks
Figure 2 for General-Purpose OCR Paragraph Identification by Graph Convolutional Neural Networks
Figure 3 for General-Purpose OCR Paragraph Identification by Graph Convolutional Neural Networks
Figure 4 for General-Purpose OCR Paragraph Identification by Graph Convolutional Neural Networks
Viaarxiv icon