Document Layout Analysis


Document layout analysis (DLA) is the process of analyzing a document's spatial arrangement of content to understand its structure and layout. This includes identifying the location of text, tables, images, and other elements as well as the overall structure, such as headings and subheadings. DLA helps in extracting and categorizing information and automating document processing workflows.

Unsupervised Document and Template Clustering using Multimodal Embeddings

Add code
Jun 13, 2025
Viaarxiv icon

MM-R5: MultiModal Reasoning-Enhanced ReRanker via Reinforcement Learning for Document Retrieval

Add code
Jun 14, 2025
Viaarxiv icon

Digitization of Document and Information Extraction using OCR

Add code
Jun 11, 2025
Viaarxiv icon

Creating a Historical Migration Dataset from Finnish Church Records, 1800-1920

Add code
Jun 09, 2025
Viaarxiv icon

MonkeyOCR: Document Parsing with a Structure-Recognition-Relation Triplet Paradigm

Add code
Jun 05, 2025
Viaarxiv icon

SCAN: Semantic Document Layout Analysis for Textual and Visual Retrieval-Augmented Generation

Add code
May 20, 2025
Viaarxiv icon

A document processing pipeline for the construction of a dataset for topic modeling based on the judgments of the Italian Supreme Court

Add code
May 13, 2025
Viaarxiv icon

Document Image Rectification Bases on Self-Adaptive Multitask Fusion

Add code
May 09, 2025
Viaarxiv icon

DIMT25@ICDAR2025: HW-TSC's End-to-End Document Image Machine Translation System Leveraging Large Vision-Language Model

Add code
Apr 24, 2025
Viaarxiv icon

DocSAM: Unified Document Image Segmentation via Query Decomposition and Heterogeneous Mixed Learning

Add code
Apr 05, 2025
Viaarxiv icon