Document Layout Analysis


Document layout analysis (DLA) is the process of analyzing a document's spatial arrangement of content to understand its structure and layout. This includes identifying the location of text, tables, images, and other elements as well as the overall structure, such as headings and subheadings. DLA helps in extracting and categorizing information and automating document processing workflows.

MathDoc: Benchmarking Structured Extraction and Active Refusal on Noisy Mathematics Exam Papers

Add code
Jan 15, 2026
Viaarxiv icon

PARL: Position-Aware Relation Learning Network for Document Layout Analysis

Add code
Jan 12, 2026
Viaarxiv icon

IndicDLP: A Foundational Dataset for Multi-Lingual and Multi-Domain Document Layout Parsing

Add code
Dec 23, 2025
Viaarxiv icon

Layout-Aware Text Editing for Efficient Transformation of Academic PDFs to Markdown

Add code
Dec 19, 2025
Viaarxiv icon

Post-Processing Mask-Based Table Segmentation for Structural Coordinate Extraction

Add code
Dec 24, 2025
Viaarxiv icon

From Show Programmes to Data: Designing a Workflow to Make Performing Arts Ephemera Accessible Through Language Models

Add code
Dec 08, 2025
Figure 1 for From Show Programmes to Data: Designing a Workflow to Make Performing Arts Ephemera Accessible Through Language Models
Figure 2 for From Show Programmes to Data: Designing a Workflow to Make Performing Arts Ephemera Accessible Through Language Models
Figure 3 for From Show Programmes to Data: Designing a Workflow to Make Performing Arts Ephemera Accessible Through Language Models
Figure 4 for From Show Programmes to Data: Designing a Workflow to Make Performing Arts Ephemera Accessible Through Language Models
Viaarxiv icon

LLM-Guided Probabilistic Fusion for Label-Efficient Document Layout Analysis

Add code
Nov 13, 2025
Figure 1 for LLM-Guided Probabilistic Fusion for Label-Efficient Document Layout Analysis
Figure 2 for LLM-Guided Probabilistic Fusion for Label-Efficient Document Layout Analysis
Figure 3 for LLM-Guided Probabilistic Fusion for Label-Efficient Document Layout Analysis
Figure 4 for LLM-Guided Probabilistic Fusion for Label-Efficient Document Layout Analysis
Viaarxiv icon

MonkeyOCR v1.5 Technical Report: Unlocking Robust Document Parsing for Complex Patterns

Add code
Nov 16, 2025
Viaarxiv icon

OmniLayout: Enabling Coarse-to-Fine Learning with LLMs for Universal Document Layout Generation

Add code
Oct 30, 2025
Viaarxiv icon

Moving Pictures of Thought: Extracting Visual Knowledge in Charles S. Peirce's Manuscripts with Vision-Language Models

Add code
Nov 17, 2025
Viaarxiv icon