Document Layout Analysis


Document layout analysis (DLA) is the process of analyzing a document's spatial arrangement of content to understand its structure and layout. This includes identifying the location of text, tables, images, and other elements as well as the overall structure, such as headings and subheadings. DLA helps in extracting and categorizing information and automating document processing workflows.

Benchmarking Open-Source Layout Detection Models for Data Snapshot Extraction from Institutional Documents

Add code
Jun 04, 2026
Viaarxiv icon

End-to-End Text Line Detection and Ordering

Add code
Jun 02, 2026
Viaarxiv icon

PereStruct: Multimodal Semantic Assembly for Robust Historical Document Parsing

Add code
Jun 03, 2026
Viaarxiv icon

Multimodal Approaches for Visually-Rich Document Type Classification: A Comparative Analysis

Add code
Jun 01, 2026
Viaarxiv icon

Enginuity: A Dataset and Benchmark for Vision-Language Understanding of Engineering Diagrams

Add code
Jun 02, 2026
Viaarxiv icon

Dr. DocBench: A Comprehensive Benchmark for Expert-Level and Difficult Document Parsing

Add code
May 31, 2026
Viaarxiv icon

Can Retrieval Heads See Images? Multimodal Retrieval Heads in Long-Context Vision-Language Models

Add code
May 26, 2026
Viaarxiv icon

How Do Document Parsers Break? Auditing Structural Vulnerability in Document Intelligence

Add code
May 19, 2026
Viaarxiv icon

Structured Layout Priors for Robust Out-of-Distribution Visual Document Understanding

Add code
May 19, 2026
Viaarxiv icon

LLM-Augmented Semantic Steering of Text Embedding Projection Spaces

Add code
May 03, 2026
Viaarxiv icon