Layout To Image Generation


Layout-to-image generation is the task of synthesizing images from layout descriptions, typically labeled bounding boxes or segmentation masks that specify what appears where, using deep learning techniques.
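To make "layout description" concrete, here is a minimal sketch of one common form: a list of labeled bounding boxes rasterized into a class-label map that a generator could be conditioned on. The `LayoutBox` type, the `rasterize` helper, and the normalized-coordinate convention are all illustrative assumptions, not taken from any of the papers listed here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LayoutBox:
    """Hypothetical layout element: a class label plus a box in [0, 1] coordinates."""
    label: str
    x0: float
    y0: float
    x1: float
    y1: float


def rasterize(layout, width, height, background=0):
    """Paint each box onto a height x width grid of integer class ids.

    Later boxes overwrite earlier ones, so list order acts as z-order.
    Class ids are assigned in order of first appearance, starting at 1.
    """
    class_ids = {}
    grid = [[background] * width for _ in range(height)]
    for box in layout:
        cid = class_ids.setdefault(box.label, len(class_ids) + 1)
        c0, c1 = round(box.x0 * width), round(box.x1 * width)
        r0, r1 = round(box.y0 * height), round(box.y1 * height)
        for r in range(max(r0, 0), min(r1, height)):
            for c in range(max(c0, 0), min(c1, width)):
                grid[r][c] = cid
    return grid, class_ids


# Assumed example layout: sky across the top half, a tree in front of it.
layout = [
    LayoutBox("sky", 0.0, 0.0, 1.0, 0.5),
    LayoutBox("tree", 0.25, 0.25, 0.5, 1.0),
]
label_map, class_ids = rasterize(layout, width=8, height=4)
for row in label_map:
    print(row)
```

In practice a model would consume such a map (or the raw boxes) as conditioning alongside noise or a text prompt; the sketch only covers the layout side of that pipeline.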

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Sep 26, 2025 (via arXiv)

Layout Stroke Imitation: A Layout Guided Handwriting Stroke Generation for Style Imitation with Diffusion Model

Sep 19, 2025 (via arXiv)

FloorSAM: SAM-Guided Floorplan Reconstruction with Semantic-Geometric Fusion

Sep 19, 2025 (via arXiv)

SPATIALGEN: Layout-guided 3D Indoor Scene Generation

Sep 18, 2025 (via arXiv)

Causal Reasoning Elicits Controllable 3D Scene Generation

Sep 18, 2025 (via arXiv)

TextlessRAG: End-to-End Visual Document RAG by Speech Without Text

Sep 10, 2025 (via arXiv)

Visual-TableQA: Open-Domain Benchmark for Reasoning over Table Images

Sep 09, 2025 (via arXiv)

MEPG: Multi-Expert Planning and Generation for Compositionally-Rich Image Generation

Sep 04, 2025 (via arXiv)

Generative AI in Map-Making: A Technical Exploration and Its Implications for Cartographers

Aug 26, 2025 (via arXiv)

LSD-3D: Large-Scale 3D Driving Scene Generation with Geometry Grounding

Aug 26, 2025 (via arXiv)