Layout To Image Generation


Layout-to-image generation is the process of generating images from layout descriptions using deep learning techniques.

Fast-SAM3D: 3Dfy Anything in Images but Faster

Add code
Feb 05, 2026
Viaarxiv icon

Mitigating Long-Tail Bias via Prompt-Controlled Diffusion Augmentation

Add code
Feb 04, 2026
Viaarxiv icon

Interpretable Logical Anomaly Classification via Constraint Decomposition and Instruction Fine-Tuning

Add code
Feb 03, 2026
Viaarxiv icon

UniDriveDreamer: A Single-Stage Multimodal World Model for Autonomous Driving

Add code
Feb 02, 2026
Viaarxiv icon

PLACID: Identity-Preserving Multi-Object Compositing via Video Diffusion with Synthetic Trajectories

Add code
Jan 30, 2026
Viaarxiv icon

SimGraph: A Unified Framework for Scene Graph-Based Image Generation and Editing

Add code
Jan 29, 2026
Viaarxiv icon

Generation Enhances Understanding in Unified Multimodal Models via Multi-Representation Generation

Add code
Jan 29, 2026
Viaarxiv icon

Say Cheese! Detail-Preserving Portrait Collection Generation via Natural Language Edits

Add code
Jan 28, 2026
Viaarxiv icon

Everything in Its Place: Benchmarking Spatial Intelligence of Text-to-Image Models

Add code
Jan 29, 2026
Viaarxiv icon

NuiWorld: Exploring a Scalable Framework for End-to-End Controllable World Generation

Add code
Jan 27, 2026
Viaarxiv icon