Scene Graph Generation


Scene graph generation is the process of creating structured representations of scenes that capture the relationships between objects.

GraSP-VLA: Graph-based Symbolic Action Representation for Long-Horizon Planning with VLA Policies

Add code
Nov 06, 2025
Viaarxiv icon

HiGS: Hierarchical Generative Scene Framework for Multi-Step Associative Semantic Spatial Composition

Add code
Oct 31, 2025
Viaarxiv icon

PoSh: Using Scene Graphs To Guide LLMs-as-a-Judge For Detailed Image Descriptions

Add code
Oct 21, 2025
Viaarxiv icon

Lightweight Structured Multimodal Reasoning for Clinical Scene Understanding in Robotics

Add code
Sep 26, 2025
Viaarxiv icon

MesaTask: Towards Task-Driven Tabletop Scene Generation via 3D Spatial Reasoning

Add code
Sep 26, 2025
Viaarxiv icon

Causal Reasoning Elicits Controllable 3D Scene Generation

Add code
Sep 18, 2025
Viaarxiv icon

UML-CoT: Structured Reasoning and Planning with Unified Modeling Language for Robotic Room Cleaning

Add code
Sep 26, 2025
Viaarxiv icon

Measuring Epistemic Humility in Multimodal Large Language Models

Add code
Sep 11, 2025
Viaarxiv icon

SATURN: Autoregressive Image Generation Guided by Scene Graphs

Add code
Aug 20, 2025
Viaarxiv icon

TRKT: Weakly Supervised Dynamic Scene Graph Generation with Temporal-enhanced Relation-aware Knowledge Transferring

Add code
Aug 07, 2025
Viaarxiv icon