Scene Graph Generation


Scene graph generation is the task of producing a structured representation of a scene, typically a graph whose nodes are the objects in the scene and whose edges are the relationships between them.
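
As a rough illustration of that definition, a scene graph can be stored as a set of objects plus (subject, predicate, object) triples. The sketch below is only a minimal example under that assumption; the names `SceneGraph`, `Relation`, `add_relation`, and `relations_of` are hypothetical and are not taken from any of the papers listed here.

```python
from dataclasses import dataclass, field
from typing import List, Set


@dataclass(frozen=True)
class Relation:
    """One directed edge of the graph: subject --predicate--> obj."""
    subject: str
    predicate: str
    obj: str


@dataclass
class SceneGraph:
    """Hypothetical container: objects are nodes, relations are labeled edges."""
    objects: Set[str] = field(default_factory=set)
    relations: List[Relation] = field(default_factory=list)

    def add_relation(self, subject: str, predicate: str, obj: str) -> None:
        # Register both endpoints as nodes, then record the labeled edge.
        self.objects.update((subject, obj))
        self.relations.append(Relation(subject, predicate, obj))

    def relations_of(self, name: str) -> List[Relation]:
        # Return every edge in which the named object takes part.
        return [r for r in self.relations if name in (r.subject, r.obj)]


# A toy tabletop scene, written by hand rather than predicted by a model.
graph = SceneGraph()
graph.add_relation("cup", "on", "table")
graph.add_relation("person", "holding", "cup")
```

Here `graph.relations_of("cup")` returns both edges, since the cup is the subject of one and the object of the other. A generation model would predict these triples from an image, video, or 3D input instead of having them entered manually.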

Lightweight Structured Multimodal Reasoning for Clinical Scene Understanding in Robotics

Sep 26, 2025, via arXiv

MesaTask: Towards Task-Driven Tabletop Scene Generation via 3D Spatial Reasoning

Sep 26, 2025, via arXiv

UML-CoT: Structured Reasoning and Planning with Unified Modeling Language for Robotic Room Cleaning

Sep 26, 2025, via arXiv

Causal Reasoning Elicits Controllable 3D Scene Generation

Sep 18, 2025, via arXiv

Measuring Epistemic Humility in Multimodal Large Language Models

Sep 11, 2025, via arXiv

Graph-Fused Vision-Language-Action for Policy Reasoning in Multi-Arm Robotic Manipulation

Sep 09, 2025, via arXiv

Easier Painting Than Thinking: Can Text-to-Image Models Set the Stage, but Not Direct the Play?

Sep 03, 2025, via arXiv

SATURN: Autoregressive Image Generation Guided by Scene Graphs

Aug 20, 2025, via arXiv

DrivingGaussian++: Towards Realistic Reconstruction and Editable Simulation for Surrounding Dynamic Driving Scenes

Aug 28, 2025, via arXiv

TRKT: Weakly Supervised Dynamic Scene Graph Generation with Temporal-enhanced Relation-aware Knowledge Transferring

Aug 07, 2025, via arXiv