Picture for Shijie Zhou

Shijie Zhou

SpatialStack: Layered Geometry-Language Fusion for 3D VLM Spatial Reasoning

Add code
Mar 28, 2026
Viaarxiv icon

Anticipatory Planning for Multimodal AI Agents

Add code
Mar 17, 2026
Viaarxiv icon

Egocentric World Model for Photorealistic Hand-Object Interaction Synthesis

Add code
Mar 13, 2026
Viaarxiv icon

RC-NF: Robot-Conditioned Normalizing Flow for Real-Time Anomaly Detection in Robotic Manipulation

Add code
Mar 11, 2026
Viaarxiv icon

ATLAS: A High-Difficulty, Multidisciplinary Benchmark for Frontier Scientific Reasoning

Add code
Nov 18, 2025
Viaarxiv icon

Simple Denoising Diffusion Language Models

Add code
Oct 27, 2025
Viaarxiv icon

MorphoSim: An Interactive, Controllable, and Editable Language-guided 4D World Simulator

Add code
Oct 05, 2025
Figure 1 for MorphoSim: An Interactive, Controllable, and Editable Language-guided 4D World Simulator
Figure 2 for MorphoSim: An Interactive, Controllable, and Editable Language-guided 4D World Simulator
Figure 3 for MorphoSim: An Interactive, Controllable, and Editable Language-guided 4D World Simulator
Figure 4 for MorphoSim: An Interactive, Controllable, and Editable Language-guided 4D World Simulator
Viaarxiv icon

SwarmAgentic: Towards Fully Automated Agentic System Generation via Swarm Intelligence

Add code
Jun 18, 2025
Viaarxiv icon

Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields

Add code
Mar 26, 2025
Viaarxiv icon

X-Dyna: Expressive Dynamic Human Image Animation

Add code
Jan 20, 2025
Viaarxiv icon