Picture for Yinghao Xu

Yinghao Xu

JODA: Composable Joint Dynamics for Articulated Objects

Add code
May 11, 2026
Viaarxiv icon

Geometric Context Transformer for Streaming 3D Reconstruction

Add code
Apr 16, 2026
Viaarxiv icon

SceneScribe-1M: A Large-Scale Video Dataset with Comprehensive Geometric and Semantic Annotations

Add code
Apr 09, 2026
Viaarxiv icon

Causal World Modeling for Robot Control

Add code
Jan 29, 2026
Viaarxiv icon

Advancing Open-source World Models

Add code
Jan 28, 2026
Viaarxiv icon

Masked Depth Modeling for Spatial Perception

Add code
Jan 25, 2026
Viaarxiv icon

Mixture of Contexts for Long Video Generation

Add code
Aug 28, 2025
Viaarxiv icon

Video World Models with Long-term Spatial Memory

Add code
Jun 05, 2025
Figure 1 for Video World Models with Long-term Spatial Memory
Figure 2 for Video World Models with Long-term Spatial Memory
Figure 3 for Video World Models with Long-term Spatial Memory
Figure 4 for Video World Models with Long-term Spatial Memory
Viaarxiv icon

Interspatial Attention for Efficient 4D Human Video Generation

Add code
May 21, 2025
Figure 1 for Interspatial Attention for Efficient 4D Human Video Generation
Figure 2 for Interspatial Attention for Efficient 4D Human Video Generation
Figure 3 for Interspatial Attention for Efficient 4D Human Video Generation
Figure 4 for Interspatial Attention for Efficient 4D Human Video Generation
Viaarxiv icon

CameraCtrl II: Dynamic Scene Exploration via Camera-controlled Video Diffusion Models

Add code
Mar 13, 2025
Viaarxiv icon