Tianyu He

UEPS: Robust and Efficient MRI Reconstruction

Mar 19, 2026
Via arXiv

Beyond Pixel Histories: World Models with Persistent 3D State

Mar 03, 2026
Via arXiv

LIVE: Long-horizon Interactive Video World Modeling

Feb 03, 2026
Via arXiv

Luminark: Training-free, Probabilistically-Certified Watermarking for General Vision Generative Models

Jan 03, 2026
Via arXiv

Geometry Forcing: Marrying Video Diffusion and 3D Representation for Consistent World Modeling

Jul 10, 2025
Via arXiv

MineWorld: a Real-Time and Open-Source Interactive World Model on Minecraft

Apr 11, 2025
Via arXiv

Fast Autoregressive Video Generation with Diagonal Decoding

Mar 18, 2025
Via arXiv

HiTVideo: Hierarchical Tokenizers for Enhancing Text-to-Video Generation with Autoregressive Large Language Models

Mar 14, 2025
Via arXiv

AR4D: Autoregressive 4D Generation from Monocular Videos

Jan 03, 2025
Via arXiv

3D-LLaVA: Towards Generalist 3D LMMs with Omni Superpoint Transformer

Jan 02, 2025
Via arXiv