Picture for Shuai Yang

Shuai Yang

LongLive: Real-time Interactive Long Video Generation

Add code
Sep 26, 2025
Viaarxiv icon

LINR Bridge: Vector Graphic Animation via Neural Implicits and Video Diffusion Priors

Add code
Sep 09, 2025
Viaarxiv icon

ANYPORTAL: Zero-Shot Consistent Video Background Replacement

Add code
Sep 09, 2025
Viaarxiv icon

STream3R: Scalable Sequential 3D Reconstruction with Causal Transformer

Add code
Aug 14, 2025
Viaarxiv icon

InstructVLA: Vision-Language-Action Instruction Tuning from Understanding to Manipulation

Add code
Jul 23, 2025
Viaarxiv icon

CronusVLA: Transferring Latent Motion Across Time for Multi-Frame Prediction in Manipulation

Add code
Jun 24, 2025
Viaarxiv icon

GENMANIP: LLM-driven Simulation for Generalizable Instruction-Following Manipulation

Add code
Jun 12, 2025
Viaarxiv icon

Video World Models with Long-term Spatial Memory

Add code
Jun 05, 2025
Figure 1 for Video World Models with Long-term Spatial Memory
Figure 2 for Video World Models with Long-term Spatial Memory
Figure 3 for Video World Models with Long-term Spatial Memory
Figure 4 for Video World Models with Long-term Spatial Memory
Viaarxiv icon

Training-Free Watermarking for Autoregressive Image Generation

Add code
May 20, 2025
Figure 1 for Training-Free Watermarking for Autoregressive Image Generation
Figure 2 for Training-Free Watermarking for Autoregressive Image Generation
Figure 3 for Training-Free Watermarking for Autoregressive Image Generation
Figure 4 for Training-Free Watermarking for Autoregressive Image Generation
Viaarxiv icon

On the Eligibility of LLMs for Counterfactual Reasoning: A Decompositional Study

Add code
May 17, 2025
Viaarxiv icon