Picture for Jiaxin Shi

Jiaxin Shi

3DThinkVLA: Endowing Vision-Language-Action Models with Latent 3D Priors via 3D-Thinking-Guided Co-training

Add code
Jun 03, 2026
Viaarxiv icon

Variational Learning for Insertion-based Generation

Add code
Jun 01, 2026
Viaarxiv icon

The Efficiency Gap in Byte Modeling

Add code
May 13, 2026
Viaarxiv icon

RealCam: Real-Time Novel-View Video Generation with Interactive Camera Control

Add code
May 07, 2026
Viaarxiv icon

PASR: Pose-Aware 3D Shape Retrieval from Occluded Single Views

Add code
Apr 24, 2026
Viaarxiv icon

DiT as Real-Time Rerenderer: Streaming Video Stylization with Autoregressive Diffusion Transformer

Add code
Apr 15, 2026
Viaarxiv icon

Generative Frontiers: Why Evaluation Matters for Diffusion Language Models

Add code
Apr 03, 2026
Viaarxiv icon

CoMo: Compositional Motion Customization for Text-to-Video Generation

Add code
Oct 27, 2025
Viaarxiv icon

CANDI: Hybrid Discrete-Continuous Diffusion Models

Add code
Oct 26, 2025
Viaarxiv icon

Learning-Order Autoregressive Models with Application to Molecular Graph Generation

Add code
Mar 07, 2025
Figure 1 for Learning-Order Autoregressive Models with Application to Molecular Graph Generation
Figure 2 for Learning-Order Autoregressive Models with Application to Molecular Graph Generation
Figure 3 for Learning-Order Autoregressive Models with Application to Molecular Graph Generation
Figure 4 for Learning-Order Autoregressive Models with Application to Molecular Graph Generation
Viaarxiv icon