Picture for Jiatao Gu

Jiatao Gu

STARFlow: Scaling Latent Normalizing Flows for High-resolution Image Synthesis

Add code
Jun 06, 2025
Viaarxiv icon

Vid2Sim: Generalizable, Video-based Reconstruction of Appearance, Geometry and Physics for Mesh-free Simulation

Add code
Jun 06, 2025
Viaarxiv icon

Reversal Blessing: Thinking Backward May Outpace Thinking Forward in Multi-choice Questions

Add code
Feb 25, 2025
Viaarxiv icon

Zero-1-to-G: Taming Pretrained 2D Diffusion Model for Direct 3D Generation

Add code
Jan 09, 2025
Figure 1 for Zero-1-to-G: Taming Pretrained 2D Diffusion Model for Direct 3D Generation
Figure 2 for Zero-1-to-G: Taming Pretrained 2D Diffusion Model for Direct 3D Generation
Figure 3 for Zero-1-to-G: Taming Pretrained 2D Diffusion Model for Direct 3D Generation
Figure 4 for Zero-1-to-G: Taming Pretrained 2D Diffusion Model for Direct 3D Generation
Viaarxiv icon

3D Shape Tokenization

Add code
Dec 24, 2024
Figure 1 for 3D Shape Tokenization
Figure 2 for 3D Shape Tokenization
Figure 3 for 3D Shape Tokenization
Figure 4 for 3D Shape Tokenization
Viaarxiv icon

DSplats: 3D Generation by Denoising Splats-Based Multiview Diffusion Models

Add code
Dec 11, 2024
Figure 1 for DSplats: 3D Generation by Denoising Splats-Based Multiview Diffusion Models
Figure 2 for DSplats: 3D Generation by Denoising Splats-Based Multiview Diffusion Models
Figure 3 for DSplats: 3D Generation by Denoising Splats-Based Multiview Diffusion Models
Figure 4 for DSplats: 3D Generation by Denoising Splats-Based Multiview Diffusion Models
Viaarxiv icon

Normalizing Flows are Capable Generative Models

Add code
Dec 10, 2024
Viaarxiv icon

World-consistent Video Diffusion with Explicit 3D Modeling

Add code
Dec 02, 2024
Viaarxiv icon

TypeScore: A Text Fidelity Metric for Text-to-Image Generative Models

Add code
Nov 02, 2024
Figure 1 for TypeScore: A Text Fidelity Metric for Text-to-Image Generative Models
Figure 2 for TypeScore: A Text Fidelity Metric for Text-to-Image Generative Models
Figure 3 for TypeScore: A Text Fidelity Metric for Text-to-Image Generative Models
Figure 4 for TypeScore: A Text Fidelity Metric for Text-to-Image Generative Models
Viaarxiv icon

DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation

Add code
Oct 10, 2024
Figure 1 for DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation
Figure 2 for DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation
Figure 3 for DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation
Figure 4 for DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation
Viaarxiv icon