Picture for Tingbo Hou

Tingbo Hou

Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models

Add code
Apr 24, 2025
Viaarxiv icon

MoCha: Towards Movie-Grade Talking Character Synthesis

Add code
Mar 30, 2025
Viaarxiv icon

Learnings from Scaling Visual Tokenizers for Reconstruction and Generation

Add code
Jan 16, 2025
Figure 1 for Learnings from Scaling Visual Tokenizers for Reconstruction and Generation
Figure 2 for Learnings from Scaling Visual Tokenizers for Reconstruction and Generation
Figure 3 for Learnings from Scaling Visual Tokenizers for Reconstruction and Generation
Figure 4 for Learnings from Scaling Visual Tokenizers for Reconstruction and Generation
Viaarxiv icon

DirectorLLM for Human-Centric Video Generation

Add code
Dec 19, 2024
Figure 1 for DirectorLLM for Human-Centric Video Generation
Figure 2 for DirectorLLM for Human-Centric Video Generation
Figure 3 for DirectorLLM for Human-Centric Video Generation
Figure 4 for DirectorLLM for Human-Centric Video Generation
Viaarxiv icon

LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity

Add code
Dec 13, 2024
Figure 1 for LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity
Figure 2 for LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity
Figure 3 for LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity
Figure 4 for LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity
Viaarxiv icon

Movie Gen: A Cast of Media Foundation Models

Add code
Oct 17, 2024
Figure 1 for Movie Gen: A Cast of Media Foundation Models
Figure 2 for Movie Gen: A Cast of Media Foundation Models
Figure 3 for Movie Gen: A Cast of Media Foundation Models
Figure 4 for Movie Gen: A Cast of Media Foundation Models
Viaarxiv icon

Imagen 3

Add code
Aug 13, 2024
Viaarxiv icon

EM Distillation for One-step Diffusion Models

Add code
May 27, 2024
Viaarxiv icon

3D Congealing: 3D-Aware Image Alignment in the Wild

Add code
Apr 02, 2024
Figure 1 for 3D Congealing: 3D-Aware Image Alignment in the Wild
Figure 2 for 3D Congealing: 3D-Aware Image Alignment in the Wild
Figure 3 for 3D Congealing: 3D-Aware Image Alignment in the Wild
Figure 4 for 3D Congealing: 3D-Aware Image Alignment in the Wild
Viaarxiv icon

PRDP: Proximal Reward Difference Prediction for Large-Scale Reward Finetuning of Diffusion Models

Add code
Feb 13, 2024
Figure 1 for PRDP: Proximal Reward Difference Prediction for Large-Scale Reward Finetuning of Diffusion Models
Figure 2 for PRDP: Proximal Reward Difference Prediction for Large-Scale Reward Finetuning of Diffusion Models
Figure 3 for PRDP: Proximal Reward Difference Prediction for Large-Scale Reward Finetuning of Diffusion Models
Figure 4 for PRDP: Proximal Reward Difference Prediction for Large-Scale Reward Finetuning of Diffusion Models
Viaarxiv icon