Picture for Peizhao Zhang

Peizhao Zhang

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Add code
Jan 15, 2026
Viaarxiv icon

REFA: Real-time Egocentric Facial Animations for Virtual Reality

Add code
Jan 07, 2026
Viaarxiv icon

Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models

Add code
Apr 24, 2025
Viaarxiv icon

DirectorLLM for Human-Centric Video Generation

Add code
Dec 19, 2024
Figure 1 for DirectorLLM for Human-Centric Video Generation
Figure 2 for DirectorLLM for Human-Centric Video Generation
Figure 3 for DirectorLLM for Human-Centric Video Generation
Figure 4 for DirectorLLM for Human-Centric Video Generation
Viaarxiv icon

LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity

Add code
Dec 13, 2024
Figure 1 for LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity
Figure 2 for LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity
Figure 3 for LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity
Figure 4 for LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity
Viaarxiv icon

Movie Gen: A Cast of Media Foundation Models

Add code
Oct 17, 2024
Figure 1 for Movie Gen: A Cast of Media Foundation Models
Figure 2 for Movie Gen: A Cast of Media Foundation Models
Figure 3 for Movie Gen: A Cast of Media Foundation Models
Figure 4 for Movie Gen: A Cast of Media Foundation Models
Viaarxiv icon

An Analysis on Quantizing Diffusion Transformers

Add code
Jun 16, 2024
Figure 1 for An Analysis on Quantizing Diffusion Transformers
Figure 2 for An Analysis on Quantizing Diffusion Transformers
Figure 3 for An Analysis on Quantizing Diffusion Transformers
Figure 4 for An Analysis on Quantizing Diffusion Transformers
Viaarxiv icon

FlowVid: Taming Imperfect Optical Flows for Consistent Video-to-Video Synthesis

Add code
Dec 29, 2023
Figure 1 for FlowVid: Taming Imperfect Optical Flows for Consistent Video-to-Video Synthesis
Figure 2 for FlowVid: Taming Imperfect Optical Flows for Consistent Video-to-Video Synthesis
Figure 3 for FlowVid: Taming Imperfect Optical Flows for Consistent Video-to-Video Synthesis
Figure 4 for FlowVid: Taming Imperfect Optical Flows for Consistent Video-to-Video Synthesis
Viaarxiv icon

Efficient Quantization Strategies for Latent Diffusion Models

Add code
Dec 09, 2023
Figure 1 for Efficient Quantization Strategies for Latent Diffusion Models
Figure 2 for Efficient Quantization Strategies for Latent Diffusion Models
Figure 3 for Efficient Quantization Strategies for Latent Diffusion Models
Figure 4 for Efficient Quantization Strategies for Latent Diffusion Models
Viaarxiv icon

ControlRoom3D: Room Generation using Semantic Proxy Rooms

Add code
Dec 08, 2023
Figure 1 for ControlRoom3D: Room Generation using Semantic Proxy Rooms
Figure 2 for ControlRoom3D: Room Generation using Semantic Proxy Rooms
Figure 3 for ControlRoom3D: Room Generation using Semantic Proxy Rooms
Figure 4 for ControlRoom3D: Room Generation using Semantic Proxy Rooms
Viaarxiv icon