Picture for Yunpeng Zhang

Yunpeng Zhang

FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis

Add code
Apr 07, 2025
Viaarxiv icon

Audio-Plane: Audio Factorization Plane Gaussian Splatting for Real-Time Talking Head Synthesis

Add code
Mar 28, 2025
Viaarxiv icon

FantasyID: Face Knowledge Enhanced ID-Preserving Video Generation

Add code
Feb 19, 2025
Viaarxiv icon

DrivingRecon: Large 4D Gaussian Reconstruction Model For Autonomous Driving

Add code
Dec 12, 2024
Figure 1 for DrivingRecon: Large 4D Gaussian Reconstruction Model For Autonomous Driving
Figure 2 for DrivingRecon: Large 4D Gaussian Reconstruction Model For Autonomous Driving
Figure 3 for DrivingRecon: Large 4D Gaussian Reconstruction Model For Autonomous Driving
Figure 4 for DrivingRecon: Large 4D Gaussian Reconstruction Model For Autonomous Driving
Viaarxiv icon

GPD-1: Generative Pre-training for Driving

Add code
Dec 11, 2024
Figure 1 for GPD-1: Generative Pre-training for Driving
Figure 2 for GPD-1: Generative Pre-training for Driving
Figure 3 for GPD-1: Generative Pre-training for Driving
Figure 4 for GPD-1: Generative Pre-training for Driving
Viaarxiv icon

Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model

Add code
Dec 06, 2024
Figure 1 for Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model
Figure 2 for Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model
Figure 3 for Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model
Figure 4 for Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model
Viaarxiv icon

GaussianFormer-2: Probabilistic Gaussian Superposition for Efficient 3D Occupancy Prediction

Add code
Dec 06, 2024
Viaarxiv icon

Probabilistic Gaussian Superposition for Efficient 3D Occupancy Prediction

Add code
Dec 05, 2024
Viaarxiv icon

FactorLLM: Factorizing Knowledge via Mixture of Experts for Large Language Models

Add code
Aug 15, 2024
Figure 1 for FactorLLM: Factorizing Knowledge via Mixture of Experts for Large Language Models
Figure 2 for FactorLLM: Factorizing Knowledge via Mixture of Experts for Large Language Models
Figure 3 for FactorLLM: Factorizing Knowledge via Mixture of Experts for Large Language Models
Figure 4 for FactorLLM: Factorizing Knowledge via Mixture of Experts for Large Language Models
Viaarxiv icon

GaussianFormer: Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction

Add code
May 27, 2024
Figure 1 for GaussianFormer: Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction
Figure 2 for GaussianFormer: Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction
Figure 3 for GaussianFormer: Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction
Figure 4 for GaussianFormer: Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction
Viaarxiv icon