Picture for Guande He

Guande He

Causality in Video Diffusers is Separable from Denoising

Add code
Feb 10, 2026
Viaarxiv icon

Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactive Video Generation

Add code
Feb 02, 2026
Viaarxiv icon

Self Forcing: Bridging the Train-Test Gap in Autoregressive Video Diffusion

Add code
Jun 09, 2025
Figure 1 for Self Forcing: Bridging the Train-Test Gap in Autoregressive Video Diffusion
Figure 2 for Self Forcing: Bridging the Train-Test Gap in Autoregressive Video Diffusion
Figure 3 for Self Forcing: Bridging the Train-Test Gap in Autoregressive Video Diffusion
Figure 4 for Self Forcing: Bridging the Train-Test Gap in Autoregressive Video Diffusion
Viaarxiv icon

Direct Discriminative Optimization: Your Likelihood-Based Visual Generative Model is Secretly a GAN Discriminator

Add code
Mar 03, 2025
Viaarxiv icon

RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers

Add code
Feb 21, 2025
Figure 1 for RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers
Figure 2 for RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers
Figure 3 for RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers
Figure 4 for RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers
Viaarxiv icon

Elucidating the Preconditioning in Consistency Distillation

Add code
Feb 05, 2025
Viaarxiv icon

Unveiling Uncertainty: A Deep Dive into Calibration and Performance of Multimodal Large Language Models

Add code
Dec 19, 2024
Figure 1 for Unveiling Uncertainty: A Deep Dive into Calibration and Performance of Multimodal Large Language Models
Figure 2 for Unveiling Uncertainty: A Deep Dive into Calibration and Performance of Multimodal Large Language Models
Figure 3 for Unveiling Uncertainty: A Deep Dive into Calibration and Performance of Multimodal Large Language Models
Figure 4 for Unveiling Uncertainty: A Deep Dive into Calibration and Performance of Multimodal Large Language Models
Viaarxiv icon

Exploring Aleatoric Uncertainty in Object Detection via Vision Foundation Models

Add code
Nov 26, 2024
Figure 1 for Exploring Aleatoric Uncertainty in Object Detection via Vision Foundation Models
Figure 2 for Exploring Aleatoric Uncertainty in Object Detection via Vision Foundation Models
Figure 3 for Exploring Aleatoric Uncertainty in Object Detection via Vision Foundation Models
Figure 4 for Exploring Aleatoric Uncertainty in Object Detection via Vision Foundation Models
Viaarxiv icon

Consistency Diffusion Bridge Models

Add code
Oct 31, 2024
Figure 1 for Consistency Diffusion Bridge Models
Figure 2 for Consistency Diffusion Bridge Models
Figure 3 for Consistency Diffusion Bridge Models
Figure 4 for Consistency Diffusion Bridge Models
Viaarxiv icon

Diffusion Bridge Implicit Models

Add code
May 24, 2024
Figure 1 for Diffusion Bridge Implicit Models
Figure 2 for Diffusion Bridge Implicit Models
Figure 3 for Diffusion Bridge Implicit Models
Figure 4 for Diffusion Bridge Implicit Models
Viaarxiv icon