Picture for Tao Mei

Tao Mei

Prompt Refinement with Image Pivot for Text-to-Image Generation

Add code
Jun 28, 2024
Viaarxiv icon

Boosting Diffusion Models with Moving Average Sampling in Frequency Domain

Add code
Mar 26, 2024
Figure 1 for Boosting Diffusion Models with Moving Average Sampling in Frequency Domain
Figure 2 for Boosting Diffusion Models with Moving Average Sampling in Frequency Domain
Figure 3 for Boosting Diffusion Models with Moving Average Sampling in Frequency Domain
Figure 4 for Boosting Diffusion Models with Moving Average Sampling in Frequency Domain
Viaarxiv icon

Learning Spatial Adaptation and Temporal Coherence in Diffusion Models for Video Super-Resolution

Add code
Mar 25, 2024
Viaarxiv icon

TRIP: Temporal Residual Learning with Image Noise Prior for Image-to-Video Diffusion Models

Add code
Mar 25, 2024
Figure 1 for TRIP: Temporal Residual Learning with Image Noise Prior for Image-to-Video Diffusion Models
Figure 2 for TRIP: Temporal Residual Learning with Image Noise Prior for Image-to-Video Diffusion Models
Figure 3 for TRIP: Temporal Residual Learning with Image Noise Prior for Image-to-Video Diffusion Models
Figure 4 for TRIP: Temporal Residual Learning with Image Noise Prior for Image-to-Video Diffusion Models
Viaarxiv icon

SD-DiT: Unleashing the Power of Self-supervised Discrimination in Diffusion Transformer

Add code
Mar 25, 2024
Figure 1 for SD-DiT: Unleashing the Power of Self-supervised Discrimination in Diffusion Transformer
Figure 2 for SD-DiT: Unleashing the Power of Self-supervised Discrimination in Diffusion Transformer
Figure 3 for SD-DiT: Unleashing the Power of Self-supervised Discrimination in Diffusion Transformer
Figure 4 for SD-DiT: Unleashing the Power of Self-supervised Discrimination in Diffusion Transformer
Viaarxiv icon

VP3D: Unleashing 2D Visual Prompt for Text-to-3D Generation

Add code
Mar 25, 2024
Viaarxiv icon

HIRI-ViT: Scaling Vision Transformer with High Resolution Inputs

Add code
Mar 18, 2024
Figure 1 for HIRI-ViT: Scaling Vision Transformer with High Resolution Inputs
Figure 2 for HIRI-ViT: Scaling Vision Transformer with High Resolution Inputs
Figure 3 for HIRI-ViT: Scaling Vision Transformer with High Resolution Inputs
Figure 4 for HIRI-ViT: Scaling Vision Transformer with High Resolution Inputs
Viaarxiv icon

VideoDrafter: Content-Consistent Multi-Scene Video Generation with LLM

Add code
Jan 02, 2024
Viaarxiv icon

Control3D: Towards Controllable Text-to-3D Generation

Add code
Nov 09, 2023
Figure 1 for Control3D: Towards Controllable Text-to-3D Generation
Figure 2 for Control3D: Towards Controllable Text-to-3D Generation
Figure 3 for Control3D: Towards Controllable Text-to-3D Generation
Figure 4 for Control3D: Towards Controllable Text-to-3D Generation
Viaarxiv icon

ControlStyle: Text-Driven Stylized Image Generation Using Diffusion Priors

Add code
Nov 09, 2023
Figure 1 for ControlStyle: Text-Driven Stylized Image Generation Using Diffusion Priors
Figure 2 for ControlStyle: Text-Driven Stylized Image Generation Using Diffusion Priors
Figure 3 for ControlStyle: Text-Driven Stylized Image Generation Using Diffusion Priors
Figure 4 for ControlStyle: Text-Driven Stylized Image Generation Using Diffusion Priors
Viaarxiv icon