Picture for Zhen Xing

Zhen Xing

Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms

Add code
Jun 13, 2024
Figure 1 for Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms
Figure 2 for Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms
Figure 3 for Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms
Figure 4 for Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms
Viaarxiv icon

AID: Adapting Image2Video Diffusion Models for Instruction-guided Video Prediction

Add code
Jun 10, 2024
Viaarxiv icon

FDGaussian: Fast Gaussian Splatting from Single Image via Geometric-aware Diffusion Model

Add code
Mar 15, 2024
Figure 1 for FDGaussian: Fast Gaussian Splatting from Single Image via Geometric-aware Diffusion Model
Figure 2 for FDGaussian: Fast Gaussian Splatting from Single Image via Geometric-aware Diffusion Model
Figure 3 for FDGaussian: Fast Gaussian Splatting from Single Image via Geometric-aware Diffusion Model
Figure 4 for FDGaussian: Fast Gaussian Splatting from Single Image via Geometric-aware Diffusion Model
Viaarxiv icon

VIDiff: Translating Videos via Multi-Modal Instructions with Diffusion Models

Add code
Nov 30, 2023
Figure 1 for VIDiff: Translating Videos via Multi-Modal Instructions with Diffusion Models
Figure 2 for VIDiff: Translating Videos via Multi-Modal Instructions with Diffusion Models
Figure 3 for VIDiff: Translating Videos via Multi-Modal Instructions with Diffusion Models
Figure 4 for VIDiff: Translating Videos via Multi-Modal Instructions with Diffusion Models
Viaarxiv icon

AdaDiff: Adaptive Step Selection for Fast Diffusion

Add code
Nov 24, 2023
Figure 1 for AdaDiff: Adaptive Step Selection for Fast Diffusion
Figure 2 for AdaDiff: Adaptive Step Selection for Fast Diffusion
Figure 3 for AdaDiff: Adaptive Step Selection for Fast Diffusion
Figure 4 for AdaDiff: Adaptive Step Selection for Fast Diffusion
Viaarxiv icon

A Survey on Video Diffusion Models

Add code
Oct 16, 2023
Figure 1 for A Survey on Video Diffusion Models
Figure 2 for A Survey on Video Diffusion Models
Figure 3 for A Survey on Video Diffusion Models
Figure 4 for A Survey on Video Diffusion Models
Viaarxiv icon

Unsupervised Disentangling of Facial Representations with 3D-aware Latent Diffusion Models

Add code
Sep 15, 2023
Figure 1 for Unsupervised Disentangling of Facial Representations with 3D-aware Latent Diffusion Models
Figure 2 for Unsupervised Disentangling of Facial Representations with 3D-aware Latent Diffusion Models
Figure 3 for Unsupervised Disentangling of Facial Representations with 3D-aware Latent Diffusion Models
Figure 4 for Unsupervised Disentangling of Facial Representations with 3D-aware Latent Diffusion Models
Viaarxiv icon

PanoSwin: a Pano-style Swin Transformer for Panorama Understanding

Add code
Aug 28, 2023
Figure 1 for PanoSwin: a Pano-style Swin Transformer for Panorama Understanding
Figure 2 for PanoSwin: a Pano-style Swin Transformer for Panorama Understanding
Figure 3 for PanoSwin: a Pano-style Swin Transformer for Panorama Understanding
Figure 4 for PanoSwin: a Pano-style Swin Transformer for Panorama Understanding
Viaarxiv icon

SimDA: Simple Diffusion Adapter for Efficient Video Generation

Add code
Aug 18, 2023
Figure 1 for SimDA: Simple Diffusion Adapter for Efficient Video Generation
Figure 2 for SimDA: Simple Diffusion Adapter for Efficient Video Generation
Figure 3 for SimDA: Simple Diffusion Adapter for Efficient Video Generation
Figure 4 for SimDA: Simple Diffusion Adapter for Efficient Video Generation
Viaarxiv icon

TranSFormer: Slow-Fast Transformer for Machine Translation

Add code
May 26, 2023
Figure 1 for TranSFormer: Slow-Fast Transformer for Machine Translation
Figure 2 for TranSFormer: Slow-Fast Transformer for Machine Translation
Figure 3 for TranSFormer: Slow-Fast Transformer for Machine Translation
Figure 4 for TranSFormer: Slow-Fast Transformer for Machine Translation
Viaarxiv icon