Picture for Feng Zhao

Feng Zhao

Prototype Clustered Diffusion Models for Versatile Inverse Problems

Add code
Jul 13, 2024
Viaarxiv icon

RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models

Add code
Jul 11, 2024
Viaarxiv icon

PPTFormer: Pseudo Multi-Perspective Transformer for UAV Segmentation

Add code
Jun 28, 2024
Viaarxiv icon

Using diffusion model as constraint: Empower Image Restoration Network Training with Diffusion Model

Add code
Jun 27, 2024
Figure 1 for Using diffusion model as constraint: Empower Image Restoration Network Training with Diffusion Model
Figure 2 for Using diffusion model as constraint: Empower Image Restoration Network Training with Diffusion Model
Figure 3 for Using diffusion model as constraint: Empower Image Restoration Network Training with Diffusion Model
Viaarxiv icon

SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Model

Add code
Jun 17, 2024
Figure 1 for SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Model
Figure 2 for SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Model
Figure 3 for SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Model
Figure 4 for SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Model
Viaarxiv icon

Discrete Latent Perspective Learning for Segmentation and Detection

Add code
Jun 15, 2024
Figure 1 for Discrete Latent Perspective Learning for Segmentation and Detection
Figure 2 for Discrete Latent Perspective Learning for Segmentation and Detection
Figure 3 for Discrete Latent Perspective Learning for Segmentation and Detection
Figure 4 for Discrete Latent Perspective Learning for Segmentation and Detection
Viaarxiv icon

ShareGPT4Video: Improving Video Understanding and Generation with Better Captions

Add code
Jun 06, 2024
Figure 1 for ShareGPT4Video: Improving Video Understanding and Generation with Better Captions
Figure 2 for ShareGPT4Video: Improving Video Understanding and Generation with Better Captions
Figure 3 for ShareGPT4Video: Improving Video Understanding and Generation with Better Captions
Figure 4 for ShareGPT4Video: Improving Video Understanding and Generation with Better Captions
Viaarxiv icon

From Macro to Micro: Boosting micro-expression recognition via pre-training on macro-expression videos

Add code
May 26, 2024
Viaarxiv icon

Are We on the Right Way for Evaluating Large Vision-Language Models?

Add code
Apr 09, 2024
Figure 1 for Are We on the Right Way for Evaluating Large Vision-Language Models?
Figure 2 for Are We on the Right Way for Evaluating Large Vision-Language Models?
Figure 3 for Are We on the Right Way for Evaluating Large Vision-Language Models?
Figure 4 for Are We on the Right Way for Evaluating Large Vision-Language Models?
Viaarxiv icon

GaussianCube: Structuring Gaussian Splatting using Optimal Transport for 3D Generative Modeling

Add code
Apr 05, 2024
Figure 1 for GaussianCube: Structuring Gaussian Splatting using Optimal Transport for 3D Generative Modeling
Figure 2 for GaussianCube: Structuring Gaussian Splatting using Optimal Transport for 3D Generative Modeling
Figure 3 for GaussianCube: Structuring Gaussian Splatting using Optimal Transport for 3D Generative Modeling
Figure 4 for GaussianCube: Structuring Gaussian Splatting using Optimal Transport for 3D Generative Modeling
Viaarxiv icon