Pei Cheng

EMMA: Your Text-to-Image Diffusion Model Can Secretly Accept Multi-Modal Prompts

Jun 13, 2024
Via arXiv

ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment

Mar 08, 2024
Via arXiv

FaceStudio: Put Your Face Everywhere in Seconds

Dec 06, 2023
Via arXiv

Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation

Jul 03, 2023
Via arXiv

Learning Variational Motion Prior for Video-based Motion Capture

Oct 28, 2022
Via arXiv

Coordinates Are NOT Lonely -- Codebook Prior Helps Implicit Neural 3D Representations

Oct 20, 2022
Via arXiv

NTIRE 2022 Challenge on Super-Resolution and Quality Enhancement of Compressed Video: Dataset, Methods and Results

Apr 25, 2022
Via arXiv

Shuffle Transformer with Feature Alignment for Video Face Parsing

Jun 16, 2021
Via arXiv

Shuffle Transformer: Rethinking Spatial Shuffle for Vision Transformer

Jun 07, 2021
Via arXiv