Yuhui Yuan

Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Multilingual Visual Text Rendering

Jun 14, 2024
Via arXiv

FontStudio: Shape-Adaptive Diffusion Model for Coherent and Consistent Font Effect Generation

Jun 12, 2024
Via arXiv

Step-aware Preference Optimization: Aligning Preference with Denoising Performance at Each Step

Jun 06, 2024
Via arXiv

DesignEdit: Multi-Layered Latent Decomposition and Fusion for Unified & Accurate Image Editing

Mar 21, 2024
Via arXiv

Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering

Mar 14, 2024
Via arXiv

Customize-It-3D: High-Quality 3D Creation from A Single Image Using Subject-Specific Knowledge Prior

Dec 15, 2023
Via arXiv

ART•V: Auto-Regressive Text-to-Video Generation with Diffusion Models

Nov 30, 2023
Via arXiv

MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation

Nov 30, 2023
Via arXiv

COLE: A Hierarchical Generation Framework for Graphic Design

Nov 28, 2023
Via arXiv

Rank-DETR for High Quality Object Detection

Oct 19, 2023
Via arXiv