Picture for Xuansong Xie

Xuansong Xie

DAMO Academy, Alibaba Group

WordArt Designer API: User-Driven Artistic Typography Synthesis with Large Language Models on ModelScope

Add code
Jan 12, 2024
Figure 1 for WordArt Designer API: User-Driven Artistic Typography Synthesis with Large Language Models on ModelScope
Figure 2 for WordArt Designer API: User-Driven Artistic Typography Synthesis with Large Language Models on ModelScope
Figure 3 for WordArt Designer API: User-Driven Artistic Typography Synthesis with Large Language Models on ModelScope
Figure 4 for WordArt Designer API: User-Driven Artistic Typography Synthesis with Large Language Models on ModelScope
Viaarxiv icon

En3D: An Enhanced Generative Model for Sculpting 3D Humans from 2D Synthetic Data

Add code
Jan 02, 2024
Figure 1 for En3D: An Enhanced Generative Model for Sculpting 3D Humans from 2D Synthetic Data
Figure 2 for En3D: An Enhanced Generative Model for Sculpting 3D Humans from 2D Synthetic Data
Figure 3 for En3D: An Enhanced Generative Model for Sculpting 3D Humans from 2D Synthetic Data
Figure 4 for En3D: An Enhanced Generative Model for Sculpting 3D Humans from 2D Synthetic Data
Viaarxiv icon

Tracking with Human-Intent Reasoning

Add code
Dec 29, 2023
Viaarxiv icon

DiffusionGAN3D: Boosting Text-guided 3D Generation and Domain Adaption by Combining 3D GANs and Diffusion Priors

Add code
Dec 29, 2023
Figure 1 for DiffusionGAN3D: Boosting Text-guided 3D Generation and Domain Adaption by Combining 3D GANs and Diffusion Priors
Figure 2 for DiffusionGAN3D: Boosting Text-guided 3D Generation and Domain Adaption by Combining 3D GANs and Diffusion Priors
Figure 3 for DiffusionGAN3D: Boosting Text-guided 3D Generation and Domain Adaption by Combining 3D GANs and Diffusion Priors
Figure 4 for DiffusionGAN3D: Boosting Text-guided 3D Generation and Domain Adaption by Combining 3D GANs and Diffusion Priors
Viaarxiv icon

DreaMoving: A Human Video Generation Framework based on Diffusion Models

Add code
Dec 11, 2023
Figure 1 for DreaMoving: A Human Video Generation Framework based on Diffusion Models
Figure 2 for DreaMoving: A Human Video Generation Framework based on Diffusion Models
Figure 3 for DreaMoving: A Human Video Generation Framework based on Diffusion Models
Figure 4 for DreaMoving: A Human Video Generation Framework based on Diffusion Models
Viaarxiv icon

Boosting3D: High-Fidelity Image-to-3D by Boosting 2D Diffusion Prior to 3D Prior with Progressive Learning

Add code
Nov 22, 2023
Figure 1 for Boosting3D: High-Fidelity Image-to-3D by Boosting 2D Diffusion Prior to 3D Prior with Progressive Learning
Figure 2 for Boosting3D: High-Fidelity Image-to-3D by Boosting 2D Diffusion Prior to 3D Prior with Progressive Learning
Figure 3 for Boosting3D: High-Fidelity Image-to-3D by Boosting 2D Diffusion Prior to 3D Prior with Progressive Learning
Figure 4 for Boosting3D: High-Fidelity Image-to-3D by Boosting 2D Diffusion Prior to 3D Prior with Progressive Learning
Viaarxiv icon

Diffusion360: Seamless 360 Degree Panoramic Image Generation based on Diffusion Models

Add code
Nov 22, 2023
Figure 1 for Diffusion360: Seamless 360 Degree Panoramic Image Generation based on Diffusion Models
Figure 2 for Diffusion360: Seamless 360 Degree Panoramic Image Generation based on Diffusion Models
Figure 3 for Diffusion360: Seamless 360 Degree Panoramic Image Generation based on Diffusion Models
Figure 4 for Diffusion360: Seamless 360 Degree Panoramic Image Generation based on Diffusion Models
Viaarxiv icon

FMViT: A multiple-frequency mixing Vision Transformer

Add code
Nov 09, 2023
Figure 1 for FMViT: A multiple-frequency mixing Vision Transformer
Figure 2 for FMViT: A multiple-frequency mixing Vision Transformer
Figure 3 for FMViT: A multiple-frequency mixing Vision Transformer
Figure 4 for FMViT: A multiple-frequency mixing Vision Transformer
Viaarxiv icon

AnyText: Multilingual Visual Text Generation And Editing

Add code
Nov 07, 2023
Figure 1 for AnyText: Multilingual Visual Text Generation And Editing
Figure 2 for AnyText: Multilingual Visual Text Generation And Editing
Figure 3 for AnyText: Multilingual Visual Text Generation And Editing
Figure 4 for AnyText: Multilingual Visual Text Generation And Editing
Viaarxiv icon

WordArt Designer: User-Driven Artistic Typography Synthesis using Large Language Models

Add code
Oct 20, 2023
Figure 1 for WordArt Designer: User-Driven Artistic Typography Synthesis using Large Language Models
Figure 2 for WordArt Designer: User-Driven Artistic Typography Synthesis using Large Language Models
Figure 3 for WordArt Designer: User-Driven Artistic Typography Synthesis using Large Language Models
Figure 4 for WordArt Designer: User-Driven Artistic Typography Synthesis using Large Language Models
Viaarxiv icon