Picture for Yuechen Zhang

Yuechen Zhang

ResMaster: Mastering High-Resolution Image Generation via Structural and Fine-Grained Guidance

Add code
Jun 24, 2024
Viaarxiv icon

Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models

Add code
Mar 27, 2024
Figure 1 for Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
Figure 2 for Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
Figure 3 for Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
Figure 4 for Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
Viaarxiv icon

Prompt Highlighter: Interactive Control for Multi-Modal LLMs

Add code
Dec 07, 2023
Viaarxiv icon

Make-Your-Video: Customized Video Generation Using Textual and Structural Guidance

Add code
Jun 01, 2023
Figure 1 for Make-Your-Video: Customized Video Generation Using Textual and Structural Guidance
Figure 2 for Make-Your-Video: Customized Video Generation Using Textual and Structural Guidance
Figure 3 for Make-Your-Video: Customized Video Generation Using Textual and Structural Guidance
Figure 4 for Make-Your-Video: Customized Video Generation Using Textual and Structural Guidance
Viaarxiv icon

Real-World Image Variation by Aligning Diffusion Inversion Chain

Add code
May 30, 2023
Figure 1 for Real-World Image Variation by Aligning Diffusion Inversion Chain
Figure 2 for Real-World Image Variation by Aligning Diffusion Inversion Chain
Figure 3 for Real-World Image Variation by Aligning Diffusion Inversion Chain
Figure 4 for Real-World Image Variation by Aligning Diffusion Inversion Chain
Viaarxiv icon

Video-P2P: Video Editing with Cross-attention Control

Add code
Mar 08, 2023
Figure 1 for Video-P2P: Video Editing with Cross-attention Control
Figure 2 for Video-P2P: Video Editing with Cross-attention Control
Figure 3 for Video-P2P: Video Editing with Cross-attention Control
Figure 4 for Video-P2P: Video Editing with Cross-attention Control
Viaarxiv icon

CodeTalker: Speech-Driven 3D Facial Animation with Discrete Motion Prior

Add code
Jan 06, 2023
Figure 1 for CodeTalker: Speech-Driven 3D Facial Animation with Discrete Motion Prior
Figure 2 for CodeTalker: Speech-Driven 3D Facial Animation with Discrete Motion Prior
Figure 3 for CodeTalker: Speech-Driven 3D Facial Animation with Discrete Motion Prior
Figure 4 for CodeTalker: Speech-Driven 3D Facial Animation with Discrete Motion Prior
Viaarxiv icon

Ref-NPR: Reference-Based Non-Photorealistic Radiance Fields

Add code
Dec 06, 2022
Figure 1 for Ref-NPR: Reference-Based Non-Photorealistic Radiance Fields
Figure 2 for Ref-NPR: Reference-Based Non-Photorealistic Radiance Fields
Figure 3 for Ref-NPR: Reference-Based Non-Photorealistic Radiance Fields
Figure 4 for Ref-NPR: Reference-Based Non-Photorealistic Radiance Fields
Viaarxiv icon

High Quality Segmentation for Ultra High-resolution Images

Add code
Dec 26, 2021
Figure 1 for High Quality Segmentation for Ultra High-resolution Images
Figure 2 for High Quality Segmentation for Ultra High-resolution Images
Figure 3 for High Quality Segmentation for Ultra High-resolution Images
Figure 4 for High Quality Segmentation for Ultra High-resolution Images
Viaarxiv icon