Liangliang Cao

Diffusion Model-Based Image Editing: A Survey

Feb 27, 2024

Efficient-NeRF2NeRF: Streamlining Text-Driven 3D Editing with Multiview Correspondence-Enhanced Diffusion Models

Dec 26, 2023

Ferret: Refer and Ground Anything Anywhere at Any Granularity

Oct 11, 2023

Efficient-3DiM: Learning a Generalizable Single-image Novel-view Synthesizer in One Day

Oct 04, 2023

Instruction-Following Speech Recognition

Sep 18, 2023

RoomDreamer: Text-Driven 3D Indoor Scene Synthesis with Coherent Geometry and Texture

May 18, 2023

Less is More: Removing Text-regions Improves CLIP Training Efficiency and Robustness

May 08, 2023

STAIR: Learning Sparse Text and Image Representation in Grounded Tokens

Feb 08, 2023

Exploiting Category Names for Few-Shot Classification with Vision-Language Models

Dec 04, 2022

SurFit: Learning to Fit Surfaces Improves Few Shot Learning on Point Clouds

Dec 27, 2021