Picture for Ce Liu

Ce Liu

NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations

Add code
Jun 15, 2023
Figure 1 for NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations
Figure 2 for NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations
Figure 3 for NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations
Figure 4 for NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations
Viaarxiv icon

Indiscernible Object Counting in Underwater Scenes

Add code
Apr 23, 2023
Figure 1 for Indiscernible Object Counting in Underwater Scenes
Figure 2 for Indiscernible Object Counting in Underwater Scenes
Figure 3 for Indiscernible Object Counting in Underwater Scenes
Figure 4 for Indiscernible Object Counting in Underwater Scenes
Viaarxiv icon

Single Image Depth Prediction Made Better: A Multivariate Gaussian Take

Add code
Apr 18, 2023
Viaarxiv icon

MM-REACT: Prompting ChatGPT for Multimodal Reasoning and Action

Add code
Mar 20, 2023
Figure 1 for MM-REACT: Prompting ChatGPT for Multimodal Reasoning and Action
Figure 2 for MM-REACT: Prompting ChatGPT for Multimodal Reasoning and Action
Figure 3 for MM-REACT: Prompting ChatGPT for Multimodal Reasoning and Action
Figure 4 for MM-REACT: Prompting ChatGPT for Multimodal Reasoning and Action
Viaarxiv icon

VA-DepthNet: A Variational Approach to Single Image Depth Prediction

Add code
Feb 15, 2023
Figure 1 for VA-DepthNet: A Variational Approach to Single Image Depth Prediction
Figure 2 for VA-DepthNet: A Variational Approach to Single Image Depth Prediction
Figure 3 for VA-DepthNet: A Variational Approach to Single Image Depth Prediction
Figure 4 for VA-DepthNet: A Variational Approach to Single Image Depth Prediction
Viaarxiv icon

Learning Customized Visual Models with Retrieval-Augmented Knowledge

Add code
Jan 17, 2023
Figure 1 for Learning Customized Visual Models with Retrieval-Augmented Knowledge
Figure 2 for Learning Customized Visual Models with Retrieval-Augmented Knowledge
Figure 3 for Learning Customized Visual Models with Retrieval-Augmented Knowledge
Figure 4 for Learning Customized Visual Models with Retrieval-Augmented Knowledge
Viaarxiv icon

X-Paste: Revisit Copy-Paste at Scale with CLIP and StableDiffusion

Add code
Dec 07, 2022
Figure 1 for X-Paste: Revisit Copy-Paste at Scale with CLIP and StableDiffusion
Figure 2 for X-Paste: Revisit Copy-Paste at Scale with CLIP and StableDiffusion
Figure 3 for X-Paste: Revisit Copy-Paste at Scale with CLIP and StableDiffusion
Figure 4 for X-Paste: Revisit Copy-Paste at Scale with CLIP and StableDiffusion
Viaarxiv icon

ReCo: Region-Controlled Text-to-Image Generation

Add code
Nov 23, 2022
Viaarxiv icon

OmniVL:One Foundation Model for Image-Language and Video-Language Tasks

Add code
Sep 15, 2022
Figure 1 for OmniVL:One Foundation Model for Image-Language and Video-Language Tasks
Figure 2 for OmniVL:One Foundation Model for Image-Language and Video-Language Tasks
Figure 3 for OmniVL:One Foundation Model for Image-Language and Video-Language Tasks
Figure 4 for OmniVL:One Foundation Model for Image-Language and Video-Language Tasks
Viaarxiv icon

Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone

Add code
Jun 15, 2022
Figure 1 for Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone
Figure 2 for Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone
Figure 3 for Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone
Figure 4 for Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone
Viaarxiv icon