Picture for Dongdong Chen

Dongdong Chen

Rethinking Visual Prompting for Multimodal Large Language Models with External Knowledge

Add code
Jul 05, 2024
Viaarxiv icon

Transformer based Pluralistic Image Completion with Reduced Information Loss

Add code
Apr 15, 2024
Figure 1 for Transformer based Pluralistic Image Completion with Reduced Information Loss
Figure 2 for Transformer based Pluralistic Image Completion with Reduced Information Loss
Figure 3 for Transformer based Pluralistic Image Completion with Reduced Information Loss
Figure 4 for Transformer based Pluralistic Image Completion with Reduced Information Loss
Viaarxiv icon

OmniVid: A Generative Framework for Universal Video Understanding

Add code
Mar 26, 2024
Figure 1 for OmniVid: A Generative Framework for Universal Video Understanding
Figure 2 for OmniVid: A Generative Framework for Universal Video Understanding
Figure 3 for OmniVid: A Generative Framework for Universal Video Understanding
Figure 4 for OmniVid: A Generative Framework for Universal Video Understanding
Viaarxiv icon

Generative Enhancement for 3D Medical Images

Add code
Mar 19, 2024
Figure 1 for Generative Enhancement for 3D Medical Images
Figure 2 for Generative Enhancement for 3D Medical Images
Figure 3 for Generative Enhancement for 3D Medical Images
Figure 4 for Generative Enhancement for 3D Medical Images
Viaarxiv icon

Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation

Add code
Mar 18, 2024
Figure 1 for Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation
Figure 2 for Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation
Figure 3 for Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation
Figure 4 for Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation
Viaarxiv icon

Real-Time Image Segmentation via Hybrid Convolutional-Transformer Architecture Search

Add code
Mar 15, 2024
Figure 1 for Real-Time Image Segmentation via Hybrid Convolutional-Transformer Architecture Search
Figure 2 for Real-Time Image Segmentation via Hybrid Convolutional-Transformer Architecture Search
Figure 3 for Real-Time Image Segmentation via Hybrid Convolutional-Transformer Architecture Search
Figure 4 for Real-Time Image Segmentation via Hybrid Convolutional-Transformer Architecture Search
Viaarxiv icon

Diffusion Posterior Proximal Sampling for Image Restoration

Add code
Feb 25, 2024
Figure 1 for Diffusion Posterior Proximal Sampling for Image Restoration
Figure 2 for Diffusion Posterior Proximal Sampling for Image Restoration
Figure 3 for Diffusion Posterior Proximal Sampling for Image Restoration
Figure 4 for Diffusion Posterior Proximal Sampling for Image Restoration
Viaarxiv icon

Image Fusion via Vision-Language Model

Add code
Feb 03, 2024
Figure 1 for Image Fusion via Vision-Language Model
Figure 2 for Image Fusion via Vision-Language Model
Figure 3 for Image Fusion via Vision-Language Model
Figure 4 for Image Fusion via Vision-Language Model
Viaarxiv icon

Towards More Unified In-context Visual Understanding

Add code
Dec 05, 2023
Figure 1 for Towards More Unified In-context Visual Understanding
Figure 2 for Towards More Unified In-context Visual Understanding
Figure 3 for Towards More Unified In-context Visual Understanding
Figure 4 for Towards More Unified In-context Visual Understanding
Viaarxiv icon

Mesh-Guided Neural Implicit Field Editing

Add code
Dec 04, 2023
Figure 1 for Mesh-Guided Neural Implicit Field Editing
Figure 2 for Mesh-Guided Neural Implicit Field Editing
Figure 3 for Mesh-Guided Neural Implicit Field Editing
Figure 4 for Mesh-Guided Neural Implicit Field Editing
Viaarxiv icon