Dongdong Chen

SynChart: Synthesizing Charts from Language Models

Sep 25, 2024
Via arXiv

Pluralistic Salient Object Detection

Sep 04, 2024
Via arXiv

Chat2Layout: Interactive 3D Furniture Layout with a Multimodal LLM

Jul 31, 2024
Via arXiv

Rethinking Visual Prompting for Multimodal Large Language Models with External Knowledge

Jul 05, 2024
Via arXiv

Transformer based Pluralistic Image Completion with Reduced Information Loss

Apr 15, 2024
Via arXiv

OmniVid: A Generative Framework for Universal Video Understanding

Mar 26, 2024
Via arXiv

Generative Enhancement for 3D Medical Images

Mar 19, 2024
Via arXiv

Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation

Mar 18, 2024
Via arXiv

Real-Time Image Segmentation via Hybrid Convolutional-Transformer Architecture Search

Mar 15, 2024
Via arXiv

Diffusion Posterior Proximal Sampling for Image Restoration

Feb 25, 2024
Via arXiv