Dongdong Chen

Diffusion Posterior Proximal Sampling for Image Restoration

Feb 25, 2024
Via arXiv

Image Fusion via Vision-Language Model

Feb 03, 2024
Via arXiv

Towards More Unified In-context Visual Understanding

Dec 05, 2023
Via arXiv

Mesh-Guided Neural Implicit Field Editing

Dec 04, 2023
Via arXiv

TrafficMOT: A Challenging Dataset for Multi-Object Tracking in Complex Traffic Scenarios

Nov 30, 2023
Via arXiv

Video-Bench: A Comprehensive Benchmark and Toolkit for Evaluating Video-based Large Language Models

Nov 28, 2023
Via arXiv

Traffic Video Object Detection using Motion Prior

Nov 16, 2023
Via arXiv

Uni-COAL: A Unified Framework for Cross-Modality Synthesis and Super-Resolution of MR Images

Nov 14, 2023
Via arXiv

PersonMAE: Person Re-Identification Pre-Training with Masked AutoEncoders

Nov 08, 2023
Via arXiv

On the Hidden Waves of Image

Oct 19, 2023
Via arXiv