Picture for Xiao Dong

Xiao Dong

OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion

Add code
Jul 10, 2024
Viaarxiv icon

SurgicalGaussian: Deformable 3D Gaussians for High-Fidelity Surgical Scene Reconstruction

Add code
Jul 06, 2024
Viaarxiv icon

ConsistentID: Portrait Generation with Multimodal Fine-Grained Identity Preserving

Add code
Apr 25, 2024
Viaarxiv icon

DRSM: efficient neural 4d decomposition for dynamic reconstruction in stationary monocular cameras

Add code
Feb 01, 2024
Viaarxiv icon

UniDiff: Advancing Vision-Language Models with Generative and Discriminative Learning

Add code
Jun 01, 2023
Figure 1 for UniDiff: Advancing Vision-Language Models with Generative and Discriminative Learning
Figure 2 for UniDiff: Advancing Vision-Language Models with Generative and Discriminative Learning
Figure 3 for UniDiff: Advancing Vision-Language Models with Generative and Discriminative Learning
Figure 4 for UniDiff: Advancing Vision-Language Models with Generative and Discriminative Learning
Viaarxiv icon

Entity-Graph Enhanced Cross-Modal Pretraining for Instance-level Product Retrieval

Add code
Jun 17, 2022
Figure 1 for Entity-Graph Enhanced Cross-Modal Pretraining for Instance-level Product Retrieval
Figure 2 for Entity-Graph Enhanced Cross-Modal Pretraining for Instance-level Product Retrieval
Figure 3 for Entity-Graph Enhanced Cross-Modal Pretraining for Instance-level Product Retrieval
Figure 4 for Entity-Graph Enhanced Cross-Modal Pretraining for Instance-level Product Retrieval
Viaarxiv icon

Worst-Case Dynamic Power Distribution Network Noise Prediction Using Convolutional Neural Network

Add code
Apr 27, 2022
Figure 1 for Worst-Case Dynamic Power Distribution Network Noise Prediction Using Convolutional Neural Network
Figure 2 for Worst-Case Dynamic Power Distribution Network Noise Prediction Using Convolutional Neural Network
Figure 3 for Worst-Case Dynamic Power Distribution Network Noise Prediction Using Convolutional Neural Network
Figure 4 for Worst-Case Dynamic Power Distribution Network Noise Prediction Using Convolutional Neural Network
Viaarxiv icon

elBERto: Self-supervised Commonsense Learning for Question Answering

Add code
Mar 17, 2022
Figure 1 for elBERto: Self-supervised Commonsense Learning for Question Answering
Figure 2 for elBERto: Self-supervised Commonsense Learning for Question Answering
Figure 3 for elBERto: Self-supervised Commonsense Learning for Question Answering
Figure 4 for elBERto: Self-supervised Commonsense Learning for Question Answering
Viaarxiv icon

M5Product: A Multi-modal Pretraining Benchmark for E-commercial Product Downstream Tasks

Add code
Sep 09, 2021
Figure 1 for M5Product: A Multi-modal Pretraining Benchmark for E-commercial Product Downstream Tasks
Figure 2 for M5Product: A Multi-modal Pretraining Benchmark for E-commercial Product Downstream Tasks
Figure 3 for M5Product: A Multi-modal Pretraining Benchmark for E-commercial Product Downstream Tasks
Figure 4 for M5Product: A Multi-modal Pretraining Benchmark for E-commercial Product Downstream Tasks
Viaarxiv icon

Product1M: Towards Weakly Supervised Instance-Level Product Retrieval via Cross-modal Pretraining

Add code
Aug 09, 2021
Figure 1 for Product1M: Towards Weakly Supervised Instance-Level Product Retrieval via Cross-modal Pretraining
Figure 2 for Product1M: Towards Weakly Supervised Instance-Level Product Retrieval via Cross-modal Pretraining
Figure 3 for Product1M: Towards Weakly Supervised Instance-Level Product Retrieval via Cross-modal Pretraining
Figure 4 for Product1M: Towards Weakly Supervised Instance-Level Product Retrieval via Cross-modal Pretraining
Viaarxiv icon