Picture for Dan Xu

Dan Xu

SegGen: Supercharging Segmentation Models with Text2Mask and Mask2Img Synthesis

Add code
Nov 06, 2023
Viaarxiv icon

CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detection

Add code
Oct 04, 2023
Viaarxiv icon

MCTformer+: Multi-Class Token Transformer for Weakly Supervised Semantic Segmentation

Add code
Aug 06, 2023
Figure 1 for MCTformer+: Multi-Class Token Transformer for Weakly Supervised Semantic Segmentation
Figure 2 for MCTformer+: Multi-Class Token Transformer for Weakly Supervised Semantic Segmentation
Figure 3 for MCTformer+: Multi-Class Token Transformer for Weakly Supervised Semantic Segmentation
Figure 4 for MCTformer+: Multi-Class Token Transformer for Weakly Supervised Semantic Segmentation
Viaarxiv icon

Learning Unified Decompositional and Compositional NeRF for Editable Novel View Synthesis

Add code
Aug 05, 2023
Figure 1 for Learning Unified Decompositional and Compositional NeRF for Editable Novel View Synthesis
Figure 2 for Learning Unified Decompositional and Compositional NeRF for Editable Novel View Synthesis
Figure 3 for Learning Unified Decompositional and Compositional NeRF for Editable Novel View Synthesis
Figure 4 for Learning Unified Decompositional and Compositional NeRF for Editable Novel View Synthesis
Viaarxiv icon

TaskExpert: Dynamically Assembling Multi-Task Representations with Memorial Mixture-of-Experts

Add code
Jul 28, 2023
Viaarxiv icon

Implicit Identity Representation Conditioned Memory Compensation Network for Talking Head video Generation

Add code
Jul 20, 2023
Viaarxiv icon

Contrastive Multi-Task Dense Prediction

Add code
Jul 16, 2023
Figure 1 for Contrastive Multi-Task Dense Prediction
Figure 2 for Contrastive Multi-Task Dense Prediction
Figure 3 for Contrastive Multi-Task Dense Prediction
Figure 4 for Contrastive Multi-Task Dense Prediction
Viaarxiv icon

InvPT++: Inverted Pyramid Multi-Task Transformer for Visual Scene Understanding

Add code
Jun 08, 2023
Figure 1 for InvPT++: Inverted Pyramid Multi-Task Transformer for Visual Scene Understanding
Figure 2 for InvPT++: Inverted Pyramid Multi-Task Transformer for Visual Scene Understanding
Figure 3 for InvPT++: Inverted Pyramid Multi-Task Transformer for Visual Scene Understanding
Figure 4 for InvPT++: Inverted Pyramid Multi-Task Transformer for Visual Scene Understanding
Viaarxiv icon

DaGAN++: Depth-Aware Generative Adversarial Network for Talking Head Video Generation

Add code
May 10, 2023
Viaarxiv icon

DetCLIPv2: Scalable Open-Vocabulary Object Detection Pre-training via Word-Region Alignment

Add code
Apr 10, 2023
Figure 1 for DetCLIPv2: Scalable Open-Vocabulary Object Detection Pre-training via Word-Region Alignment
Figure 2 for DetCLIPv2: Scalable Open-Vocabulary Object Detection Pre-training via Word-Region Alignment
Figure 3 for DetCLIPv2: Scalable Open-Vocabulary Object Detection Pre-training via Word-Region Alignment
Figure 4 for DetCLIPv2: Scalable Open-Vocabulary Object Detection Pre-training via Word-Region Alignment
Viaarxiv icon