Picture for Song Bai

Song Bai

Alibaba Group, University of Oxford

MOSE: A New Dataset for Video Object Segmentation in Complex Scenes

Add code
Feb 03, 2023
Viaarxiv icon

PV3D: A 3D Generative Model for Portrait Video Generation

Add code
Dec 13, 2022
Figure 1 for PV3D: A 3D Generative Model for Portrait Video Generation
Figure 2 for PV3D: A 3D Generative Model for Portrait Video Generation
Figure 3 for PV3D: A 3D Generative Model for Portrait Video Generation
Figure 4 for PV3D: A 3D Generative Model for Portrait Video Generation
Viaarxiv icon

Language-driven Open-Vocabulary 3D Scene Understanding

Add code
Nov 29, 2022
Figure 1 for Language-driven Open-Vocabulary 3D Scene Understanding
Figure 2 for Language-driven Open-Vocabulary 3D Scene Understanding
Figure 3 for Language-driven Open-Vocabulary 3D Scene Understanding
Figure 4 for Language-driven Open-Vocabulary 3D Scene Understanding
Viaarxiv icon

LUMix: Improving Mixup by Better Modelling Label Uncertainty

Add code
Nov 29, 2022
Figure 1 for LUMix: Improving Mixup by Better Modelling Label Uncertainty
Figure 2 for LUMix: Improving Mixup by Better Modelling Label Uncertainty
Figure 3 for LUMix: Improving Mixup by Better Modelling Label Uncertainty
Figure 4 for LUMix: Improving Mixup by Better Modelling Label Uncertainty
Viaarxiv icon

The Runner-up Solution for YouTube-VIS Long Video Challenge 2022

Add code
Nov 18, 2022
Viaarxiv icon

Holistically-Attracted Wireframe Parsing: From Supervised to Self-Supervised Learning

Add code
Oct 24, 2022
Viaarxiv icon

Is synthetic data from generative models ready for image recognition?

Add code
Oct 14, 2022
Figure 1 for Is synthetic data from generative models ready for image recognition?
Figure 2 for Is synthetic data from generative models ready for image recognition?
Figure 3 for Is synthetic data from generative models ready for image recognition?
Figure 4 for Is synthetic data from generative models ready for image recognition?
Viaarxiv icon

Towards Understanding and Mitigating Dimensional Collapse in Heterogeneous Federated Learning

Add code
Oct 01, 2022
Figure 1 for Towards Understanding and Mitigating Dimensional Collapse in Heterogeneous Federated Learning
Figure 2 for Towards Understanding and Mitigating Dimensional Collapse in Heterogeneous Federated Learning
Figure 3 for Towards Understanding and Mitigating Dimensional Collapse in Heterogeneous Federated Learning
Figure 4 for Towards Understanding and Mitigating Dimensional Collapse in Heterogeneous Federated Learning
Viaarxiv icon

1st Place Solution to ECCV 2022 Challenge on Out of Vocabulary Scene Text Understanding: Cropped Word Recognition

Add code
Aug 04, 2022
Figure 1 for 1st Place Solution to ECCV 2022 Challenge on Out of Vocabulary Scene Text Understanding: Cropped Word Recognition
Figure 2 for 1st Place Solution to ECCV 2022 Challenge on Out of Vocabulary Scene Text Understanding: Cropped Word Recognition
Figure 3 for 1st Place Solution to ECCV 2022 Challenge on Out of Vocabulary Scene Text Understanding: Cropped Word Recognition
Figure 4 for 1st Place Solution to ECCV 2022 Challenge on Out of Vocabulary Scene Text Understanding: Cropped Word Recognition
Viaarxiv icon

Explicit Occlusion Reasoning for Multi-person 3D Human Pose Estimation

Add code
Jul 29, 2022
Figure 1 for Explicit Occlusion Reasoning for Multi-person 3D Human Pose Estimation
Figure 2 for Explicit Occlusion Reasoning for Multi-person 3D Human Pose Estimation
Figure 3 for Explicit Occlusion Reasoning for Multi-person 3D Human Pose Estimation
Figure 4 for Explicit Occlusion Reasoning for Multi-person 3D Human Pose Estimation
Viaarxiv icon