Picture for Qixiang Ye

Qixiang Ye

University of Chinese Academy of Sciences, Beijing, China

Uncertainty-guided Optimal Transport in Depth Supervised Sparse-View 3D Gaussian

Add code
May 30, 2024
Figure 1 for Uncertainty-guided Optimal Transport in Depth Supervised Sparse-View 3D Gaussian
Figure 2 for Uncertainty-guided Optimal Transport in Depth Supervised Sparse-View 3D Gaussian
Figure 3 for Uncertainty-guided Optimal Transport in Depth Supervised Sparse-View 3D Gaussian
Figure 4 for Uncertainty-guided Optimal Transport in Depth Supervised Sparse-View 3D Gaussian
Viaarxiv icon

vHeat: Building Vision Models upon Heat Conduction

Add code
May 26, 2024
Figure 1 for vHeat: Building Vision Models upon Heat Conduction
Figure 2 for vHeat: Building Vision Models upon Heat Conduction
Figure 3 for vHeat: Building Vision Models upon Heat Conduction
Figure 4 for vHeat: Building Vision Models upon Heat Conduction
Viaarxiv icon

DynRefer: Delving into Region-level Multi-modality Tasks via Dynamic Resolution

Add code
May 25, 2024
Figure 1 for DynRefer: Delving into Region-level Multi-modality Tasks via Dynamic Resolution
Figure 2 for DynRefer: Delving into Region-level Multi-modality Tasks via Dynamic Resolution
Figure 3 for DynRefer: Delving into Region-level Multi-modality Tasks via Dynamic Resolution
Figure 4 for DynRefer: Delving into Region-level Multi-modality Tasks via Dynamic Resolution
Viaarxiv icon

Towards Deviation-Robust Agent Navigation via Perturbation-Aware Contrastive Learning

Add code
Mar 09, 2024
Viaarxiv icon

Virtual Classification: Modulating Domain-Specific Knowledge for Multidomain Crowd Counting

Add code
Feb 06, 2024
Figure 1 for Virtual Classification: Modulating Domain-Specific Knowledge for Multidomain Crowd Counting
Figure 2 for Virtual Classification: Modulating Domain-Specific Knowledge for Multidomain Crowd Counting
Figure 3 for Virtual Classification: Modulating Domain-Specific Knowledge for Multidomain Crowd Counting
Figure 4 for Virtual Classification: Modulating Domain-Specific Knowledge for Multidomain Crowd Counting
Viaarxiv icon

BEAM: Beta Distribution Ray Denoising for Multi-view 3D Object Detection

Add code
Feb 06, 2024
Figure 1 for BEAM: Beta Distribution Ray Denoising for Multi-view 3D Object Detection
Figure 2 for BEAM: Beta Distribution Ray Denoising for Multi-view 3D Object Detection
Figure 3 for BEAM: Beta Distribution Ray Denoising for Multi-view 3D Object Detection
Figure 4 for BEAM: Beta Distribution Ray Denoising for Multi-view 3D Object Detection
Viaarxiv icon

Controllable Dense Captioner with Multimodal Embedding Bridging

Add code
Feb 01, 2024
Viaarxiv icon

CPR++: Object Localization via Single Coarse Point Supervision

Add code
Jan 30, 2024
Figure 1 for CPR++: Object Localization via Single Coarse Point Supervision
Figure 2 for CPR++: Object Localization via Single Coarse Point Supervision
Figure 3 for CPR++: Object Localization via Single Coarse Point Supervision
Figure 4 for CPR++: Object Localization via Single Coarse Point Supervision
Viaarxiv icon

ChatterBox: Multi-round Multimodal Referring and Grounding

Add code
Jan 24, 2024
Viaarxiv icon

VMamba: Visual State Space Model

Add code
Jan 18, 2024
Figure 1 for VMamba: Visual State Space Model
Figure 2 for VMamba: Visual State Space Model
Figure 3 for VMamba: Visual State Space Model
Figure 4 for VMamba: Visual State Space Model
Viaarxiv icon