Picture for Gui-Song Xia

Gui-Song Xia

DMTG: One-Shot Differentiable Multi-Task Grouping

Add code
Jul 06, 2024
Viaarxiv icon

Towards Human-Level 3D Relative Pose Estimation: Generalizable, Training-Free, with Single Reference

Add code
Jun 26, 2024
Figure 1 for Towards Human-Level 3D Relative Pose Estimation: Generalizable, Training-Free, with Single Reference
Figure 2 for Towards Human-Level 3D Relative Pose Estimation: Generalizable, Training-Free, with Single Reference
Figure 3 for Towards Human-Level 3D Relative Pose Estimation: Generalizable, Training-Free, with Single Reference
Figure 4 for Towards Human-Level 3D Relative Pose Estimation: Generalizable, Training-Free, with Single Reference
Viaarxiv icon

Optimization-based Structural Pruning for Large Language Models without Back-Propagation

Add code
Jun 15, 2024
Viaarxiv icon

Aux-NAS: Exploiting Auxiliary Labels with Negligibly Extra Inference Cost

Add code
May 09, 2024
Viaarxiv icon

Dual Relation Mining Network for Zero-Shot Learning

Add code
May 06, 2024
Figure 1 for Dual Relation Mining Network for Zero-Shot Learning
Figure 2 for Dual Relation Mining Network for Zero-Shot Learning
Figure 3 for Dual Relation Mining Network for Zero-Shot Learning
Figure 4 for Dual Relation Mining Network for Zero-Shot Learning
Viaarxiv icon

Anchor-based Robust Finetuning of Vision-Language Models

Add code
Apr 09, 2024
Figure 1 for Anchor-based Robust Finetuning of Vision-Language Models
Figure 2 for Anchor-based Robust Finetuning of Vision-Language Models
Figure 3 for Anchor-based Robust Finetuning of Vision-Language Models
Figure 4 for Anchor-based Robust Finetuning of Vision-Language Models
Viaarxiv icon

3D Building Reconstruction from Monocular Remote Sensing Images with Multi-level Supervisions

Add code
Apr 07, 2024
Figure 1 for 3D Building Reconstruction from Monocular Remote Sensing Images with Multi-level Supervisions
Figure 2 for 3D Building Reconstruction from Monocular Remote Sensing Images with Multi-level Supervisions
Figure 3 for 3D Building Reconstruction from Monocular Remote Sensing Images with Multi-level Supervisions
Figure 4 for 3D Building Reconstruction from Monocular Remote Sensing Images with Multi-level Supervisions
Viaarxiv icon

H2RSVLM: Towards Helpful and Honest Remote Sensing Large Vision Language Model

Add code
Mar 29, 2024
Figure 1 for H2RSVLM: Towards Helpful and Honest Remote Sensing Large Vision Language Model
Figure 2 for H2RSVLM: Towards Helpful and Honest Remote Sensing Large Vision Language Model
Figure 3 for H2RSVLM: Towards Helpful and Honest Remote Sensing Large Vision Language Model
Figure 4 for H2RSVLM: Towards Helpful and Honest Remote Sensing Large Vision Language Model
Viaarxiv icon

Unleashing Unlabeled Data: A Paradigm for Cross-View Geo-Localization

Add code
Mar 21, 2024
Figure 1 for Unleashing Unlabeled Data: A Paradigm for Cross-View Geo-Localization
Figure 2 for Unleashing Unlabeled Data: A Paradigm for Cross-View Geo-Localization
Figure 3 for Unleashing Unlabeled Data: A Paradigm for Cross-View Geo-Localization
Figure 4 for Unleashing Unlabeled Data: A Paradigm for Cross-View Geo-Localization
Viaarxiv icon

Learning Cross-view Visual Geo-localization without Ground Truth

Add code
Mar 19, 2024
Figure 1 for Learning Cross-view Visual Geo-localization without Ground Truth
Figure 2 for Learning Cross-view Visual Geo-localization without Ground Truth
Figure 3 for Learning Cross-view Visual Geo-localization without Ground Truth
Figure 4 for Learning Cross-view Visual Geo-localization without Ground Truth
Viaarxiv icon