Picture for Xiaoshan Yang

Xiaoshan Yang

Libra: Building Decoupled Vision System on Large Language Models

Add code
May 16, 2024
Viaarxiv icon

HiVG: Hierarchical Multimodal Fine-grained Modulation for Visual Grounding

Add code
Apr 20, 2024
Viaarxiv icon

Exploring Multi-Modal Contextual Knowledge for Open-Vocabulary Object Detection

Aug 30, 2023
Figure 1 for Exploring Multi-Modal Contextual Knowledge for Open-Vocabulary Object Detection
Figure 2 for Exploring Multi-Modal Contextual Knowledge for Open-Vocabulary Object Detection
Figure 3 for Exploring Multi-Modal Contextual Knowledge for Open-Vocabulary Object Detection
Figure 4 for Exploring Multi-Modal Contextual Knowledge for Open-Vocabulary Object Detection
Viaarxiv icon

Multi-modal Queried Object Detection in the Wild

Add code
May 30, 2023
Figure 1 for Multi-modal Queried Object Detection in the Wild
Figure 2 for Multi-modal Queried Object Detection in the Wild
Figure 3 for Multi-modal Queried Object Detection in the Wild
Figure 4 for Multi-modal Queried Object Detection in the Wild
Viaarxiv icon

CLIP-VG: Self-paced Curriculum Adapting of CLIP via Exploiting Pseudo-Language Labels for Visual Grounding

Add code
May 15, 2023
Figure 1 for CLIP-VG: Self-paced Curriculum Adapting of CLIP via Exploiting Pseudo-Language Labels for Visual Grounding
Figure 2 for CLIP-VG: Self-paced Curriculum Adapting of CLIP via Exploiting Pseudo-Language Labels for Visual Grounding
Figure 3 for CLIP-VG: Self-paced Curriculum Adapting of CLIP via Exploiting Pseudo-Language Labels for Visual Grounding
Figure 4 for CLIP-VG: Self-paced Curriculum Adapting of CLIP via Exploiting Pseudo-Language Labels for Visual Grounding
Viaarxiv icon

SgVA-CLIP: Semantic-guided Visual Adapting of Vision-Language Models for Few-shot Image Classification

Nov 28, 2022
Figure 1 for SgVA-CLIP: Semantic-guided Visual Adapting of Vision-Language Models for Few-shot Image Classification
Figure 2 for SgVA-CLIP: Semantic-guided Visual Adapting of Vision-Language Models for Few-shot Image Classification
Figure 3 for SgVA-CLIP: Semantic-guided Visual Adapting of Vision-Language Models for Few-shot Image Classification
Figure 4 for SgVA-CLIP: Semantic-guided Visual Adapting of Vision-Language Models for Few-shot Image Classification
Viaarxiv icon

Shifting More Attention to Visual Backbone: Query-modulated Refinement Networks for End-to-End Visual Grounding

Add code
Mar 29, 2022
Figure 1 for Shifting More Attention to Visual Backbone: Query-modulated Refinement Networks for End-to-End Visual Grounding
Figure 2 for Shifting More Attention to Visual Backbone: Query-modulated Refinement Networks for End-to-End Visual Grounding
Figure 3 for Shifting More Attention to Visual Backbone: Query-modulated Refinement Networks for End-to-End Visual Grounding
Figure 4 for Shifting More Attention to Visual Backbone: Query-modulated Refinement Networks for End-to-End Visual Grounding
Viaarxiv icon

Dynamic Hypergraph Convolutional Networks for Skeleton-Based Action Recognition

Dec 20, 2021
Figure 1 for Dynamic Hypergraph Convolutional Networks for Skeleton-Based Action Recognition
Figure 2 for Dynamic Hypergraph Convolutional Networks for Skeleton-Based Action Recognition
Figure 3 for Dynamic Hypergraph Convolutional Networks for Skeleton-Based Action Recognition
Figure 4 for Dynamic Hypergraph Convolutional Networks for Skeleton-Based Action Recognition
Viaarxiv icon

ECKPN: Explicit Class Knowledge Propagation Network for Transductive Few-shot Learning

Jun 16, 2021
Figure 1 for ECKPN: Explicit Class Knowledge Propagation Network for Transductive Few-shot Learning
Figure 2 for ECKPN: Explicit Class Knowledge Propagation Network for Transductive Few-shot Learning
Figure 3 for ECKPN: Explicit Class Knowledge Propagation Network for Transductive Few-shot Learning
Figure 4 for ECKPN: Explicit Class Knowledge Propagation Network for Transductive Few-shot Learning
Viaarxiv icon

Health Status Prediction with Local-Global Heterogeneous Behavior Graph

Mar 23, 2021
Figure 1 for Health Status Prediction with Local-Global Heterogeneous Behavior Graph
Figure 2 for Health Status Prediction with Local-Global Heterogeneous Behavior Graph
Figure 3 for Health Status Prediction with Local-Global Heterogeneous Behavior Graph
Figure 4 for Health Status Prediction with Local-Global Heterogeneous Behavior Graph
Viaarxiv icon