Xuming He

SPHINX: The Joint Mixing of Weights, Tasks, and Visual Embeddings for Multi-modal Large Language Models

Nov 13, 2023
Via arXiv

The Robust Semantic Segmentation UNCV2023 Challenge Results

Sep 27, 2023
Via arXiv

ATTA: Anomaly-aware Test-Time Adaptation for Out-of-Distribution Detection in Segmentation

Sep 12, 2023
Via arXiv

Grounded Image Text Matching with Mismatched Relation Reasoning

Aug 04, 2023
Via arXiv

Human-centric Scene Understanding for 3D Large-scale Scenarios

Jul 26, 2023
Via arXiv

MILD: Modeling the Instance Learning Dynamics for Learning with Noisy Labels

Jun 20, 2023
Via arXiv

Scalable wavelength-multiplexing photonic reservoir computing

May 24, 2023
Via arXiv

HOICLIP: Efficient Knowledge Transfer for HOI Detection with Vision-Language Models

Mar 29, 2023
Via arXiv

Weakly-supervised HOI Detection via Prior-guided Bi-level Representation Learning

Mar 02, 2023
Via arXiv

Dynamic Grained Encoder for Vision Transformers

Jan 10, 2023
Via arXiv