Picture for Yifei Ming

Yifei Ming

Is A Picture Worth A Thousand Words? Delving Into Spatial Reasoning for Vision Language Models

Add code
Jun 21, 2024
Figure 1 for Is A Picture Worth A Thousand Words? Delving Into Spatial Reasoning for Vision Language Models
Figure 2 for Is A Picture Worth A Thousand Words? Delving Into Spatial Reasoning for Vision Language Models
Figure 3 for Is A Picture Worth A Thousand Words? Delving Into Spatial Reasoning for Vision Language Models
Figure 4 for Is A Picture Worth A Thousand Words? Delving Into Spatial Reasoning for Vision Language Models
Viaarxiv icon

Understanding Retrieval-Augmented Task Adaptation for Vision-Language Models

Add code
May 02, 2024
Viaarxiv icon

Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models

Add code
Mar 29, 2024
Figure 1 for Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models
Figure 2 for Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models
Figure 3 for Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models
Figure 4 for Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models
Viaarxiv icon

HYPO: Hyperspherical Out-of-Distribution Generalization

Add code
Feb 12, 2024
Viaarxiv icon

How Does Fine-Tuning Impact Out-of-Distribution Detection for Vision-Language Models?

Add code
Jun 09, 2023
Figure 1 for How Does Fine-Tuning Impact Out-of-Distribution Detection for Vision-Language Models?
Figure 2 for How Does Fine-Tuning Impact Out-of-Distribution Detection for Vision-Language Models?
Figure 3 for How Does Fine-Tuning Impact Out-of-Distribution Detection for Vision-Language Models?
Figure 4 for How Does Fine-Tuning Impact Out-of-Distribution Detection for Vision-Language Models?
Viaarxiv icon

Domain Generalization via Nuclear Norm Regularization

Add code
Mar 13, 2023
Figure 1 for Domain Generalization via Nuclear Norm Regularization
Figure 2 for Domain Generalization via Nuclear Norm Regularization
Figure 3 for Domain Generalization via Nuclear Norm Regularization
Viaarxiv icon

Delving into Out-of-Distribution Detection with Vision-Language Representations

Add code
Nov 24, 2022
Figure 1 for Delving into Out-of-Distribution Detection with Vision-Language Representations
Figure 2 for Delving into Out-of-Distribution Detection with Vision-Language Representations
Figure 3 for Delving into Out-of-Distribution Detection with Vision-Language Representations
Figure 4 for Delving into Out-of-Distribution Detection with Vision-Language Representations
Viaarxiv icon

POEM: Out-of-Distribution Detection with Posterior Sampling

Add code
Jun 28, 2022
Figure 1 for POEM: Out-of-Distribution Detection with Posterior Sampling
Figure 2 for POEM: Out-of-Distribution Detection with Posterior Sampling
Figure 3 for POEM: Out-of-Distribution Detection with Posterior Sampling
Figure 4 for POEM: Out-of-Distribution Detection with Posterior Sampling
Viaarxiv icon

Utilizing Language-Image Pretraining for Efficient and Robust Bilingual Word Alignment

Add code
May 23, 2022
Figure 1 for Utilizing Language-Image Pretraining for Efficient and Robust Bilingual Word Alignment
Figure 2 for Utilizing Language-Image Pretraining for Efficient and Robust Bilingual Word Alignment
Figure 3 for Utilizing Language-Image Pretraining for Efficient and Robust Bilingual Word Alignment
Figure 4 for Utilizing Language-Image Pretraining for Efficient and Robust Bilingual Word Alignment
Viaarxiv icon

Out-of-distribution Detection with Deep Nearest Neighbors

Add code
Apr 13, 2022
Figure 1 for Out-of-distribution Detection with Deep Nearest Neighbors
Figure 2 for Out-of-distribution Detection with Deep Nearest Neighbors
Figure 3 for Out-of-distribution Detection with Deep Nearest Neighbors
Figure 4 for Out-of-distribution Detection with Deep Nearest Neighbors
Viaarxiv icon