Yova Kementchedjhieva

SPECS: Specificity-Enhanced CLIP-Score for Long Image Caption Evaluation

Sep 04, 2025

CONCAP: Seeing Beyond English with Concepts Retrieval-Augmented Captioning

Jul 27, 2025

The Devil is in the EOS: Sequence Training for Detailed Image Captioning

Jul 26, 2025

LLMs Can Compensate for Deficiencies in Visual Representations

Jun 05, 2025

Overcoming Vocabulary Constraints with Pixel-level Fallback

Apr 02, 2025

JEEM: Vision-Language Understanding in Four Arabic Dialects

Mar 27, 2025

Noise is an Efficient Learner for Zero-Shot Vision-Language Models

Feb 09, 2025

EFSA: Episodic Few-Shot Adaptation for Text-to-Image Retrieval

Nov 28, 2024

CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark

Jun 10, 2024

MuLan: A Study of Fact Mutability in Language Models

Apr 03, 2024