Zeynep Akata

Audio-Visual Generalized Zero-Shot Learning using Pre-Trained Large Multi-Modal Models

Apr 09, 2024
Via arXiv

Opening the Black-Box: A Systematic Review on Explainable AI in Remote Sensing

Feb 21, 2024
Via arXiv

How should the advent of large language models affect the practice of science?

Dec 05, 2023
Via arXiv

Unbalancedness in Neural Monge Maps Improves Unpaired Domain Translation

Nov 25, 2023
Via arXiv

Zero-shot audio captioning with audio-language model guidance and audio context keywords

Nov 14, 2023
Via arXiv

Zero-shot Translation of Attention Patterns in VQA Models to Natural Language

Nov 08, 2023
Via arXiv

Fantastic Gains and Where to Find Them: On the Existence and Prospect of General Knowledge Transfer between Any Pretrained Model

Oct 26, 2023
Via arXiv

Transitivity Recovering Decompositions: Interpretable and Robust Fine-Grained Relationships

Oct 24, 2023
Via arXiv

Vision-by-Language for Training-Free Compositional Image Retrieval

Oct 13, 2023
Via arXiv

Video-adverb retrieval with compositional adverb-action embeddings

Sep 26, 2023
Via arXiv