Picture for Adriana Kovashka

Adriana Kovashka

A Multimodal Recaptioning Framework to Account for Perceptual Diversity in Multilingual Vision-Language Modeling

Add code
Apr 19, 2025
Viaarxiv icon

Investigating and Improving Counter-Stereotypical Action Relation in Text-to-Image Diffusion Models

Add code
Mar 13, 2025
Viaarxiv icon

Towards Generalization of Tactile Image Generation: Reference-Free Evaluation in a Leakage-Free Setting

Add code
Mar 10, 2025
Viaarxiv icon

CAP: Evaluation of Persuasive and Creative Image Generation

Add code
Dec 10, 2024
Viaarxiv icon

Quantifying the Gaps Between Translation and Native Perception in Training for Multimodal, Multilingual Retrieval

Add code
Oct 02, 2024
Figure 1 for Quantifying the Gaps Between Translation and Native Perception in Training for Multimodal, Multilingual Retrieval
Figure 2 for Quantifying the Gaps Between Translation and Native Perception in Training for Multimodal, Multilingual Retrieval
Figure 3 for Quantifying the Gaps Between Translation and Native Perception in Training for Multimodal, Multilingual Retrieval
Figure 4 for Quantifying the Gaps Between Translation and Native Perception in Training for Multimodal, Multilingual Retrieval
Viaarxiv icon

Benchmarking VLMs' Reasoning About Persuasive Atypical Images

Add code
Sep 16, 2024
Figure 1 for Benchmarking VLMs' Reasoning About Persuasive Atypical Images
Figure 2 for Benchmarking VLMs' Reasoning About Persuasive Atypical Images
Figure 3 for Benchmarking VLMs' Reasoning About Persuasive Atypical Images
Figure 4 for Benchmarking VLMs' Reasoning About Persuasive Atypical Images
Viaarxiv icon

Integrating Audio Narrations to Strengthen Domain Generalization in Multimodal First-Person Action Recognition

Add code
Sep 15, 2024
Figure 1 for Integrating Audio Narrations to Strengthen Domain Generalization in Multimodal First-Person Action Recognition
Figure 2 for Integrating Audio Narrations to Strengthen Domain Generalization in Multimodal First-Person Action Recognition
Figure 3 for Integrating Audio Narrations to Strengthen Domain Generalization in Multimodal First-Person Action Recognition
Figure 4 for Integrating Audio Narrations to Strengthen Domain Generalization in Multimodal First-Person Action Recognition
Viaarxiv icon

Enhancing Weakly-Supervised Object Detection on Static Images through (Hallucinated) Motion

Add code
Sep 15, 2024
Viaarxiv icon

What metrics of participation balance predict outcomes of collaborative learning with a robot?

Add code
May 17, 2024
Viaarxiv icon

Incorporating Geo-Diverse Knowledge into Prompting for Increased Geographical Robustness in Object Recognition

Add code
Jan 03, 2024
Viaarxiv icon