Adriana Kovashka

A Multimodal Recaptioning Framework to Account for Perceptual Diversity in Multilingual Vision-Language Modeling

Apr 19, 2025
Via arXiv

Investigating and Improving Counter-Stereotypical Action Relation in Text-to-Image Diffusion Models

Mar 13, 2025
Via arXiv

Towards Generalization of Tactile Image Generation: Reference-Free Evaluation in a Leakage-Free Setting

Mar 10, 2025
Via arXiv

CAP: Evaluation of Persuasive and Creative Image Generation

Dec 10, 2024
Via arXiv

Quantifying the Gaps Between Translation and Native Perception in Training for Multimodal, Multilingual Retrieval

Oct 02, 2024
Via arXiv

Benchmarking VLMs' Reasoning About Persuasive Atypical Images

Sep 16, 2024
Via arXiv

Integrating Audio Narrations to Strengthen Domain Generalization in Multimodal First-Person Action Recognition

Sep 15, 2024
Via arXiv

Enhancing Weakly-Supervised Object Detection on Static Images through (Hallucinated) Motion

Sep 15, 2024
Via arXiv

What metrics of participation balance predict outcomes of collaborative learning with a robot?

May 17, 2024
Via arXiv

Incorporating Geo-Diverse Knowledge into Prompting for Increased Geographical Robustness in Object Recognition

Jan 03, 2024
Via arXiv