Picture for Adriana Kovashka

Adriana Kovashka

Learning Consistent Temporal Grounding between Related Tasks in Sports Coaching

Add code
Mar 19, 2026
Viaarxiv icon

Generalizing Sports Feedback Generation by Watching Competitions and Reading Books: A Rock Climbing Case Study

Add code
Feb 09, 2026
Viaarxiv icon

Culture in Action: Evaluating Text-to-Image Models through Social Activities

Add code
Nov 07, 2025
Viaarxiv icon

A Multimodal Recaptioning Framework to Account for Perceptual Diversity in Multilingual Vision-Language Modeling

Add code
Apr 19, 2025
Figure 1 for A Multimodal Recaptioning Framework to Account for Perceptual Diversity in Multilingual Vision-Language Modeling
Figure 2 for A Multimodal Recaptioning Framework to Account for Perceptual Diversity in Multilingual Vision-Language Modeling
Figure 3 for A Multimodal Recaptioning Framework to Account for Perceptual Diversity in Multilingual Vision-Language Modeling
Figure 4 for A Multimodal Recaptioning Framework to Account for Perceptual Diversity in Multilingual Vision-Language Modeling
Viaarxiv icon

Investigating and Improving Counter-Stereotypical Action Relation in Text-to-Image Diffusion Models

Add code
Mar 13, 2025
Viaarxiv icon

Towards Generalization of Tactile Image Generation: Reference-Free Evaluation in a Leakage-Free Setting

Add code
Mar 10, 2025
Figure 1 for Towards Generalization of Tactile Image Generation: Reference-Free Evaluation in a Leakage-Free Setting
Figure 2 for Towards Generalization of Tactile Image Generation: Reference-Free Evaluation in a Leakage-Free Setting
Figure 3 for Towards Generalization of Tactile Image Generation: Reference-Free Evaluation in a Leakage-Free Setting
Figure 4 for Towards Generalization of Tactile Image Generation: Reference-Free Evaluation in a Leakage-Free Setting
Viaarxiv icon

CAP: Evaluation of Persuasive and Creative Image Generation

Add code
Dec 10, 2024
Figure 1 for CAP: Evaluation of Persuasive and Creative Image Generation
Figure 2 for CAP: Evaluation of Persuasive and Creative Image Generation
Figure 3 for CAP: Evaluation of Persuasive and Creative Image Generation
Figure 4 for CAP: Evaluation of Persuasive and Creative Image Generation
Viaarxiv icon

Quantifying the Gaps Between Translation and Native Perception in Training for Multimodal, Multilingual Retrieval

Add code
Oct 02, 2024
Figure 1 for Quantifying the Gaps Between Translation and Native Perception in Training for Multimodal, Multilingual Retrieval
Figure 2 for Quantifying the Gaps Between Translation and Native Perception in Training for Multimodal, Multilingual Retrieval
Figure 3 for Quantifying the Gaps Between Translation and Native Perception in Training for Multimodal, Multilingual Retrieval
Figure 4 for Quantifying the Gaps Between Translation and Native Perception in Training for Multimodal, Multilingual Retrieval
Viaarxiv icon

Benchmarking VLMs' Reasoning About Persuasive Atypical Images

Add code
Sep 16, 2024
Figure 1 for Benchmarking VLMs' Reasoning About Persuasive Atypical Images
Figure 2 for Benchmarking VLMs' Reasoning About Persuasive Atypical Images
Figure 3 for Benchmarking VLMs' Reasoning About Persuasive Atypical Images
Figure 4 for Benchmarking VLMs' Reasoning About Persuasive Atypical Images
Viaarxiv icon

Integrating Audio Narrations to Strengthen Domain Generalization in Multimodal First-Person Action Recognition

Add code
Sep 15, 2024
Figure 1 for Integrating Audio Narrations to Strengthen Domain Generalization in Multimodal First-Person Action Recognition
Figure 2 for Integrating Audio Narrations to Strengthen Domain Generalization in Multimodal First-Person Action Recognition
Figure 3 for Integrating Audio Narrations to Strengthen Domain Generalization in Multimodal First-Person Action Recognition
Figure 4 for Integrating Audio Narrations to Strengthen Domain Generalization in Multimodal First-Person Action Recognition
Viaarxiv icon