Picture for Jiyoung Lee

Jiyoung Lee

Seeing What You Say: Expressive Image Generation from Speech

Add code
Nov 05, 2025
Viaarxiv icon

Referee: Reference-aware Audiovisual Deepfake Detection

Add code
Oct 31, 2025
Viaarxiv icon

Trans-EnV: A Framework for Evaluating the Linguistic Robustness of LLMs Against English Varieties

Add code
May 27, 2025
Viaarxiv icon

Bootstrap Your Own Views: Masked Ego-Exo Modeling for Fine-grained View-invariant Video Representations

Add code
Mar 25, 2025
Figure 1 for Bootstrap Your Own Views: Masked Ego-Exo Modeling for Fine-grained View-invariant Video Representations
Figure 2 for Bootstrap Your Own Views: Masked Ego-Exo Modeling for Fine-grained View-invariant Video Representations
Figure 3 for Bootstrap Your Own Views: Masked Ego-Exo Modeling for Fine-grained View-invariant Video Representations
Figure 4 for Bootstrap Your Own Views: Masked Ego-Exo Modeling for Fine-grained View-invariant Video Representations
Viaarxiv icon

Single Ground Truth Is Not Enough: Add Linguistic Variability to Aspect-based Sentiment Analysis Evaluation

Add code
Oct 13, 2024
Figure 1 for Single Ground Truth Is Not Enough: Add Linguistic Variability to Aspect-based Sentiment Analysis Evaluation
Figure 2 for Single Ground Truth Is Not Enough: Add Linguistic Variability to Aspect-based Sentiment Analysis Evaluation
Figure 3 for Single Ground Truth Is Not Enough: Add Linguistic Variability to Aspect-based Sentiment Analysis Evaluation
Figure 4 for Single Ground Truth Is Not Enough: Add Linguistic Variability to Aspect-based Sentiment Analysis Evaluation
Viaarxiv icon

Read, Watch and Scream! Sound Generation from Text and Video

Add code
Jul 08, 2024
Figure 1 for Read, Watch and Scream! Sound Generation from Text and Video
Figure 2 for Read, Watch and Scream! Sound Generation from Text and Video
Figure 3 for Read, Watch and Scream! Sound Generation from Text and Video
Figure 4 for Read, Watch and Scream! Sound Generation from Text and Video
Viaarxiv icon

Bridging Vision and Language Spaces with Assignment Prediction

Add code
Apr 15, 2024
Viaarxiv icon

KorNAT: LLM Alignment Benchmark for Korean Social Values and Common Knowledge

Add code
Feb 22, 2024
Viaarxiv icon

Dense Text-to-Image Generation with Attention Modulation

Add code
Aug 24, 2023
Figure 1 for Dense Text-to-Image Generation with Attention Modulation
Figure 2 for Dense Text-to-Image Generation with Attention Modulation
Figure 3 for Dense Text-to-Image Generation with Attention Modulation
Figure 4 for Dense Text-to-Image Generation with Attention Modulation
Viaarxiv icon

Hierarchical Visual Primitive Experts for Compositional Zero-Shot Learning

Add code
Aug 08, 2023
Figure 1 for Hierarchical Visual Primitive Experts for Compositional Zero-Shot Learning
Figure 2 for Hierarchical Visual Primitive Experts for Compositional Zero-Shot Learning
Figure 3 for Hierarchical Visual Primitive Experts for Compositional Zero-Shot Learning
Figure 4 for Hierarchical Visual Primitive Experts for Compositional Zero-Shot Learning
Viaarxiv icon