Picture for Yingjin Song

Yingjin Song

Burn After Reading: Do Multimodal Large Language Models Truly Capture Order of Events in Image Sequences?

Add code
Jun 12, 2025
Viaarxiv icon

Context-aware Visual Storytelling with Visual Prefix Tuning and Contrastive Learning

Add code
Aug 12, 2024
Figure 1 for Context-aware Visual Storytelling with Visual Prefix Tuning and Contrastive Learning
Figure 2 for Context-aware Visual Storytelling with Visual Prefix Tuning and Contrastive Learning
Figure 3 for Context-aware Visual Storytelling with Visual Prefix Tuning and Contrastive Learning
Figure 4 for Context-aware Visual Storytelling with Visual Prefix Tuning and Contrastive Learning
Viaarxiv icon

Modelling Emotion Dynamics in Song Lyrics with State Space Models

Add code
Oct 17, 2022
Figure 1 for Modelling Emotion Dynamics in Song Lyrics with State Space Models
Figure 2 for Modelling Emotion Dynamics in Song Lyrics with State Space Models
Figure 3 for Modelling Emotion Dynamics in Song Lyrics with State Space Models
Figure 4 for Modelling Emotion Dynamics in Song Lyrics with State Space Models
Viaarxiv icon