Picture for Yerim So

Yerim So

Spatio-Temporal Similarity Volume Aggregation for Open-Vocabulary Action Recognition

Add code
May 22, 2026
Viaarxiv icon

Enhancing Alignment for Unified Multimodal Models via Semantically-Grounded Supervision

Add code
Mar 20, 2026
Viaarxiv icon