Picture for Sangyoun Lee

Sangyoun Lee

OTT-Vid: Optimal Transport Temporal Token Compression for Video Large Language Models

Add code
May 12, 2026
Viaarxiv icon

Seen-to-Scene: Keep the Seen, Generate the Unseen for Video Outpainting

Add code
Apr 16, 2026
Viaarxiv icon

CMTM: Cross-Modal Token Modulation for Unsupervised Video Object Segmentation

Add code
Apr 16, 2026
Viaarxiv icon

MoRGS: Efficient Per-Gaussian Motion Reasoning for Streamable Dynamic 3D Scenes

Add code
Mar 26, 2026
Viaarxiv icon

Revisiting Weakly-Supervised Video Scene Graph Generation via Pair Affinity Learning

Add code
Mar 23, 2026
Viaarxiv icon

MonoCLUE : Object-Aware Clustering Enhances Monocular 3D Object Detection

Add code
Nov 11, 2025
Viaarxiv icon

DualFocus: Depth from Focus with Spatio-Focal Dual Variational Constraints

Add code
Sep 26, 2025
Viaarxiv icon

GenCLIP: Generalizing CLIP Prompts for Zero-shot Anomaly Detection

Add code
Apr 21, 2025
Viaarxiv icon

CoMoGaussian: Continuous Motion-Aware Gaussian Splatting from Motion-Blurred Images

Add code
Mar 07, 2025
Viaarxiv icon

Find First, Track Next: Decoupling Identification and Propagation in Referring Video Object Segmentation

Add code
Mar 05, 2025
Figure 1 for Find First, Track Next: Decoupling Identification and Propagation in Referring Video Object Segmentation
Figure 2 for Find First, Track Next: Decoupling Identification and Propagation in Referring Video Object Segmentation
Figure 3 for Find First, Track Next: Decoupling Identification and Propagation in Referring Video Object Segmentation
Figure 4 for Find First, Track Next: Decoupling Identification and Propagation in Referring Video Object Segmentation
Viaarxiv icon