Minseok Kang

Towards Direct Evaluation of Harness Optimizers via Priority Ranking

May 21, 2026
Via arXiv

OTT-Vid: Optimal Transport Temporal Token Compression for Video Large Language Models

May 12, 2026
Via arXiv

CMTM: Cross-Modal Token Modulation for Unsupervised Video Object Segmentation

Apr 16, 2026
Via arXiv

Seen-to-Scene: Keep the Seen, Generate the Unseen for Video Outpainting

Apr 16, 2026
Via arXiv

LIBERO-Para: A Diagnostic Benchmark and Metrics for Paraphrase Robustness in VLA Models

Mar 30, 2026
Via arXiv

Revisiting Weakly-Supervised Video Scene Graph Generation via Pair Affinity Learning

Mar 23, 2026
Via arXiv

Web-Shepherd: Advancing PRMs for Reinforcing Web Agents

May 21, 2025
Via arXiv

GenCLIP: Generalizing CLIP Prompts for Zero-shot Anomaly Detection

Apr 21, 2025
Via arXiv

Cactus: Towards Psychological Counseling Conversations using Cognitive Behavioral Theory

Jul 03, 2024
Via arXiv

Diverse and Admissible Trajectory Forecasting through Multimodal Context Understanding

Apr 03, 2020
Via arXiv