Picture for Minho Shim

Minho Shim

Multi-Granular Spatio-Temporal Token Merging for Training-Free Acceleration of Video LLMs

Add code
Jul 10, 2025
Viaarxiv icon

Prototypes are Balanced Units for Efficient and Effective Partially Relevant Video Retrieval

Add code
Apr 17, 2025
Viaarxiv icon

Classification Matters: Improving Video Action Detection with Class-Specific Attention

Add code
Jul 29, 2024
Figure 1 for Classification Matters: Improving Video Action Detection with Class-Specific Attention
Figure 2 for Classification Matters: Improving Video Action Detection with Class-Specific Attention
Figure 3 for Classification Matters: Improving Video Action Detection with Class-Specific Attention
Figure 4 for Classification Matters: Improving Video Action Detection with Class-Specific Attention
Viaarxiv icon

Masked Autoencoder for Unsupervised Video Summarization

Add code
Jun 02, 2023
Figure 1 for Masked Autoencoder for Unsupervised Video Summarization
Figure 2 for Masked Autoencoder for Unsupervised Video Summarization
Figure 3 for Masked Autoencoder for Unsupervised Video Summarization
Figure 4 for Masked Autoencoder for Unsupervised Video Summarization
Viaarxiv icon

Decomposed Cross-modal Distillation for RGB-based Temporal Action Detection

Add code
Mar 30, 2023
Figure 1 for Decomposed Cross-modal Distillation for RGB-based Temporal Action Detection
Figure 2 for Decomposed Cross-modal Distillation for RGB-based Temporal Action Detection
Figure 3 for Decomposed Cross-modal Distillation for RGB-based Temporal Action Detection
Figure 4 for Decomposed Cross-modal Distillation for RGB-based Temporal Action Detection
Viaarxiv icon

Exploring Temporally Dynamic Data Augmentation for Video Recognition

Add code
Jun 30, 2022
Figure 1 for Exploring Temporally Dynamic Data Augmentation for Video Recognition
Figure 2 for Exploring Temporally Dynamic Data Augmentation for Video Recognition
Figure 3 for Exploring Temporally Dynamic Data Augmentation for Video Recognition
Figure 4 for Exploring Temporally Dynamic Data Augmentation for Video Recognition
Viaarxiv icon

Spatiotemporal Augmentation on Selective Frequencies for Video Representation Learning

Add code
Apr 08, 2022
Figure 1 for Spatiotemporal Augmentation on Selective Frequencies for Video Representation Learning
Figure 2 for Spatiotemporal Augmentation on Selective Frequencies for Video Representation Learning
Figure 3 for Spatiotemporal Augmentation on Selective Frequencies for Video Representation Learning
Figure 4 for Spatiotemporal Augmentation on Selective Frequencies for Video Representation Learning
Viaarxiv icon