Taeoh Kim

Multi-Granular Spatio-Temporal Token Merging for Training-Free Acceleration of Video LLMs

Jul 10, 2025
Via arXiv

Prototypes are Balanced Units for Efficient and Effective Partially Relevant Video Retrieval

Apr 17, 2025
Via arXiv

CoMoGaussian: Continuous Motion-Aware Gaussian Splatting from Motion-Blurred Images

Mar 07, 2025
Via arXiv

CoCoGaussian: Leveraging Circle of Confusion for Gaussian Splatting from Defocused Images

Dec 20, 2024
Via arXiv

A Simple Baseline with Single-encoder for Referring Image Segmentation

Aug 28, 2024
Via arXiv

Classification Matters: Improving Video Action Detection with Class-Specific Attention

Jul 29, 2024
Via arXiv

Masked Autoencoder for Unsupervised Video Summarization

Jun 02, 2023
Via arXiv

Decomposed Cross-modal Distillation for RGB-based Temporal Action Detection

Mar 30, 2023
Via arXiv

Exploring Temporally Dynamic Data Augmentation for Video Recognition

Jun 30, 2022
Via arXiv

Spatiotemporal Augmentation on Selective Frequencies for Video Representation Learning

Apr 08, 2022
Via arXiv