Taehwan Kim

Semi-Supervised Audio-Visual Video Action Recognition with Audio Source Localization Guided Mixup

Mar 04, 2025
Via arXiv

RingFormer: Rethinking Recurrent Transformer with Adaptive Level Signals

Feb 18, 2025
Via arXiv

Leveraging 2D Masked Reconstruction for Domain Adaptation of 3D Pose Estimation

Jan 14, 2025
Via arXiv

Zero-shot Text-guided Infinite Image Synthesis with LLM guidance

Jul 17, 2024
Via arXiv

Grid Diffusion Models for Text-to-Video Generation

Mar 30, 2024
Via arXiv

Sound of Story: Multi-modal Storytelling with Audio

Oct 30, 2023
Via arXiv

Effective Slogan Generation with Noise Perturbation

Oct 12, 2023
Via arXiv

Generating Realistic Images from In-the-wild Sounds

Sep 05, 2023
Via arXiv

Technical Report for CVPR 2022 LOVEU AQTC Challenge

Jun 29, 2022
Via arXiv

Understanding Beauty via Deep Facial Features

Apr 17, 2019
Via arXiv