Picture for Taehwan Kim

Taehwan Kim

Grouped Differential Attention

Add code
Oct 08, 2025
Viaarxiv icon

Towards Human-like Multimodal Conversational Agent by Generating Engaging Speech

Add code
Sep 18, 2025
Viaarxiv icon

Semi-Supervised Audio-Visual Video Action Recognition with Audio Source Localization Guided Mixup

Add code
Mar 04, 2025
Viaarxiv icon

RingFormer: Rethinking Recurrent Transformer with Adaptive Level Signals

Add code
Feb 18, 2025
Figure 1 for RingFormer: Rethinking Recurrent Transformer with Adaptive Level Signals
Figure 2 for RingFormer: Rethinking Recurrent Transformer with Adaptive Level Signals
Figure 3 for RingFormer: Rethinking Recurrent Transformer with Adaptive Level Signals
Figure 4 for RingFormer: Rethinking Recurrent Transformer with Adaptive Level Signals
Viaarxiv icon

Leveraging 2D Masked Reconstruction for Domain Adaptation of 3D Pose Estimation

Add code
Jan 14, 2025
Viaarxiv icon

Zero-shot Text-guided Infinite Image Synthesis with LLM guidance

Add code
Jul 17, 2024
Viaarxiv icon

Grid Diffusion Models for Text-to-Video Generation

Add code
Mar 30, 2024
Figure 1 for Grid Diffusion Models for Text-to-Video Generation
Figure 2 for Grid Diffusion Models for Text-to-Video Generation
Figure 3 for Grid Diffusion Models for Text-to-Video Generation
Figure 4 for Grid Diffusion Models for Text-to-Video Generation
Viaarxiv icon

Sound of Story: Multi-modal Storytelling with Audio

Add code
Oct 30, 2023
Figure 1 for Sound of Story: Multi-modal Storytelling with Audio
Figure 2 for Sound of Story: Multi-modal Storytelling with Audio
Figure 3 for Sound of Story: Multi-modal Storytelling with Audio
Figure 4 for Sound of Story: Multi-modal Storytelling with Audio
Viaarxiv icon

Effective Slogan Generation with Noise Perturbation

Add code
Oct 12, 2023
Figure 1 for Effective Slogan Generation with Noise Perturbation
Figure 2 for Effective Slogan Generation with Noise Perturbation
Figure 3 for Effective Slogan Generation with Noise Perturbation
Figure 4 for Effective Slogan Generation with Noise Perturbation
Viaarxiv icon

Generating Realistic Images from In-the-wild Sounds

Add code
Sep 05, 2023
Viaarxiv icon