Picture for Soo-Hyung Kim

Soo-Hyung Kim

Latent Behavior Diffusion for Sequential Reaction Generation in Dyadic Setting

Add code
May 12, 2025
Viaarxiv icon

Anatomical Attention Alignment representation for Radiology Report Generation

Add code
May 12, 2025
Viaarxiv icon

Rethinking Top Probability from Multi-view for Distracted Driver Behaviour Localization

Add code
Nov 19, 2024
Figure 1 for Rethinking Top Probability from Multi-view for Distracted Driver Behaviour Localization
Figure 2 for Rethinking Top Probability from Multi-view for Distracted Driver Behaviour Localization
Figure 3 for Rethinking Top Probability from Multi-view for Distracted Driver Behaviour Localization
Figure 4 for Rethinking Top Probability from Multi-view for Distracted Driver Behaviour Localization
Viaarxiv icon

Polyp-SES: Automatic Polyp Segmentation with Self-Enriched Semantic Model

Add code
Oct 02, 2024
Viaarxiv icon

KAN-Based Fusion of Dual-Domain for Audio-Driven Facial Landmarks Generation

Add code
Sep 09, 2024
Figure 1 for KAN-Based Fusion of Dual-Domain for Audio-Driven Facial Landmarks Generation
Figure 2 for KAN-Based Fusion of Dual-Domain for Audio-Driven Facial Landmarks Generation
Figure 3 for KAN-Based Fusion of Dual-Domain for Audio-Driven Facial Landmarks Generation
Figure 4 for KAN-Based Fusion of Dual-Domain for Audio-Driven Facial Landmarks Generation
Viaarxiv icon

Leveraging WaveNet for Dynamic Listening Head Modeling from Speech

Add code
Sep 08, 2024
Figure 1 for Leveraging WaveNet for Dynamic Listening Head Modeling from Speech
Figure 2 for Leveraging WaveNet for Dynamic Listening Head Modeling from Speech
Figure 3 for Leveraging WaveNet for Dynamic Listening Head Modeling from Speech
Figure 4 for Leveraging WaveNet for Dynamic Listening Head Modeling from Speech
Viaarxiv icon

Transformer with Leveraged Masked Autoencoder for video-based Pain Assessment

Add code
Sep 08, 2024
Viaarxiv icon

Adaptation of Distinct Semantics for Uncertain Areas in Polyp Segmentation

Add code
May 13, 2024
Viaarxiv icon

DCTM: Dilated Convolutional Transformer Model for Multimodal Engagement Estimation in Conversation

Add code
Jul 31, 2023
Viaarxiv icon

Mental Workload Estimation with Electroencephalogram Signals by Combining Multi-Space Deep Models

Add code
Jul 23, 2023
Viaarxiv icon