Picture for Ye-Chan Kim

Ye-Chan Kim

SAIL: Similarity-Aware Guidance and Inter-Caption Augmentation-based Learning for Weakly-Supervised Dense Video Captioning

Add code
Mar 05, 2026
Viaarxiv icon

Sali4Vid: Saliency-Aware Video Reweighting and Adaptive Caption Retrieval for Dense Video Captioning

Add code
Sep 04, 2025
Figure 1 for Sali4Vid: Saliency-Aware Video Reweighting and Adaptive Caption Retrieval for Dense Video Captioning
Figure 2 for Sali4Vid: Saliency-Aware Video Reweighting and Adaptive Caption Retrieval for Dense Video Captioning
Figure 3 for Sali4Vid: Saliency-Aware Video Reweighting and Adaptive Caption Retrieval for Dense Video Captioning
Figure 4 for Sali4Vid: Saliency-Aware Video Reweighting and Adaptive Caption Retrieval for Dense Video Captioning
Viaarxiv icon

SynC: Synthetic Image Caption Dataset Refinement with One-to-many Mapping for Zero-shot Image Captioning

Add code
Jul 24, 2025
Figure 1 for SynC: Synthetic Image Caption Dataset Refinement with One-to-many Mapping for Zero-shot Image Captioning
Figure 2 for SynC: Synthetic Image Caption Dataset Refinement with One-to-many Mapping for Zero-shot Image Captioning
Figure 3 for SynC: Synthetic Image Caption Dataset Refinement with One-to-many Mapping for Zero-shot Image Captioning
Figure 4 for SynC: Synthetic Image Caption Dataset Refinement with One-to-many Mapping for Zero-shot Image Captioning
Viaarxiv icon

SIDA: Synthetic Image Driven Zero-shot Domain Adaptation

Add code
Jul 24, 2025
Figure 1 for SIDA: Synthetic Image Driven Zero-shot Domain Adaptation
Figure 2 for SIDA: Synthetic Image Driven Zero-shot Domain Adaptation
Figure 3 for SIDA: Synthetic Image Driven Zero-shot Domain Adaptation
Figure 4 for SIDA: Synthetic Image Driven Zero-shot Domain Adaptation
Viaarxiv icon

VerbDiff: Text-Only Diffusion Models with Enhanced Interaction Awareness

Add code
Mar 20, 2025
Figure 1 for VerbDiff: Text-Only Diffusion Models with Enhanced Interaction Awareness
Figure 2 for VerbDiff: Text-Only Diffusion Models with Enhanced Interaction Awareness
Figure 3 for VerbDiff: Text-Only Diffusion Models with Enhanced Interaction Awareness
Figure 4 for VerbDiff: Text-Only Diffusion Models with Enhanced Interaction Awareness
Viaarxiv icon