Picture for Dong-Jin Kim

Dong-Jin Kim

djnjusa@kaist.ac.kr

SIDA: Synthetic Image Driven Zero-shot Domain Adaptation

Add code
Jul 24, 2025
Viaarxiv icon

SynC: Synthetic Image Caption Dataset Refinement with One-to-many Mapping for Zero-shot Image Captioning

Add code
Jul 24, 2025
Viaarxiv icon

VerbDiff: Text-Only Diffusion Models with Enhanced Interaction Awareness

Add code
Mar 20, 2025
Viaarxiv icon

LensNet: Enhancing Real-time Microlensing Event Discovery with Recurrent Neural Networks in the Korea Microlensing Telescope Network

Add code
Jan 10, 2025
Viaarxiv icon

ViPCap: Retrieval Text-Based Visual Prompts for Lightweight Image Captioning

Add code
Dec 26, 2024
Viaarxiv icon

Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality

Add code
Oct 07, 2024
Figure 1 for Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality
Figure 2 for Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality
Figure 3 for Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality
Figure 4 for Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality
Viaarxiv icon

IFCap: Image-like Retrieval and Frequency-based Entity Filtering for Zero-shot Captioning

Add code
Sep 26, 2024
Figure 1 for IFCap: Image-like Retrieval and Frequency-based Entity Filtering for Zero-shot Captioning
Figure 2 for IFCap: Image-like Retrieval and Frequency-based Entity Filtering for Zero-shot Captioning
Figure 3 for IFCap: Image-like Retrieval and Frequency-based Entity Filtering for Zero-shot Captioning
Figure 4 for IFCap: Image-like Retrieval and Frequency-based Entity Filtering for Zero-shot Captioning
Viaarxiv icon

Self-Sufficient Framework for Continuous Sign Language Recognition

Add code
Mar 21, 2023
Figure 1 for Self-Sufficient Framework for Continuous Sign Language Recognition
Figure 2 for Self-Sufficient Framework for Continuous Sign Language Recognition
Figure 3 for Self-Sufficient Framework for Continuous Sign Language Recognition
Figure 4 for Self-Sufficient Framework for Continuous Sign Language Recognition
Viaarxiv icon

Semi-Supervised Image Captioning by Adversarially Propagating Labeled Data

Add code
Jan 26, 2023
Figure 1 for Semi-Supervised Image Captioning by Adversarially Propagating Labeled Data
Figure 2 for Semi-Supervised Image Captioning by Adversarially Propagating Labeled Data
Figure 3 for Semi-Supervised Image Captioning by Adversarially Propagating Labeled Data
Figure 4 for Semi-Supervised Image Captioning by Adversarially Propagating Labeled Data
Viaarxiv icon

Signing Outside the Studio: Benchmarking Background Robustness for Continuous Sign Language Recognition

Add code
Nov 01, 2022
Figure 1 for Signing Outside the Studio: Benchmarking Background Robustness for Continuous Sign Language Recognition
Figure 2 for Signing Outside the Studio: Benchmarking Background Robustness for Continuous Sign Language Recognition
Figure 3 for Signing Outside the Studio: Benchmarking Background Robustness for Continuous Sign Language Recognition
Figure 4 for Signing Outside the Studio: Benchmarking Background Robustness for Continuous Sign Language Recognition
Viaarxiv icon