Picture for Jaeyoung Kim

Jaeyoung Kim

Rethinking the Pointer Loss in Table Structure Recognition: Geometry-Aware Pointer Loss for Spatial Locality

Add code
Jun 17, 2026
Viaarxiv icon

Disentangling Visual and Factual Correctness in LVLMs' Visualization Literacy

Add code
Jun 02, 2026
Viaarxiv icon

Voxtral TTS

Add code
Mar 26, 2026
Viaarxiv icon

Voxtral Realtime

Add code
Feb 11, 2026
Viaarxiv icon

Adaptive Retrieval for Reasoning-Intensive Retrieval

Add code
Jan 08, 2026
Viaarxiv icon

Relevance to Utility: Process-Supervised Rewrite for RAG

Add code
Sep 19, 2025
Viaarxiv icon

Leveraging Multimodal LLM for Inspirational User Interface Search

Add code
Jan 30, 2025
Figure 1 for Leveraging Multimodal LLM for Inspirational User Interface Search
Figure 2 for Leveraging Multimodal LLM for Inspirational User Interface Search
Figure 3 for Leveraging Multimodal LLM for Inspirational User Interface Search
Figure 4 for Leveraging Multimodal LLM for Inspirational User Interface Search
Viaarxiv icon

Multi-LLM Collaborative Caption Generation in Scientific Documents

Add code
Jan 05, 2025
Figure 1 for Multi-LLM Collaborative Caption Generation in Scientific Documents
Figure 2 for Multi-LLM Collaborative Caption Generation in Scientific Documents
Figure 3 for Multi-LLM Collaborative Caption Generation in Scientific Documents
Figure 4 for Multi-LLM Collaborative Caption Generation in Scientific Documents
Viaarxiv icon

DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling

Add code
Sep 25, 2024
Figure 1 for DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling
Figure 2 for DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling
Figure 3 for DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling
Figure 4 for DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling
Viaarxiv icon

Clustering and Mining Accented Speech for Inclusive and Fair Speech Recognition

Add code
Aug 05, 2024
Figure 1 for Clustering and Mining Accented Speech for Inclusive and Fair Speech Recognition
Figure 2 for Clustering and Mining Accented Speech for Inclusive and Fair Speech Recognition
Figure 3 for Clustering and Mining Accented Speech for Inclusive and Fair Speech Recognition
Figure 4 for Clustering and Mining Accented Speech for Inclusive and Fair Speech Recognition
Viaarxiv icon