Picture for Minkyu Kim

Minkyu Kim

VLM-SubtleBench: How Far Are VLMs from Human-Level Subtle Comparative Reasoning?

Add code
Mar 09, 2026
Viaarxiv icon

See and Fix the Flaws: Enabling VLMs and Diffusion Models to Comprehend Visual Artifacts via Agentic Data Synthesis

Add code
Feb 24, 2026
Viaarxiv icon

Energy-based generator matching: A neural sampler for general state space

Add code
May 26, 2025
Figure 1 for Energy-based generator matching: A neural sampler for general state space
Figure 2 for Energy-based generator matching: A neural sampler for general state space
Figure 3 for Energy-based generator matching: A neural sampler for general state space
Figure 4 for Energy-based generator matching: A neural sampler for general state space
Viaarxiv icon

On scalable and efficient training of diffusion samplers

Add code
May 26, 2025
Viaarxiv icon

Alignment without Over-optimization: Training-Free Solution for Diffusion Models

Add code
Jan 10, 2025
Figure 1 for Alignment without Over-optimization: Training-Free Solution for Diffusion Models
Figure 2 for Alignment without Over-optimization: Training-Free Solution for Diffusion Models
Figure 3 for Alignment without Over-optimization: Training-Free Solution for Diffusion Models
Figure 4 for Alignment without Over-optimization: Training-Free Solution for Diffusion Models
Viaarxiv icon

Text Change Detection in Multilingual Documents Using Image Comparison

Add code
Dec 05, 2024
Viaarxiv icon

Rare-to-Frequent: Unlocking Compositional Generation Power of Diffusion Models on Rare Concepts with LLM Guidance

Add code
Oct 29, 2024
Figure 1 for Rare-to-Frequent: Unlocking Compositional Generation Power of Diffusion Models on Rare Concepts with LLM Guidance
Figure 2 for Rare-to-Frequent: Unlocking Compositional Generation Power of Diffusion Models on Rare Concepts with LLM Guidance
Figure 3 for Rare-to-Frequent: Unlocking Compositional Generation Power of Diffusion Models on Rare Concepts with LLM Guidance
Figure 4 for Rare-to-Frequent: Unlocking Compositional Generation Power of Diffusion Models on Rare Concepts with LLM Guidance
Viaarxiv icon

Transparent Networks for Multivariate Time Series

Add code
Oct 14, 2024
Viaarxiv icon

KorMedMCQA: Multi-Choice Question Answering Benchmark for Korean Healthcare Professional Licensing Examinations

Add code
Mar 05, 2024
Viaarxiv icon

Neural Image Compression with Text-guided Encoding for both Pixel-level and Perceptual Fidelity

Add code
Mar 05, 2024
Viaarxiv icon