Picture for Minseo Kim

Minseo Kim

KRETA: A Benchmark for Korean Reading and Reasoning in Text-Rich VQA Attuned to Diverse Visual Contexts

Add code
Aug 27, 2025
Viaarxiv icon

Comparison Reveals Commonality: Customized Image Generation through Contrastive Inversion

Add code
Aug 11, 2025
Viaarxiv icon

Subtle Risks, Critical Failures: A Framework for Diagnosing Physical Safety of LLMs for Embodied Decision Making

Add code
May 26, 2025
Viaarxiv icon

Dual Ascent Diffusion for Inverse Problems

Add code
May 23, 2025
Viaarxiv icon

DisCoRD: Discrete Tokens to Continuous Motion via Rectified Flow Decoding

Add code
Dec 02, 2024
Figure 1 for DisCoRD: Discrete Tokens to Continuous Motion via Rectified Flow Decoding
Figure 2 for DisCoRD: Discrete Tokens to Continuous Motion via Rectified Flow Decoding
Figure 3 for DisCoRD: Discrete Tokens to Continuous Motion via Rectified Flow Decoding
Figure 4 for DisCoRD: Discrete Tokens to Continuous Motion via Rectified Flow Decoding
Viaarxiv icon

CANVAS: Commonsense-Aware Navigation System for Intuitive Human-Robot Interaction

Add code
Oct 02, 2024
Figure 1 for CANVAS: Commonsense-Aware Navigation System for Intuitive Human-Robot Interaction
Figure 2 for CANVAS: Commonsense-Aware Navigation System for Intuitive Human-Robot Interaction
Figure 3 for CANVAS: Commonsense-Aware Navigation System for Intuitive Human-Robot Interaction
Figure 4 for CANVAS: Commonsense-Aware Navigation System for Intuitive Human-Robot Interaction
Viaarxiv icon

Selective Vision is the Challenge for Visual Reasoning: A Benchmark for Visual Argument Understanding

Add code
Jun 27, 2024
Figure 1 for Selective Vision is the Challenge for Visual Reasoning: A Benchmark for Visual Argument Understanding
Figure 2 for Selective Vision is the Challenge for Visual Reasoning: A Benchmark for Visual Argument Understanding
Figure 3 for Selective Vision is the Challenge for Visual Reasoning: A Benchmark for Visual Argument Understanding
Figure 4 for Selective Vision is the Challenge for Visual Reasoning: A Benchmark for Visual Argument Understanding
Viaarxiv icon