Picture for Jinwoo Shin

Jinwoo Shin

Korea Advanced Institute of Science and Technology

Cog3DMap: Multi-View Vision-Language Reasoning with 3D Cognitive Maps

Add code
Mar 24, 2026
Viaarxiv icon

SpatialBoost: Enhancing Visual Representation through Language-Guided Reasoning

Add code
Mar 23, 2026
Viaarxiv icon

RoboAlign: Learning Test-Time Reasoning for Language-Action Alignment in Vision-Language-Action Models

Add code
Mar 22, 2026
Viaarxiv icon

Beyond Correctness: Learning Robust Reasoning via Transfer

Add code
Feb 09, 2026
Viaarxiv icon

Vision-aligned Latent Reasoning for Multi-modal Large Language Model

Add code
Feb 04, 2026
Viaarxiv icon

Dual-Stream Diffusion for World-Model Augmented Vision-Language-Action Model

Add code
Oct 31, 2025
Viaarxiv icon

ContextVLA: Vision-Language-Action Model with Amortized Multi-Frame Context

Add code
Oct 05, 2025
Viaarxiv icon

Contrastive Representation Regularization for Vision-Language-Action Models

Add code
Oct 02, 2025
Viaarxiv icon

HAMLET: Switch your Vision-Language-Action Model into a History-Aware Policy

Add code
Oct 02, 2025
Viaarxiv icon

CLIP Meets Diffusion: A Synergistic Approach to Anomaly Detection

Add code
Jun 13, 2025
Viaarxiv icon