Picture for Jinwoo Shin

Jinwoo Shin

Korea Advanced Institute of Science and Technology

Dual-Stream Diffusion for World-Model Augmented Vision-Language-Action Model

Add code
Oct 31, 2025
Viaarxiv icon

ContextVLA: Vision-Language-Action Model with Amortized Multi-Frame Context

Add code
Oct 05, 2025
Viaarxiv icon

Contrastive Representation Regularization for Vision-Language-Action Models

Add code
Oct 02, 2025
Viaarxiv icon

HAMLET: Switch your Vision-Language-Action Model into a History-Aware Policy

Add code
Oct 02, 2025
Viaarxiv icon

CLIP Meets Diffusion: A Synergistic Approach to Anomaly Detection

Add code
Jun 13, 2025
Viaarxiv icon

Collaborative LLM Inference via Planning for Efficient Reasoning

Add code
Jun 13, 2025
Figure 1 for Collaborative LLM Inference via Planning for Efficient Reasoning
Figure 2 for Collaborative LLM Inference via Planning for Efficient Reasoning
Figure 3 for Collaborative LLM Inference via Planning for Efficient Reasoning
Figure 4 for Collaborative LLM Inference via Planning for Efficient Reasoning
Viaarxiv icon

Enhancing Motion Dynamics of Image-to-Video Models via Adaptive Low-Pass Guidance

Add code
Jun 10, 2025
Viaarxiv icon

FontAdapter: Instant Font Adaptation in Visual Text Generation

Add code
Jun 06, 2025
Viaarxiv icon

Accelerated Test-Time Scaling with Model-Free Speculative Sampling

Add code
Jun 05, 2025
Viaarxiv icon

Sparsified State-Space Models are Efficient Highway Networks

Add code
May 27, 2025
Viaarxiv icon