Picture for Seungryong Kim

Seungryong Kim

Visual Representation Alignment for Multimodal Large Language Models

Add code
Sep 09, 2025
Viaarxiv icon

Learning to Track Any Points from Human Motion

Add code
Jul 08, 2025
Viaarxiv icon

PLOT: Pseudo-Labeling via Video Object Tracking for Scalable Monocular 3D Object Detection

Add code
Jul 03, 2025
Viaarxiv icon

Vid-CamEdit: Video Camera Trajectory Editing with Generative Rendering from Estimated Geometry

Add code
Jun 16, 2025
Viaarxiv icon

Aligned Novel View Image and Geometry Synthesis via Cross-modal Attention Instillation

Add code
Jun 13, 2025
Viaarxiv icon

Fine-Grained Perturbation Guidance via Attention Head Selection

Add code
Jun 12, 2025
Viaarxiv icon

Text-Aware Image Restoration with Diffusion Models

Add code
Jun 11, 2025
Viaarxiv icon

Active Test-time Vision-Language Navigation

Add code
Jun 07, 2025
Viaarxiv icon

Seurat: From Moving Points to Depth

Add code
Apr 20, 2025
Viaarxiv icon

D^2USt3R: Enhancing 3D Reconstruction with 4D Pointmaps for Dynamic Scenes

Add code
Apr 08, 2025
Viaarxiv icon