Picture for Seungryong Kim

Seungryong Kim

Understanding and Accelerating the Training of Masked Diffusion Language Models

Add code
May 13, 2026
Viaarxiv icon

TrackCraft3R: Repurposing Video Diffusion Transformers for Dense 3D Tracking

Add code
May 12, 2026
Viaarxiv icon

Entropy-Gradient Grounding: Training-Free Evidence Retrieval in Vision-Language Models

Add code
Apr 09, 2026
Viaarxiv icon

AgentRVOS: Reasoning over Object Tracks for Zero-Shot Referring Video Object Segmentation

Add code
Mar 24, 2026
Viaarxiv icon

DA-Flow: Degradation-Aware Optical Flow Estimation with Diffusion Models

Add code
Mar 24, 2026
Viaarxiv icon

TETO: Tracking Events with Teacher Observation for Motion Estimation and Frame Interpolation

Add code
Mar 24, 2026
Viaarxiv icon

Repurposing Geometric Foundation Models for Multi-view Diffusion

Add code
Mar 23, 2026
Viaarxiv icon

WorldCam: Interactive Autoregressive 3D Gaming Worlds with Camera Pose as a Unifying Geometric Representation

Add code
Mar 17, 2026
Viaarxiv icon

Grounding World Simulation Models in a Real-World Metropolis

Add code
Mar 16, 2026
Viaarxiv icon

Pri4R: Learning World Dynamics for Vision-Language-Action Models with Privileged 4D Representation

Add code
Mar 02, 2026
Viaarxiv icon