Picture for Minsu Cho

Minsu Cho

SpaCeFormer: Fast Proposal-Free Open-Vocabulary 3D Instance Segmentation

Add code
Apr 22, 2026
Viaarxiv icon

Exploring High-Order Self-Similarity for Video Understanding

Add code
Apr 22, 2026
Viaarxiv icon

Cog3DMap: Multi-View Vision-Language Reasoning with 3D Cognitive Maps

Add code
Mar 24, 2026
Viaarxiv icon

Improving Text-to-Image Generation with Intrinsic Self-Confidence Rewards

Add code
Mar 05, 2026
Viaarxiv icon

Planning in 8 Tokens: A Compact Discrete Tokenizer for Latent World Model

Add code
Mar 05, 2026
Viaarxiv icon

Space-Time Forecasting of Dynamic Scenes with Motion-aware Gaussian Grouping

Add code
Feb 25, 2026
Viaarxiv icon

Vision-aligned Latent Reasoning for Multi-modal Large Language Model

Add code
Feb 04, 2026
Viaarxiv icon

MV-SAM: Multi-view Promptable Segmentation using Pointmap Guidance

Add code
Jan 25, 2026
Viaarxiv icon

DextER: Language-driven Dexterous Grasp Generation with Embodied Reasoning

Add code
Jan 22, 2026
Viaarxiv icon

Affostruction: 3D Affordance Grounding with Generative Reconstruction

Add code
Jan 14, 2026
Viaarxiv icon