Picture for Minsu Cho

Minsu Cho

Harnessing the Power of Training-Free Techniques in Text-to-2D Generation for Text-to-3D Generation via Score Distillation Sampling

Add code
May 26, 2025
Viaarxiv icon

Locality-Aware Zero-Shot Human-Object Interaction Detection

Add code
May 26, 2025
Viaarxiv icon

Video Summarization with Large Language Models

Add code
Apr 15, 2025
Viaarxiv icon

Memory-Modular Classification: Learning to Generalize with Memory Replacement

Add code
Apr 08, 2025
Viaarxiv icon

Leveraging 3D Geometric Priors in 2D Rotation Symmetry Detection

Add code
Mar 27, 2025
Viaarxiv icon

Mosaic3D: Foundation Dataset and Model for Open-Vocabulary 3D Segmentation

Add code
Feb 04, 2025
Viaarxiv icon

ActFusion: a Unified Diffusion Model for Action Segmentation and Anticipation

Add code
Dec 05, 2024
Viaarxiv icon

RoDyGS: Robust Dynamic Gaussian Splatting for Casual Videos

Add code
Dec 04, 2024
Viaarxiv icon

MVFormer: Diversifying Feature Normalization and Token Mixing for Efficient Vision Transformers

Add code
Nov 28, 2024
Figure 1 for MVFormer: Diversifying Feature Normalization and Token Mixing for Efficient Vision Transformers
Figure 2 for MVFormer: Diversifying Feature Normalization and Token Mixing for Efficient Vision Transformers
Figure 3 for MVFormer: Diversifying Feature Normalization and Token Mixing for Efficient Vision Transformers
Figure 4 for MVFormer: Diversifying Feature Normalization and Token Mixing for Efficient Vision Transformers
Viaarxiv icon

3D Equivariant Pose Regression via Direct Wigner-D Harmonics Prediction

Add code
Nov 04, 2024
Viaarxiv icon