Picture for Minsu Cho

Minsu Cho

MVFormer: Diversifying Feature Normalization and Token Mixing for Efficient Vision Transformers

Add code
Nov 28, 2024
Figure 1 for MVFormer: Diversifying Feature Normalization and Token Mixing for Efficient Vision Transformers
Figure 2 for MVFormer: Diversifying Feature Normalization and Token Mixing for Efficient Vision Transformers
Figure 3 for MVFormer: Diversifying Feature Normalization and Token Mixing for Efficient Vision Transformers
Figure 4 for MVFormer: Diversifying Feature Normalization and Token Mixing for Efficient Vision Transformers
Viaarxiv icon

3D Equivariant Pose Regression via Direct Wigner-D Harmonics Prediction

Add code
Nov 04, 2024
Viaarxiv icon

In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation

Add code
Aug 09, 2024
Viaarxiv icon

Online Temporal Action Localization with Memory-Augmented Transformer

Add code
Aug 06, 2024
Figure 1 for Online Temporal Action Localization with Memory-Augmented Transformer
Figure 2 for Online Temporal Action Localization with Memory-Augmented Transformer
Figure 3 for Online Temporal Action Localization with Memory-Augmented Transformer
Figure 4 for Online Temporal Action Localization with Memory-Augmented Transformer
Viaarxiv icon

Classification Matters: Improving Video Action Detection with Class-Specific Attention

Add code
Jul 29, 2024
Figure 1 for Classification Matters: Improving Video Action Detection with Class-Specific Attention
Figure 2 for Classification Matters: Improving Video Action Detection with Class-Specific Attention
Figure 3 for Classification Matters: Improving Video Action Detection with Class-Specific Attention
Figure 4 for Classification Matters: Improving Video Action Detection with Class-Specific Attention
Viaarxiv icon

3D Geometric Shape Assembly via Efficient Point Cloud Matching

Add code
Jul 15, 2024
Figure 1 for 3D Geometric Shape Assembly via Efficient Point Cloud Matching
Figure 2 for 3D Geometric Shape Assembly via Efficient Point Cloud Matching
Figure 3 for 3D Geometric Shape Assembly via Efficient Point Cloud Matching
Figure 4 for 3D Geometric Shape Assembly via Efficient Point Cloud Matching
Viaarxiv icon

Burst Image Super-Resolution with Base Frame Selection

Add code
Jun 25, 2024
Viaarxiv icon

Multi-view Image Prompted Multi-view Diffusion for Improved 3D Generation

Add code
Apr 26, 2024
Viaarxiv icon

Learning SO(3)-Invariant Semantic Correspondence via Local Shape Transform

Add code
Apr 17, 2024
Figure 1 for Learning SO(3)-Invariant Semantic Correspondence via Local Shape Transform
Figure 2 for Learning SO(3)-Invariant Semantic Correspondence via Local Shape Transform
Figure 3 for Learning SO(3)-Invariant Semantic Correspondence via Local Shape Transform
Figure 4 for Learning SO(3)-Invariant Semantic Correspondence via Local Shape Transform
Viaarxiv icon

Enhancing 3D Fidelity of Text-to-3D using Cross-View Correspondences

Add code
Apr 16, 2024
Figure 1 for Enhancing 3D Fidelity of Text-to-3D using Cross-View Correspondences
Figure 2 for Enhancing 3D Fidelity of Text-to-3D using Cross-View Correspondences
Figure 3 for Enhancing 3D Fidelity of Text-to-3D using Cross-View Correspondences
Figure 4 for Enhancing 3D Fidelity of Text-to-3D using Cross-View Correspondences
Viaarxiv icon