Kyle Min

EASG-Bench: Video Q&A Benchmark with Egocentric Action Scene Graphs

Jun 06, 2025

Reinforcement Learning meets Masked Video Modeling: Trajectory-Guided Adaptive Token Selection

May 13, 2025

DecompDreamer: Advancing Structured 3D Asset Generation with Multi-Object Decomposition and Gaussian Splatting

Mar 15, 2025

Graph-Based Multimodal and Multi-view Alignment for Keystep Recognition

Jan 07, 2025

Deep Geometric Moments Promote Shape Consistency in Text-to-3D Generation

Aug 12, 2024

Ego-VPA: Egocentric Video Understanding with Parameter-efficient Adaptation

Jul 28, 2024

SViTT-Ego: A Sparse Video-Text Transformer for Egocentric Video

Jun 13, 2024

Contrastive Language Video Time Pre-training

Jun 04, 2024

R.A.C.E.: Robust Adversarial Concept Erasure for Secure Text-to-Image Diffusion Model

May 25, 2024

Action Scene Graphs for Long-Form Understanding of Egocentric Videos

Dec 06, 2023