Md Mohaiminul Islam

Video ReCap: Recursive Captioning of Hour-Long Videos

Feb 28, 2024
Via arXiv

A Simple LLM Framework for Long-Range Video Question-Answering

Dec 28, 2023
Via arXiv

RGNet: A Unified Retrieval and Grounding Network for Long Videos

Dec 11, 2023
Via arXiv

Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives

Nov 30, 2023
Via arXiv

Efficient Movie Scene Detection using State-Space Transformers

Dec 29, 2022
Via arXiv

Object State Change Classification in Egocentric Videos using the Divided Space-Time Attention Mechanism

Jul 24, 2022
Via arXiv

Long Movie Clip Classification with State-Space Video Models

Apr 04, 2022
Via arXiv