
Md Mohaiminul Islam


Video ReCap: Recursive Captioning of Hour-Long Videos

Feb 28, 2024
Md Mohaiminul Islam, Ngan Ho, Xitong Yang, Tushar Nagarajan, Lorenzo Torresani, Gedas Bertasius


A Simple LLM Framework for Long-Range Video Question-Answering

Dec 28, 2023
Ce Zhang, Taixi Lu, Md Mohaiminul Islam, Ziyang Wang, Shoubin Yu, Mohit Bansal, Gedas Bertasius


RGNet: A Unified Retrieval and Grounding Network for Long Videos

Dec 11, 2023
Tanveer Hannan, Md Mohaiminul Islam, Thomas Seidl, Gedas Bertasius


Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives

Nov 30, 2023
Kristen Grauman, Andrew Westbury, Lorenzo Torresani, Kris Kitani, Jitendra Malik, Triantafyllos Afouras, Kumar Ashutosh, Vijay Baiyya, Siddhant Bansal, Bikram Boote, Eugene Byrne, Zach Chavis, Joya Chen, Feng Cheng, Fu-Jen Chu, Sean Crane, Avijit Dasgupta, Jing Dong, Maria Escobar, Cristhian Forigua, Abrham Gebreselasie, Sanjay Haresh, Jing Huang, Md Mohaiminul Islam, Suyog Jain, Rawal Khirodkar, Devansh Kukreja, Kevin J Liang, Jia-Wei Liu, Sagnik Majumder, Yongsen Mao, Miguel Martin, Effrosyni Mavroudi, Tushar Nagarajan, Francesco Ragusa, Santhosh Kumar Ramakrishnan, Luigi Seminara, Arjun Somayazulu, Yale Song, Shan Su, Zihui Xue, Edward Zhang, Jinxu Zhang, Angela Castillo, Changan Chen, Xinzhu Fu, Ryosuke Furuta, Cristina Gonzalez, Prince Gupta, Jiabo Hu, Yifei Huang, Yiming Huang, Weslie Khoo, Anush Kumar, Robert Kuo, Sach Lakhavani, Miao Liu, Mi Luo, Zhengyi Luo, Brighid Meredith, Austin Miller, Oluwatumininu Oguntola, Xiaqing Pan, Penny Peng, Shraman Pramanick, Merey Ramazanova, Fiona Ryan, Wei Shan, Kiran Somasundaram, Chenan Song, Audrey Southerland, Masatoshi Tateno, Huiyu Wang, Yuchen Wang, Takuma Yagi, Mingfei Yan, Xitong Yang, Zecheng Yu, Shengxin Cindy Zha, Chen Zhao, Ziwei Zhao, Zhifan Zhu, Jeff Zhuo, Pablo Arbelaez, Gedas Bertasius, David Crandall, Dima Damen, Jakob Engel, Giovanni Maria Farinella, Antonino Furnari, Bernard Ghanem, Judy Hoffman, C. V. Jawahar, Richard Newcombe, Hyun Soo Park, James M. Rehg, Yoichi Sato, Manolis Savva, Jianbo Shi, Mike Zheng Shou, Michael Wray


Efficient Movie Scene Detection using State-Space Transformers

Dec 29, 2022
Md Mohaiminul Islam, Mahmudul Hasan, Kishan Shamsundar Athrey, Tony Braskich, Gedas Bertasius


Object State Change Classification in Egocentric Videos using the Divided Space-Time Attention Mechanism

Jul 24, 2022
Md Mohaiminul Islam, Gedas Bertasius


Long Movie Clip Classification with State-Space Video Models

Apr 04, 2022
Md Mohaiminul Islam, Gedas Bertasius
