Alert button
Picture for Michael Wray

Michael Wray

Alert button

Are you Struggling? Dataset and Baselines for Struggle Determination in Assembly Videos

Feb 28, 2024
Shijia Feng, Michael Wray, Brian Sullivan, Youngkyoon Jang, Casimir Ludwig, Iain Gilchrist, Walterio Mayol-Cuevas

Viaarxiv icon

Video Editing for Video Retrieval

Feb 04, 2024
Bin Zhu, Kevin Flanagan, Adriano Fragomeni, Michael Wray, Dima Damen

Viaarxiv icon

GenHowTo: Learning to Generate Actions and State Transformations from Instructional Videos

Dec 12, 2023
Tomáš Souček, Dima Damen, Michael Wray, Ivan Laptev, Josef Sivic

Figure 1 for GenHowTo: Learning to Generate Actions and State Transformations from Instructional Videos
Figure 2 for GenHowTo: Learning to Generate Actions and State Transformations from Instructional Videos
Figure 3 for GenHowTo: Learning to Generate Actions and State Transformations from Instructional Videos
Figure 4 for GenHowTo: Learning to Generate Actions and State Transformations from Instructional Videos
Viaarxiv icon

Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives

Nov 30, 2023
Kristen Grauman, Andrew Westbury, Lorenzo Torresani, Kris Kitani, Jitendra Malik, Triantafyllos Afouras, Kumar Ashutosh, Vijay Baiyya, Siddhant Bansal, Bikram Boote, Eugene Byrne, Zach Chavis, Joya Chen, Feng Cheng, Fu-Jen Chu, Sean Crane, Avijit Dasgupta, Jing Dong, Maria Escobar, Cristhian Forigua, Abrham Gebreselasie, Sanjay Haresh, Jing Huang, Md Mohaiminul Islam, Suyog Jain, Rawal Khirodkar, Devansh Kukreja, Kevin J Liang, Jia-Wei Liu, Sagnik Majumder, Yongsen Mao, Miguel Martin, Effrosyni Mavroudi, Tushar Nagarajan, Francesco Ragusa, Santhosh Kumar Ramakrishnan, Luigi Seminara, Arjun Somayazulu, Yale Song, Shan Su, Zihui Xue, Edward Zhang, Jinxu Zhang, Angela Castillo, Changan Chen, Xinzhu Fu, Ryosuke Furuta, Cristina Gonzalez, Prince Gupta, Jiabo Hu, Yifei Huang, Yiming Huang, Weslie Khoo, Anush Kumar, Robert Kuo, Sach Lakhavani, Miao Liu, Mi Luo, Zhengyi Luo, Brighid Meredith, Austin Miller, Oluwatumininu Oguntola, Xiaqing Pan, Penny Peng, Shraman Pramanick, Merey Ramazanova, Fiona Ryan, Wei Shan, Kiran Somasundaram, Chenan Song, Audrey Southerland, Masatoshi Tateno, Huiyu Wang, Yuchen Wang, Takuma Yagi, Mingfei Yan, Xitong Yang, Zecheng Yu, Shengxin Cindy Zha, Chen Zhao, Ziwei Zhao, Zhifan Zhu, Jeff Zhuo, Pablo Arbelaez, Gedas Bertasius, David Crandall, Dima Damen, Jakob Engel, Giovanni Maria Farinella, Antonino Furnari, Bernard Ghanem, Judy Hoffman, C. V. Jawahar, Richard Newcombe, Hyun Soo Park, James M. Rehg, Yoichi Sato, Manolis Savva, Jianbo Shi, Mike Zheng Shou, Michael Wray

Figure 1 for Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
Figure 2 for Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
Figure 3 for Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
Figure 4 for Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
Viaarxiv icon

Learning Temporal Sentence Grounding From Narrated EgoVideos

Oct 26, 2023
Kevin Flanagan, Dima Damen, Michael Wray

Figure 1 for Learning Temporal Sentence Grounding From Narrated EgoVideos
Figure 2 for Learning Temporal Sentence Grounding From Narrated EgoVideos
Figure 3 for Learning Temporal Sentence Grounding From Narrated EgoVideos
Figure 4 for Learning Temporal Sentence Grounding From Narrated EgoVideos
Viaarxiv icon

ConTra: (Con)text (Tra)nsformer for Cross-Modal Video Retrieval

Oct 09, 2022
Adriano Fragomeni, Michael Wray, Dima Damen

Figure 1 for ConTra: (Con)text (Tra)nsformer for Cross-Modal Video Retrieval
Figure 2 for ConTra: (Con)text (Tra)nsformer for Cross-Modal Video Retrieval
Figure 3 for ConTra: (Con)text (Tra)nsformer for Cross-Modal Video Retrieval
Figure 4 for ConTra: (Con)text (Tra)nsformer for Cross-Modal Video Retrieval
Viaarxiv icon

Egocentric Video-Language Pretraining @ Ego4D Challenge 2022

Jul 04, 2022
Kevin Qinghong Lin, Alex Jinpeng Wang, Mattia Soldan, Michael Wray, Rui Yan, Eric Zhongcong Xu, Difei Gao, Rongcheng Tu, Wenzhe Zhao, Weijie Kong, Chengfei Cai, Hongfa Wang, Dima Damen, Bernard Ghanem, Wei Liu, Mike Zheng Shou

Figure 1 for Egocentric Video-Language Pretraining @ Ego4D Challenge 2022
Figure 2 for Egocentric Video-Language Pretraining @ Ego4D Challenge 2022
Figure 3 for Egocentric Video-Language Pretraining @ Ego4D Challenge 2022
Figure 4 for Egocentric Video-Language Pretraining @ Ego4D Challenge 2022
Viaarxiv icon

Egocentric Video-Language Pretraining

Jun 03, 2022
Kevin Qinghong Lin, Alex Jinpeng Wang, Mattia Soldan, Michael Wray, Rui Yan, Eric Zhongcong Xu, Difei Gao, Rongcheng Tu, Wenzhe Zhao, Weijie Kong, Chengfei Cai, Hongfa Wang, Dima Damen, Bernard Ghanem, Wei Liu, Mike Zheng Shou

Figure 1 for Egocentric Video-Language Pretraining
Figure 2 for Egocentric Video-Language Pretraining
Figure 3 for Egocentric Video-Language Pretraining
Figure 4 for Egocentric Video-Language Pretraining
Viaarxiv icon

Domain Adaptation in Multi-View Embedding for Cross-Modal Video Retrieval

Oct 25, 2021
Jonathan Munro, Michael Wray, Diane Larlus, Gabriela Csurka, Dima Damen

Figure 1 for Domain Adaptation in Multi-View Embedding for Cross-Modal Video Retrieval
Figure 2 for Domain Adaptation in Multi-View Embedding for Cross-Modal Video Retrieval
Figure 3 for Domain Adaptation in Multi-View Embedding for Cross-Modal Video Retrieval
Figure 4 for Domain Adaptation in Multi-View Embedding for Cross-Modal Video Retrieval
Viaarxiv icon