David Crandall

Transformer for Object Re-Identification: A Survey

Jan 13, 2024
Mang Ye, Shuoyi Chen, Chenyue Li, Wei-Shi Zheng, David Crandall, Bo Du

Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives

Nov 30, 2023
Kristen Grauman, Andrew Westbury, Lorenzo Torresani, Kris Kitani, Jitendra Malik, Triantafyllos Afouras, Kumar Ashutosh, Vijay Baiyya, Siddhant Bansal, Bikram Boote, Eugene Byrne, Zach Chavis, Joya Chen, Feng Cheng, Fu-Jen Chu, Sean Crane, Avijit Dasgupta, Jing Dong, Maria Escobar, Cristhian Forigua, Abrham Gebreselasie, Sanjay Haresh, Jing Huang, Md Mohaiminul Islam, Suyog Jain, Rawal Khirodkar, Devansh Kukreja, Kevin J Liang, Jia-Wei Liu, Sagnik Majumder, Yongsen Mao, Miguel Martin, Effrosyni Mavroudi, Tushar Nagarajan, Francesco Ragusa, Santhosh Kumar Ramakrishnan, Luigi Seminara, Arjun Somayazulu, Yale Song, Shan Su, Zihui Xue, Edward Zhang, Jinxu Zhang, Angela Castillo, Changan Chen, Xinzhu Fu, Ryosuke Furuta, Cristina Gonzalez, Prince Gupta, Jiabo Hu, Yifei Huang, Yiming Huang, Weslie Khoo, Anush Kumar, Robert Kuo, Sach Lakhavani, Miao Liu, Mi Luo, Zhengyi Luo, Brighid Meredith, Austin Miller, Oluwatumininu Oguntola, Xiaqing Pan, Penny Peng, Shraman Pramanick, Merey Ramazanova, Fiona Ryan, Wei Shan, Kiran Somasundaram, Chenan Song, Audrey Southerland, Masatoshi Tateno, Huiyu Wang, Yuchen Wang, Takuma Yagi, Mingfei Yan, Xitong Yang, Zecheng Yu, Shengxin Cindy Zha, Chen Zhao, Ziwei Zhao, Zhifan Zhu, Jeff Zhuo, Pablo Arbelaez, Gedas Bertasius, David Crandall, Dima Damen, Jakob Engel, Giovanni Maria Farinella, Antonino Furnari, Bernard Ghanem, Judy Hoffman, C. V. Jawahar, Richard Newcombe, Hyun Soo Park, James M. Rehg, Yoichi Sato, Manolis Savva, Jianbo Shi, Mike Zheng Shou, Michael Wray

Situated Cameras, Situated Knowledges: Towards an Egocentric Epistemology for Computer Vision

Jun 30, 2023
Samuel Goree, David Crandall

A Tensor-based Convolutional Neural Network for Small Dataset Classification

Mar 29, 2023
Zhenhua Chen, David Crandall

SePaint: Semantic Map Inpainting via Multinomial Diffusion

Mar 05, 2023
Zheng Chen, Deepak Duggirala, David Crandall, Lei Jiang, Lantao Liu

LoCoNet: Long-Short Context Network for Active Speaker Detection

Jan 19, 2023
Xizi Wang, Feng Cheng, Gedas Bertasius, David Crandall

VindLU: A Recipe for Effective Video-and-Language Pretraining

Dec 09, 2022
Feng Cheng, Xizi Wang, Jie Lei, David Crandall, Mohit Bansal, Gedas Bertasius

Attention is All They Need: Exploring the Media Archaeology of the Computer Vision Research Paper

Sep 22, 2022
Samuel Goree, Gabriel Appleby, David Crandall, Norman Su

Action Recognition based on Cross-Situational Action-object Statistics

Aug 15, 2022
Satoshi Tsutsui, Xizi Wang, Guangyuan Weng, Yayun Zhang, David Crandall, Chen Yu

Graph Neural Network and Spatiotemporal Transformer Attention for 3D Video Object Detection from Point Clouds

Jul 26, 2022
Junbo Yin, Jianbing Shen, Xin Gao, David Crandall, Ruigang Yang
