Alert button
Picture for Medhini Narasimhan

Medhini Narasimhan

Alert button

Modular Visual Question Answering via Code Generation

Add code
Bookmark button
Alert button
Jun 08, 2023
Sanjay Subramanian, Medhini Narasimhan, Kushal Khangaonkar, Kevin Yang, Arsha Nagrani, Cordelia Schmid, Andy Zeng, Trevor Darrell, Dan Klein

Figure 1 for Modular Visual Question Answering via Code Generation
Figure 2 for Modular Visual Question Answering via Code Generation
Figure 3 for Modular Visual Question Answering via Code Generation
Figure 4 for Modular Visual Question Answering via Code Generation
Viaarxiv icon

Learning and Verification of Task Structure in Instructional Videos

Add code
Bookmark button
Alert button
Mar 23, 2023
Medhini Narasimhan, Licheng Yu, Sean Bell, Ning Zhang, Trevor Darrell

Figure 1 for Learning and Verification of Task Structure in Instructional Videos
Figure 2 for Learning and Verification of Task Structure in Instructional Videos
Figure 3 for Learning and Verification of Task Structure in Instructional Videos
Figure 4 for Learning and Verification of Task Structure in Instructional Videos
Viaarxiv icon

TL;DW? Summarizing Instructional Videos with Task Relevance & Cross-Modal Saliency

Add code
Bookmark button
Alert button
Aug 14, 2022
Medhini Narasimhan, Arsha Nagrani, Chen Sun, Michael Rubinstein, Trevor Darrell, Anna Rohrbach, Cordelia Schmid

Figure 1 for TL;DW? Summarizing Instructional Videos with Task Relevance & Cross-Modal Saliency
Figure 2 for TL;DW? Summarizing Instructional Videos with Task Relevance & Cross-Modal Saliency
Figure 3 for TL;DW? Summarizing Instructional Videos with Task Relevance & Cross-Modal Saliency
Figure 4 for TL;DW? Summarizing Instructional Videos with Task Relevance & Cross-Modal Saliency
Viaarxiv icon

Multi-Person 3D Motion Prediction with Multi-Range Transformers

Add code
Bookmark button
Alert button
Nov 23, 2021
Jiashun Wang, Huazhe Xu, Medhini Narasimhan, Xiaolong Wang

Figure 1 for Multi-Person 3D Motion Prediction with Multi-Range Transformers
Figure 2 for Multi-Person 3D Motion Prediction with Multi-Range Transformers
Figure 3 for Multi-Person 3D Motion Prediction with Multi-Range Transformers
Viaarxiv icon

CLIP-It! Language-Guided Video Summarization

Add code
Bookmark button
Alert button
Jul 01, 2021
Medhini Narasimhan, Anna Rohrbach, Trevor Darrell

Figure 1 for CLIP-It! Language-Guided Video Summarization
Figure 2 for CLIP-It! Language-Guided Video Summarization
Figure 3 for CLIP-It! Language-Guided Video Summarization
Figure 4 for CLIP-It! Language-Guided Video Summarization
Viaarxiv icon

Strumming to the Beat: Audio-Conditioned Contrastive Video Textures

Add code
Bookmark button
Alert button
Apr 06, 2021
Medhini Narasimhan, Shiry Ginosar, Andrew Owens, Alexei A. Efros, Trevor Darrell

Figure 1 for Strumming to the Beat: Audio-Conditioned Contrastive Video Textures
Figure 2 for Strumming to the Beat: Audio-Conditioned Contrastive Video Textures
Figure 3 for Strumming to the Beat: Audio-Conditioned Contrastive Video Textures
Figure 4 for Strumming to the Beat: Audio-Conditioned Contrastive Video Textures
Viaarxiv icon

Seeing the Un-Scene: Learning Amodal Semantic Maps for Room Navigation

Add code
Bookmark button
Alert button
Jul 20, 2020
Medhini Narasimhan, Erik Wijmans, Xinlei Chen, Trevor Darrell, Dhruv Batra, Devi Parikh, Amanpreet Singh

Figure 1 for Seeing the Un-Scene: Learning Amodal Semantic Maps for Room Navigation
Figure 2 for Seeing the Un-Scene: Learning Amodal Semantic Maps for Room Navigation
Figure 3 for Seeing the Un-Scene: Learning Amodal Semantic Maps for Room Navigation
Figure 4 for Seeing the Un-Scene: Learning Amodal Semantic Maps for Room Navigation
Viaarxiv icon

Out of the Box: Reasoning with Graph Convolution Nets for Factual Visual Question Answering

Add code
Bookmark button
Alert button
Nov 01, 2018
Medhini Narasimhan, Svetlana Lazebnik, Alexander G. Schwing

Figure 1 for Out of the Box: Reasoning with Graph Convolution Nets for Factual Visual Question Answering
Figure 2 for Out of the Box: Reasoning with Graph Convolution Nets for Factual Visual Question Answering
Figure 3 for Out of the Box: Reasoning with Graph Convolution Nets for Factual Visual Question Answering
Figure 4 for Out of the Box: Reasoning with Graph Convolution Nets for Factual Visual Question Answering
Viaarxiv icon

Straight to the Facts: Learning Knowledge Base Retrieval for Factual Visual Question Answering

Add code
Bookmark button
Alert button
Sep 04, 2018
Medhini Narasimhan, Alexander G. Schwing

Figure 1 for Straight to the Facts: Learning Knowledge Base Retrieval for Factual Visual Question Answering
Figure 2 for Straight to the Facts: Learning Knowledge Base Retrieval for Factual Visual Question Answering
Figure 3 for Straight to the Facts: Learning Knowledge Base Retrieval for Factual Visual Question Answering
Figure 4 for Straight to the Facts: Learning Knowledge Base Retrieval for Factual Visual Question Answering
Viaarxiv icon