Alert button
Picture for Tanmay Gupta

Tanmay Gupta

Alert button

m&m's: A Benchmark to Evaluate Tool-Use for multi-step multi-modal Tasks

Add code
Bookmark button
Alert button
Mar 21, 2024
Zixian Ma, Weikai Huang, Jieyu Zhang, Tanmay Gupta, Ranjay Krishna

Figure 1 for m&m's: A Benchmark to Evaluate Tool-Use for multi-step multi-modal Tasks
Figure 2 for m&m's: A Benchmark to Evaluate Tool-Use for multi-step multi-modal Tasks
Figure 3 for m&m's: A Benchmark to Evaluate Tool-Use for multi-step multi-modal Tasks
Figure 4 for m&m's: A Benchmark to Evaluate Tool-Use for multi-step multi-modal Tasks
Viaarxiv icon

Selective "Selective Prediction": Reducing Unnecessary Abstention in Vision-Language Reasoning

Add code
Bookmark button
Alert button
Feb 23, 2024
Tejas Srinivasan, Jack Hessel, Tanmay Gupta, Bill Yuchen Lin, Yejin Choi, Jesse Thomason, Khyathi Raghavi Chandu

Viaarxiv icon

Imitating Shortest Paths in Simulation Enables Effective Navigation and Manipulation in the Real World

Add code
Bookmark button
Alert button
Dec 05, 2023
Kiana Ehsani, Tanmay Gupta, Rose Hendrix, Jordi Salvador, Luca Weihs, Kuo-Hao Zeng, Kunal Pratap Singh, Yejin Kim, Winson Han, Alvaro Herrasti, Ranjay Krishna, Dustin Schwenk, Eli VanderBilt, Aniruddha Kembhavi

Figure 1 for Imitating Shortest Paths in Simulation Enables Effective Navigation and Manipulation in the Real World
Figure 2 for Imitating Shortest Paths in Simulation Enables Effective Navigation and Manipulation in the Real World
Figure 3 for Imitating Shortest Paths in Simulation Enables Effective Navigation and Manipulation in the Real World
Figure 4 for Imitating Shortest Paths in Simulation Enables Effective Navigation and Manipulation in the Real World
Viaarxiv icon

OBJECT 3DIT: Language-guided 3D-aware Image Editing

Add code
Bookmark button
Alert button
Jul 20, 2023
Oscar Michel, Anand Bhattad, Eli VanderBilt, Ranjay Krishna, Aniruddha Kembhavi, Tanmay Gupta

Figure 1 for OBJECT 3DIT: Language-guided 3D-aware Image Editing
Figure 2 for OBJECT 3DIT: Language-guided 3D-aware Image Editing
Figure 3 for OBJECT 3DIT: Language-guided 3D-aware Image Editing
Figure 4 for OBJECT 3DIT: Language-guided 3D-aware Image Editing
Viaarxiv icon

Visual Programming: Compositional visual reasoning without training

Add code
Bookmark button
Alert button
Nov 18, 2022
Tanmay Gupta, Aniruddha Kembhavi

Figure 1 for Visual Programming: Compositional visual reasoning without training
Figure 2 for Visual Programming: Compositional visual reasoning without training
Figure 3 for Visual Programming: Compositional visual reasoning without training
Figure 4 for Visual Programming: Compositional visual reasoning without training
Viaarxiv icon

GRIT: General Robust Image Task Benchmark

Add code
Bookmark button
Alert button
May 02, 2022
Tanmay Gupta, Ryan Marten, Aniruddha Kembhavi, Derek Hoiem

Figure 1 for GRIT: General Robust Image Task Benchmark
Figure 2 for GRIT: General Robust Image Task Benchmark
Figure 3 for GRIT: General Robust Image Task Benchmark
Figure 4 for GRIT: General Robust Image Task Benchmark
Viaarxiv icon

Webly Supervised Concept Expansion for General Purpose Vision Models

Add code
Bookmark button
Alert button
Feb 04, 2022
Amita Kamath, Christopher Clark, Tanmay Gupta, Eric Kolve, Derek Hoiem, Aniruddha Kembhavi

Figure 1 for Webly Supervised Concept Expansion for General Purpose Vision Models
Figure 2 for Webly Supervised Concept Expansion for General Purpose Vision Models
Figure 3 for Webly Supervised Concept Expansion for General Purpose Vision Models
Figure 4 for Webly Supervised Concept Expansion for General Purpose Vision Models
Viaarxiv icon

Visual Semantic Role Labeling for Video Understanding

Add code
Bookmark button
Alert button
Apr 02, 2021
Arka Sadhu, Tanmay Gupta, Mark Yatskar, Ram Nevatia, Aniruddha Kembhavi

Figure 1 for Visual Semantic Role Labeling for Video Understanding
Figure 2 for Visual Semantic Role Labeling for Video Understanding
Figure 3 for Visual Semantic Role Labeling for Video Understanding
Figure 4 for Visual Semantic Role Labeling for Video Understanding
Viaarxiv icon

Towards General Purpose Vision Systems

Add code
Bookmark button
Alert button
Apr 01, 2021
Tanmay Gupta, Amita Kamath, Aniruddha Kembhavi, Derek Hoiem

Figure 1 for Towards General Purpose Vision Systems
Figure 2 for Towards General Purpose Vision Systems
Figure 3 for Towards General Purpose Vision Systems
Figure 4 for Towards General Purpose Vision Systems
Viaarxiv icon