Ankush Gupta

BootsTAP: Bootstrapped Training for Tracking-Any-Point

Feb 01, 2024
Carl Doersch, Yi Yang, Dilara Gokay, Pauline Luc, Skanda Koppula, Ankush Gupta, Joseph Heyward, Ross Goroshin, João Carreira, Andrew Zisserman

Helping Hands: An Object-Aware Ego-Centric Video Recognition Model

Aug 15, 2023
Chuhan Zhang, Ankush Gupta, Andrew Zisserman

TAPIR: Tracking Any Point with per-frame Initialization and temporal Refinement

Jun 14, 2023
Carl Doersch, Yi Yang, Mel Vecerik, Dilara Gokay, Ankush Gupta, Yusuf Aytar, João Carreira, Andrew Zisserman

Perception Test: A Diagnostic Benchmark for Multimodal Video Models

May 23, 2023
Viorica Pătrăucean, Lucas Smaira, Ankush Gupta, Adrià Recasens Continente, Larisa Markeeva, Dylan Banarse, Skanda Koppula, Joseph Heyward, Mateusz Malinowski, Yi Yang, Carl Doersch, Tatiana Matejovicova, Yury Sulsky, Antoine Miech, Alex Frechette, Hanna Klimczak, Raphael Koster, Junlin Zhang, Stephanie Winkler, Yusuf Aytar, Simon Osindero, Dima Damen, Andrew Zisserman, João Carreira

SuS-X: Training-Free Name-Only Transfer of Vision-Language Models

Nov 28, 2022
Vishaal Udandarao, Ankush Gupta, Samuel Albanie

TAP-Vid: A Benchmark for Tracking Any Point in a Video

Nov 07, 2022
Carl Doersch, Ankush Gupta, Larisa Markeeva, Adrià Recasens, Lucas Smaira, Yusuf Aytar, João Carreira, Andrew Zisserman, Yi Yang

Is an Object-Centric Video Representation Beneficial for Transfer?

Jul 20, 2022
Chuhan Zhang, Ankush Gupta, Andrew Zisserman

Temporal Query Networks for Fine-grained Video Understanding

Apr 19, 2021
Chuhan Zhang, Ankush Gupta, Andrew Zisserman

Representation Matters: Improving Perception and Exploration for Robotics

Nov 03, 2020
Markus Wulfmeier, Arunkumar Byravan, Tim Hertweck, Irina Higgins, Ankush Gupta, Tejas Kulkarni, Malcolm Reynolds, Denis Teplyashin, Roland Hafner, Thomas Lampe, Martin Riedmiller

Adaptive Text Recognition through Visual Matching

Sep 14, 2020
Chuhan Zhang, Ankush Gupta, Andrew Zisserman
