Picture for Anil Batra

Anil Batra

Predicting Implicit Arguments in Procedural Video Instructions

Add code
May 27, 2025
Viaarxiv icon

CAST: Cross-modal Alignment Similarity Test for Vision Language Models

Add code
Sep 17, 2024
Figure 1 for CAST: Cross-modal Alignment Similarity Test for Vision Language Models
Figure 2 for CAST: Cross-modal Alignment Similarity Test for Vision Language Models
Figure 3 for CAST: Cross-modal Alignment Similarity Test for Vision Language Models
Figure 4 for CAST: Cross-modal Alignment Similarity Test for Vision Language Models
Viaarxiv icon

Efficient Pre-training for Localized Instruction Generation of Videos

Add code
Nov 27, 2023
Figure 1 for Efficient Pre-training for Localized Instruction Generation of Videos
Figure 2 for Efficient Pre-training for Localized Instruction Generation of Videos
Figure 3 for Efficient Pre-training for Localized Instruction Generation of Videos
Figure 4 for Efficient Pre-training for Localized Instruction Generation of Videos
Viaarxiv icon

Image generation with shortest path diffusion

Add code
Jun 01, 2023
Viaarxiv icon

A Closer Look at Temporal Ordering in the Segmentation of Instructional Videos

Add code
Oct 07, 2022
Figure 1 for A Closer Look at Temporal Ordering in the Segmentation of Instructional Videos
Figure 2 for A Closer Look at Temporal Ordering in the Segmentation of Instructional Videos
Figure 3 for A Closer Look at Temporal Ordering in the Segmentation of Instructional Videos
Figure 4 for A Closer Look at Temporal Ordering in the Segmentation of Instructional Videos
Viaarxiv icon