Kanishk Jain

Benchmarking Vision Language Models for Cultural Understanding

Jul 15, 2024
Via arXiv

Instance-Level Semantic Maps for Vision Language Navigation

May 23, 2023
Via arXiv

Test-Time Amendment with a Coarse Classifier for Fine-Grained Classification

Feb 01, 2023
Via arXiv

Ground then Navigate: Language-guided Navigation in Dynamic Scenes

Sep 24, 2022
Via arXiv

Grounding Linguistic Commands to Navigable Regions

Dec 24, 2021
Via arXiv

Comprehensive Multi-Modal Interactions for Referring Image Segmentation

Apr 21, 2021
Via arXiv