Picture for Vibhav Vineet

Vibhav Vineet

Is A Picture Worth A Thousand Words? Delving Into Spatial Reasoning for Vision Language Models

Add code
Jun 21, 2024
Figure 1 for Is A Picture Worth A Thousand Words? Delving Into Spatial Reasoning for Vision Language Models
Figure 2 for Is A Picture Worth A Thousand Words? Delving Into Spatial Reasoning for Vision Language Models
Figure 3 for Is A Picture Worth A Thousand Words? Delving Into Spatial Reasoning for Vision Language Models
Figure 4 for Is A Picture Worth A Thousand Words? Delving Into Spatial Reasoning for Vision Language Models
Viaarxiv icon

Exposing the Achilles' Heel: Evaluating LLMs Ability to Handle Mistakes in Mathematical Reasoning

Add code
Jun 16, 2024
Figure 1 for Exposing the Achilles' Heel: Evaluating LLMs Ability to Handle Mistakes in Mathematical Reasoning
Figure 2 for Exposing the Achilles' Heel: Evaluating LLMs Ability to Handle Mistakes in Mathematical Reasoning
Figure 3 for Exposing the Achilles' Heel: Evaluating LLMs Ability to Handle Mistakes in Mathematical Reasoning
Figure 4 for Exposing the Achilles' Heel: Evaluating LLMs Ability to Handle Mistakes in Mathematical Reasoning
Viaarxiv icon

Navigating Hallucinations for Reasoning of Unintentional Activities

Add code
Mar 03, 2024
Viaarxiv icon

DreamDistribution: Prompt Distribution Learning for Text-to-Image Diffusion Models

Add code
Dec 21, 2023
Viaarxiv icon

PEEKABOO: Interactive Video Generation via Masked-Diffusion

Add code
Dec 12, 2023
Viaarxiv icon

DAMEX: Dataset-aware Mixture-of-Experts for visual understanding of mixture-of-datasets

Add code
Nov 08, 2023
Figure 1 for DAMEX: Dataset-aware Mixture-of-Experts for visual understanding of mixture-of-datasets
Figure 2 for DAMEX: Dataset-aware Mixture-of-Experts for visual understanding of mixture-of-datasets
Figure 3 for DAMEX: Dataset-aware Mixture-of-Experts for visual understanding of mixture-of-datasets
Figure 4 for DAMEX: Dataset-aware Mixture-of-Experts for visual understanding of mixture-of-datasets
Viaarxiv icon

Efficiently Robustify Pre-trained Models

Add code
Sep 14, 2023
Figure 1 for Efficiently Robustify Pre-trained Models
Figure 2 for Efficiently Robustify Pre-trained Models
Figure 3 for Efficiently Robustify Pre-trained Models
Figure 4 for Efficiently Robustify Pre-trained Models
Viaarxiv icon

Beyond Generation: Harnessing Text to Image Models for Object Detection and Segmentation

Add code
Sep 12, 2023
Figure 1 for Beyond Generation: Harnessing Text to Image Models for Object Detection and Segmentation
Figure 2 for Beyond Generation: Harnessing Text to Image Models for Object Detection and Segmentation
Figure 3 for Beyond Generation: Harnessing Text to Image Models for Object Detection and Segmentation
Figure 4 for Beyond Generation: Harnessing Text to Image Models for Object Detection and Segmentation
Viaarxiv icon

Robustness Analysis on Foundational Segmentation Models

Add code
Jun 15, 2023
Figure 1 for Robustness Analysis on Foundational Segmentation Models
Figure 2 for Robustness Analysis on Foundational Segmentation Models
Figure 3 for Robustness Analysis on Foundational Segmentation Models
Figure 4 for Robustness Analysis on Foundational Segmentation Models
Viaarxiv icon

Benchmarking self-supervised video representation learning

Add code
Jun 09, 2023
Figure 1 for Benchmarking self-supervised video representation learning
Figure 2 for Benchmarking self-supervised video representation learning
Figure 3 for Benchmarking self-supervised video representation learning
Figure 4 for Benchmarking self-supervised video representation learning
Viaarxiv icon