Picture for Tejas Gokhale

Tejas Gokhale

TripletCLIP: Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives

Add code
Nov 04, 2024
Figure 1 for TripletCLIP: Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives
Figure 2 for TripletCLIP: Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives
Figure 3 for TripletCLIP: Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives
Figure 4 for TripletCLIP: Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives
Viaarxiv icon

REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models

Add code
Aug 05, 2024
Viaarxiv icon

Grounding Stylistic Domain Generalization with Quantitative Domain Shift Measures and Synthetic Scene Images

Add code
May 24, 2024
Figure 1 for Grounding Stylistic Domain Generalization with Quantitative Domain Shift Measures and Synthetic Scene Images
Figure 2 for Grounding Stylistic Domain Generalization with Quantitative Domain Shift Measures and Synthetic Scene Images
Figure 3 for Grounding Stylistic Domain Generalization with Quantitative Domain Shift Measures and Synthetic Scene Images
Figure 4 for Grounding Stylistic Domain Generalization with Quantitative Domain Shift Measures and Synthetic Scene Images
Viaarxiv icon

On the Robustness of Language Guidance for Low-Level Vision Tasks: Findings from Depth Estimation

Add code
Apr 12, 2024
Figure 1 for On the Robustness of Language Guidance for Low-Level Vision Tasks: Findings from Depth Estimation
Figure 2 for On the Robustness of Language Guidance for Low-Level Vision Tasks: Findings from Depth Estimation
Figure 3 for On the Robustness of Language Guidance for Low-Level Vision Tasks: Findings from Depth Estimation
Figure 4 for On the Robustness of Language Guidance for Low-Level Vision Tasks: Findings from Depth Estimation
Viaarxiv icon

Improving Shift Invariance in Convolutional Neural Networks with Translation Invariant Polyphase Sampling

Add code
Apr 11, 2024
Viaarxiv icon

Getting it Right: Improving Spatial Consistency in Text-to-Image Models

Add code
Apr 01, 2024
Viaarxiv icon

Adversarial Bayesian Augmentation for Single-Source Domain Generalization

Add code
Jul 18, 2023
Viaarxiv icon

ConceptBed: Evaluating Concept Learning Abilities of Text-to-Image Diffusion Models

Add code
Jun 07, 2023
Viaarxiv icon

End-to-end Knowledge Retrieval with Multi-modal Queries

Add code
Jun 01, 2023
Viaarxiv icon

Mole Recruitment: Poisoning of Image Classifiers via Selective Batch Sampling

Add code
Mar 30, 2023
Viaarxiv icon