Neel Joshi

Is A Picture Worth A Thousand Words? Delving Into Spatial Reasoning for Vision Language Models

Jun 21, 2024
Via arXiv

HoloAssist: an Egocentric Human Interaction Dataset for Interactive AI Assistants in the Real World

Sep 29, 2023
Via arXiv

Beyond Generation: Harnessing Text to Image Models for Object Detection and Segmentation

Sep 12, 2023
Via arXiv

Controllable Text-to-Image Generation with GPT-4

May 29, 2023
Via arXiv

Neural-Sim: Learning to Generate Training Data with NeRF

Jul 22, 2022
Via arXiv

Scaling Novel Object Detection with Weakly Supervised Detection Transformers

Jul 11, 2022
Via arXiv

Visual Attention Emerges from Recurrent Sparse Reconstruction

Apr 23, 2022
Via arXiv

One Network Doesn't Rule Them All: Moving Beyond Handcrafted Architectures in Self-Supervised Learning

Mar 15, 2022
Via arXiv

Robust Contrastive Learning against Noisy Views

Jan 12, 2022
Via arXiv

Deep Depth Prior for Multi-View Stereo

Jan 21, 2020
Via arXiv