James J. Little

The Power of One: A Single Example is All it Takes for Segmentation in VLMs

Mar 13, 2025
Via arXiv

MM-R$^3$: On (In-)Consistency of Multi-modal Large Language Models (MLLMs)

Oct 07, 2024
Via arXiv

Visual Prompting for Generalized Few-shot Segmentation: A Multi-scale Approach

Apr 17, 2024
Via arXiv

Implicit and Explicit Commonsense for Multi-sentence Video Captioning

Mar 14, 2023
Via arXiv

Semantically Enhanced Global Reasoning for Semantic Segmentation

Dec 06, 2022
Via arXiv

Bootstrapping Human Optical Flow and Pose

Oct 28, 2022
Via arXiv

UNeRF: Time and Memory Conscious U-Shaped Network for Training Neural Radiance Fields

Jun 23, 2022
Via arXiv

ElePose: Unsupervised 3D Human Pose Estimation by Predicting Camera Elevation and Learning Normalizing Flows on 2D Poses

Dec 14, 2021
Via arXiv

OptiBox: Breaking the Limits of Proposals for Visual Grounding

Nov 29, 2019
Via arXiv

Pan-tilt-zoom SLAM for Sports Videos

Jul 20, 2019
Via arXiv