Glips


Generating Vision-Language Navigation Instructions Incorporated Fine-Grained Alignment Annotations

Jun 10, 2025

Disambiguating Reference in Visually Grounded Dialogues through Joint Modeling of Textual and Multimodal Semantic Structures

May 16, 2025

GLIP-OOD: Zero-Shot Graph OOD Detection with Foundation Model

Apr 29, 2025

Prompt as Knowledge Bank: Boost Vision-language model via Structural Representation for zero-shot medical detection

Feb 22, 2025

NavAgent: Multi-scale Urban Street View Fusion For UAV Embodied Vision-and-Language Navigation

Nov 13, 2024

Reliable Semantic Understanding for Real World Zero-shot Object Goal Navigation

Oct 29, 2024

AttriPrompter: Auto-Prompting with Attribute Semantics for Zero-shot Nuclei Detection via Visual-Language Pre-trained Models

Oct 22, 2024

SegGrasp: Zero-Shot Task-Oriented Grasping via Semantic and Geometric Guided Segmentation

Oct 11, 2024

SDPT: Synchronous Dual Prompt Tuning for Fusion-based Visual-Language Pre-trained Models

Jul 16, 2024

Global-Local Image Perceptual Score (GLIPS): Evaluating Photorealistic Quality of AI-Generated Images

May 16, 2024