Panoptic Segmentation


Panoptic segmentation is a computer vision task that combines semantic segmentation and instance segmentation to provide a comprehensive understanding of the scene. The goal of panoptic segmentation is to segment the image into semantically meaningful parts or regions, while also detecting and distinguishing individual instances of objects within those regions. In a given image, every pixel is assigned a semantic label, and pixels belonging to things classes (countable objects with instances, like cars and people) are assigned unique instance IDs.

Panoptic-SLAM: Visual SLAM in Dynamic Environments using Panoptic Segmentation

Add code
May 03, 2024
Viaarxiv icon

4D Panoptic Scene Graph Generation

Add code
May 16, 2024
Figure 1 for 4D Panoptic Scene Graph Generation
Figure 2 for 4D Panoptic Scene Graph Generation
Figure 3 for 4D Panoptic Scene Graph Generation
Figure 4 for 4D Panoptic Scene Graph Generation
Viaarxiv icon

A Unified Framework for 3D Scene Understanding

Add code
Jul 03, 2024
Figure 1 for A Unified Framework for 3D Scene Understanding
Figure 2 for A Unified Framework for 3D Scene Understanding
Figure 3 for A Unified Framework for 3D Scene Understanding
Figure 4 for A Unified Framework for 3D Scene Understanding
Viaarxiv icon

Efficient Robot Learning for Perception and Mapping

Add code
May 23, 2024
Viaarxiv icon

General and Task-Oriented Video Segmentation

Add code
Jul 09, 2024
Figure 1 for General and Task-Oriented Video Segmentation
Figure 2 for General and Task-Oriented Video Segmentation
Figure 3 for General and Task-Oriented Video Segmentation
Figure 4 for General and Task-Oriented Video Segmentation
Viaarxiv icon

ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt Tuning

Add code
Mar 29, 2024
Viaarxiv icon

An Integrated Framework for Multi-Granular Explanation of Video Summarization

Add code
May 16, 2024
Figure 1 for An Integrated Framework for Multi-Granular Explanation of Video Summarization
Figure 2 for An Integrated Framework for Multi-Granular Explanation of Video Summarization
Figure 3 for An Integrated Framework for Multi-Granular Explanation of Video Summarization
Figure 4 for An Integrated Framework for Multi-Granular Explanation of Video Summarization
Viaarxiv icon

JRDB-PanoTrack: An Open-world Panoptic Segmentation and Tracking Robotic Dataset in Crowded Human Environments

Add code
Apr 02, 2024
Figure 1 for JRDB-PanoTrack: An Open-world Panoptic Segmentation and Tracking Robotic Dataset in Crowded Human Environments
Figure 2 for JRDB-PanoTrack: An Open-world Panoptic Segmentation and Tracking Robotic Dataset in Crowded Human Environments
Figure 3 for JRDB-PanoTrack: An Open-world Panoptic Segmentation and Tracking Robotic Dataset in Crowded Human Environments
Figure 4 for JRDB-PanoTrack: An Open-world Panoptic Segmentation and Tracking Robotic Dataset in Crowded Human Environments
Viaarxiv icon

Language-Guided Instance-Aware Domain-Adaptive Panoptic Segmentation

Add code
Apr 04, 2024
Viaarxiv icon

kNN-CLIP: Retrieval Enables Training-Free Segmentation on Continually Expanding Large Vocabularies

Add code
Apr 15, 2024
Figure 1 for kNN-CLIP: Retrieval Enables Training-Free Segmentation on Continually Expanding Large Vocabularies
Figure 2 for kNN-CLIP: Retrieval Enables Training-Free Segmentation on Continually Expanding Large Vocabularies
Figure 3 for kNN-CLIP: Retrieval Enables Training-Free Segmentation on Continually Expanding Large Vocabularies
Figure 4 for kNN-CLIP: Retrieval Enables Training-Free Segmentation on Continually Expanding Large Vocabularies
Viaarxiv icon