Panoptic Segmentation


Panoptic segmentation is a computer vision task that combines semantic segmentation and instance segmentation to provide a comprehensive understanding of the scene. The goal of panoptic segmentation is to segment the image into semantically meaningful parts or regions, while also detecting and distinguishing individual instances of objects within those regions. In a given image, every pixel is assigned a semantic label, and pixels belonging to things classes (countable objects with instances, like cars and people) are assigned unique instance IDs.

S4D: Streaming 4D Real-World Reconstruction with Gaussians and 3D Control Points

Add code
Aug 23, 2024
Viaarxiv icon

An Integrated Framework for Multi-Granular Explanation of Video Summarization

Add code
May 16, 2024
Figure 1 for An Integrated Framework for Multi-Granular Explanation of Video Summarization
Figure 2 for An Integrated Framework for Multi-Granular Explanation of Video Summarization
Figure 3 for An Integrated Framework for Multi-Granular Explanation of Video Summarization
Figure 4 for An Integrated Framework for Multi-Granular Explanation of Video Summarization
Viaarxiv icon

ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt Tuning

Add code
Mar 29, 2024
Viaarxiv icon

F-LMM: Grounding Frozen Large Multimodal Models

Add code
Jun 09, 2024
Figure 1 for F-LMM: Grounding Frozen Large Multimodal Models
Figure 2 for F-LMM: Grounding Frozen Large Multimodal Models
Figure 3 for F-LMM: Grounding Frozen Large Multimodal Models
Figure 4 for F-LMM: Grounding Frozen Large Multimodal Models
Viaarxiv icon

JRDB-PanoTrack: An Open-world Panoptic Segmentation and Tracking Robotic Dataset in Crowded Human Environments

Add code
Apr 02, 2024
Viaarxiv icon

Language-Guided Instance-Aware Domain-Adaptive Panoptic Segmentation

Add code
Apr 04, 2024
Viaarxiv icon

kNN-CLIP: Retrieval Enables Training-Free Segmentation on Continually Expanding Large Vocabularies

Add code
Apr 15, 2024
Figure 1 for kNN-CLIP: Retrieval Enables Training-Free Segmentation on Continually Expanding Large Vocabularies
Figure 2 for kNN-CLIP: Retrieval Enables Training-Free Segmentation on Continually Expanding Large Vocabularies
Figure 3 for kNN-CLIP: Retrieval Enables Training-Free Segmentation on Continually Expanding Large Vocabularies
Figure 4 for kNN-CLIP: Retrieval Enables Training-Free Segmentation on Continually Expanding Large Vocabularies
Viaarxiv icon

COCONut: Modernizing COCO Segmentation

Add code
Apr 12, 2024
Viaarxiv icon

Building a Strong Pre-Training Baseline for Universal 3D Large-Scale Perception

Add code
May 12, 2024
Figure 1 for Building a Strong Pre-Training Baseline for Universal 3D Large-Scale Perception
Figure 2 for Building a Strong Pre-Training Baseline for Universal 3D Large-Scale Perception
Figure 3 for Building a Strong Pre-Training Baseline for Universal 3D Large-Scale Perception
Figure 4 for Building a Strong Pre-Training Baseline for Universal 3D Large-Scale Perception
Viaarxiv icon

Continual Panoptic Perception: Towards Multi-modal Incremental Interpretation of Remote Sensing Images

Add code
Jul 19, 2024
Figure 1 for Continual Panoptic Perception: Towards Multi-modal Incremental Interpretation of Remote Sensing Images
Figure 2 for Continual Panoptic Perception: Towards Multi-modal Incremental Interpretation of Remote Sensing Images
Figure 3 for Continual Panoptic Perception: Towards Multi-modal Incremental Interpretation of Remote Sensing Images
Figure 4 for Continual Panoptic Perception: Towards Multi-modal Incremental Interpretation of Remote Sensing Images
Viaarxiv icon