Picture for Stefano Gasperini

Stefano Gasperini

P-JEPA: Procedural Video Representation Learning via Joint Embedding Predictive Architecture

Add code
Jun 22, 2026
Viaarxiv icon

Prompting Diffusion Models for Zero-Shot Instance Segmentation

Add code
Jun 21, 2026
Viaarxiv icon

Future Dynamic 3D Reconstruction: A 3D World Model with Disentangled Ego-Motion

Add code
Jun 16, 2026
Viaarxiv icon

SA4Depth: Consistent Pose-Depth Scale Alignment for Self-Supervised Monocular Depth Estimation

Add code
May 27, 2026
Viaarxiv icon

OpenGaFF: Open-Vocabulary Gaussian Feature Field with Codebook Attention

Add code
May 07, 2026
Viaarxiv icon

GALA: Guided Attention with Language Alignment for Open Vocabulary Gaussian Splatting

Add code
Aug 21, 2025
Figure 1 for GALA: Guided Attention with Language Alignment for Open Vocabulary Gaussian Splatting
Figure 2 for GALA: Guided Attention with Language Alignment for Open Vocabulary Gaussian Splatting
Figure 3 for GALA: Guided Attention with Language Alignment for Open Vocabulary Gaussian Splatting
Figure 4 for GALA: Guided Attention with Language Alignment for Open Vocabulary Gaussian Splatting
Viaarxiv icon

Prior2Former -- Evidential Modeling of Mask Transformers for Assumption-Free Open-World Panoptic Segmentation

Add code
Apr 07, 2025
Viaarxiv icon

From Open-Vocabulary to Vocabulary-Free Semantic Segmentation

Add code
Feb 17, 2025
Viaarxiv icon

SuperGSeg: Open-Vocabulary 3D Segmentation with Structured Super-Gaussians

Add code
Dec 13, 2024
Figure 1 for SuperGSeg: Open-Vocabulary 3D Segmentation with Structured Super-Gaussians
Figure 2 for SuperGSeg: Open-Vocabulary 3D Segmentation with Structured Super-Gaussians
Figure 3 for SuperGSeg: Open-Vocabulary 3D Segmentation with Structured Super-Gaussians
Figure 4 for SuperGSeg: Open-Vocabulary 3D Segmentation with Structured Super-Gaussians
Viaarxiv icon

Zero123-6D: Zero-shot Novel View Synthesis for RGB Category-level 6D Pose Estimation

Add code
Mar 21, 2024
Figure 1 for Zero123-6D: Zero-shot Novel View Synthesis for RGB Category-level 6D Pose Estimation
Figure 2 for Zero123-6D: Zero-shot Novel View Synthesis for RGB Category-level 6D Pose Estimation
Figure 3 for Zero123-6D: Zero-shot Novel View Synthesis for RGB Category-level 6D Pose Estimation
Figure 4 for Zero123-6D: Zero-shot Novel View Synthesis for RGB Category-level 6D Pose Estimation
Viaarxiv icon