Picture for Elisa Ricci

Elisa Ricci

Less is more: Summarizing Patch Tokens for efficient Multi-Label Class-Incremental Learning

Add code
May 24, 2024
Viaarxiv icon

SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object Detection

Add code
May 16, 2024
Viaarxiv icon

Vocabulary-free Image Classification and Semantic Segmentation

Add code
Apr 16, 2024
Figure 1 for Vocabulary-free Image Classification and Semantic Segmentation
Figure 2 for Vocabulary-free Image Classification and Semantic Segmentation
Figure 3 for Vocabulary-free Image Classification and Semantic Segmentation
Figure 4 for Vocabulary-free Image Classification and Semantic Segmentation
Viaarxiv icon

Socially Pertinent Robots in Gerontological Healthcare

Add code
Apr 11, 2024
Figure 1 for Socially Pertinent Robots in Gerontological Healthcare
Figure 2 for Socially Pertinent Robots in Gerontological Healthcare
Figure 3 for Socially Pertinent Robots in Gerontological Healthcare
Figure 4 for Socially Pertinent Robots in Gerontological Healthcare
Viaarxiv icon

Test-Time Zero-Shot Temporal Action Localization

Add code
Apr 11, 2024
Figure 1 for Test-Time Zero-Shot Temporal Action Localization
Figure 2 for Test-Time Zero-Shot Temporal Action Localization
Figure 3 for Test-Time Zero-Shot Temporal Action Localization
Figure 4 for Test-Time Zero-Shot Temporal Action Localization
Viaarxiv icon

MULTIFLOW: Shifting Towards Task-Agnostic Vision-Language Pruning

Add code
Apr 08, 2024
Figure 1 for MULTIFLOW: Shifting Towards Task-Agnostic Vision-Language Pruning
Figure 2 for MULTIFLOW: Shifting Towards Task-Agnostic Vision-Language Pruning
Figure 3 for MULTIFLOW: Shifting Towards Task-Agnostic Vision-Language Pruning
Figure 4 for MULTIFLOW: Shifting Towards Task-Agnostic Vision-Language Pruning
Viaarxiv icon

Harnessing Large Language Models for Training-free Video Anomaly Detection

Add code
Apr 01, 2024
Figure 1 for Harnessing Large Language Models for Training-free Video Anomaly Detection
Figure 2 for Harnessing Large Language Models for Training-free Video Anomaly Detection
Figure 3 for Harnessing Large Language Models for Training-free Video Anomaly Detection
Figure 4 for Harnessing Large Language Models for Training-free Video Anomaly Detection
Viaarxiv icon

Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis

Add code
Feb 22, 2024
Figure 1 for Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis
Figure 2 for Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis
Figure 3 for Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis
Figure 4 for Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis
Viaarxiv icon

Democratizing Fine-grained Visual Recognition with Large Language Models

Add code
Jan 24, 2024
Figure 1 for Democratizing Fine-grained Visual Recognition with Large Language Models
Figure 2 for Democratizing Fine-grained Visual Recognition with Large Language Models
Figure 3 for Democratizing Fine-grained Visual Recognition with Large Language Models
Figure 4 for Democratizing Fine-grained Visual Recognition with Large Language Models
Viaarxiv icon

Diversified in-domain synthesis with efficient fine-tuning for few-shot classification

Add code
Dec 07, 2023
Viaarxiv icon