Picture for Mohammed Bennamoun

Mohammed Bennamoun

UIFormer: A Unified Transformer-based Framework for Incremental Few-Shot Object Detection and Instance Segmentation

Add code
Nov 13, 2024
Figure 1 for UIFormer: A Unified Transformer-based Framework for Incremental Few-Shot Object Detection and Instance Segmentation
Figure 2 for UIFormer: A Unified Transformer-based Framework for Incremental Few-Shot Object Detection and Instance Segmentation
Figure 3 for UIFormer: A Unified Transformer-based Framework for Incremental Few-Shot Object Detection and Instance Segmentation
Figure 4 for UIFormer: A Unified Transformer-based Framework for Incremental Few-Shot Object Detection and Instance Segmentation
Viaarxiv icon

Referring Human Pose and Mask Estimation in the Wild

Add code
Oct 27, 2024
Viaarxiv icon

Implicit to Explicit Entropy Regularization: Benchmarking ViT Fine-tuning under Noisy Labels

Add code
Oct 05, 2024
Viaarxiv icon

A Riemannian Approach for Spatiotemporal Analysis and Generation of 4D Tree-shaped Structures

Add code
Aug 22, 2024
Figure 1 for A Riemannian Approach for Spatiotemporal Analysis and Generation of 4D Tree-shaped Structures
Figure 2 for A Riemannian Approach for Spatiotemporal Analysis and Generation of 4D Tree-shaped Structures
Figure 3 for A Riemannian Approach for Spatiotemporal Analysis and Generation of 4D Tree-shaped Structures
Figure 4 for A Riemannian Approach for Spatiotemporal Analysis and Generation of 4D Tree-shaped Structures
Viaarxiv icon

Faster Image2Video Generation: A Closer Look at CLIP Image Embedding's Impact on Spatio-Temporal Cross-Attentions

Add code
Jul 27, 2024
Figure 1 for Faster Image2Video Generation: A Closer Look at CLIP Image Embedding's Impact on Spatio-Temporal Cross-Attentions
Figure 2 for Faster Image2Video Generation: A Closer Look at CLIP Image Embedding's Impact on Spatio-Temporal Cross-Attentions
Figure 3 for Faster Image2Video Generation: A Closer Look at CLIP Image Embedding's Impact on Spatio-Temporal Cross-Attentions
Figure 4 for Faster Image2Video Generation: A Closer Look at CLIP Image Embedding's Impact on Spatio-Temporal Cross-Attentions
Viaarxiv icon

DailyDVS-200: A Comprehensive Benchmark Dataset for Event-Based Action Recognition

Add code
Jul 06, 2024
Viaarxiv icon

Deep Learning-based Depth Estimation Methods from Monocular Image and Videos: A Comprehensive Survey

Add code
Jun 28, 2024
Viaarxiv icon

Supervised Radio Frequency Interference Detection with SNNs

Add code
Jun 10, 2024
Figure 1 for Supervised Radio Frequency Interference Detection with SNNs
Figure 2 for Supervised Radio Frequency Interference Detection with SNNs
Figure 3 for Supervised Radio Frequency Interference Detection with SNNs
Figure 4 for Supervised Radio Frequency Interference Detection with SNNs
Viaarxiv icon

CPLIP: Zero-Shot Learning for Histopathology with Comprehensive Vision-Language Alignment

Add code
Jun 07, 2024
Figure 1 for CPLIP: Zero-Shot Learning for Histopathology with Comprehensive Vision-Language Alignment
Figure 2 for CPLIP: Zero-Shot Learning for Histopathology with Comprehensive Vision-Language Alignment
Figure 3 for CPLIP: Zero-Shot Learning for Histopathology with Comprehensive Vision-Language Alignment
Figure 4 for CPLIP: Zero-Shot Learning for Histopathology with Comprehensive Vision-Language Alignment
Viaarxiv icon

Language Model Guided Interpretable Video Action Reasoning

Add code
Apr 02, 2024
Viaarxiv icon