Picture for Abdelrahman Shaker

Abdelrahman Shaker

GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model

Add code
Jul 18, 2024
Figure 1 for GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model
Figure 2 for GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model
Figure 3 for GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model
Figure 4 for GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model
Viaarxiv icon

Efficient Video Object Segmentation via Modulated Cross-Attention Memory

Add code
Mar 26, 2024
Figure 1 for Efficient Video Object Segmentation via Modulated Cross-Attention Memory
Figure 2 for Efficient Video Object Segmentation via Modulated Cross-Attention Memory
Figure 3 for Efficient Video Object Segmentation via Modulated Cross-Attention Memory
Figure 4 for Efficient Video Object Segmentation via Modulated Cross-Attention Memory
Viaarxiv icon

PALO: A Polyglot Large Multimodal Model for 5B People

Add code
Mar 05, 2024
Figure 1 for PALO: A Polyglot Large Multimodal Model for 5B People
Figure 2 for PALO: A Polyglot Large Multimodal Model for 5B People
Figure 3 for PALO: A Polyglot Large Multimodal Model for 5B People
Figure 4 for PALO: A Polyglot Large Multimodal Model for 5B People
Viaarxiv icon

Arabic Mini-ClimateGPT : A Climate Change and Sustainability Tailored Arabic LLM

Add code
Dec 14, 2023
Figure 1 for Arabic Mini-ClimateGPT : A Climate Change and Sustainability Tailored Arabic LLM
Figure 2 for Arabic Mini-ClimateGPT : A Climate Change and Sustainability Tailored Arabic LLM
Figure 3 for Arabic Mini-ClimateGPT : A Climate Change and Sustainability Tailored Arabic LLM
Figure 4 for Arabic Mini-ClimateGPT : A Climate Change and Sustainability Tailored Arabic LLM
Viaarxiv icon

GLaMM: Pixel Grounding Large Multimodal Model

Add code
Nov 06, 2023
Viaarxiv icon

Learnable Weight Initialization for Volumetric Medical Image Segmentation

Add code
Jun 28, 2023
Figure 1 for Learnable Weight Initialization for Volumetric Medical Image Segmentation
Figure 2 for Learnable Weight Initialization for Volumetric Medical Image Segmentation
Figure 3 for Learnable Weight Initialization for Volumetric Medical Image Segmentation
Figure 4 for Learnable Weight Initialization for Volumetric Medical Image Segmentation
Viaarxiv icon

XrayGPT: Chest Radiographs Summarization using Medical Vision-Language Models

Add code
Jun 13, 2023
Figure 1 for XrayGPT: Chest Radiographs Summarization using Medical Vision-Language Models
Figure 2 for XrayGPT: Chest Radiographs Summarization using Medical Vision-Language Models
Figure 3 for XrayGPT: Chest Radiographs Summarization using Medical Vision-Language Models
Figure 4 for XrayGPT: Chest Radiographs Summarization using Medical Vision-Language Models
Viaarxiv icon

SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applications

Add code
Mar 27, 2023
Figure 1 for SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applications
Figure 2 for SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applications
Figure 3 for SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applications
Figure 4 for SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applications
Viaarxiv icon

UNETR++: Delving into Efficient and Accurate 3D Medical Image Segmentation

Add code
Dec 08, 2022
Figure 1 for UNETR++: Delving into Efficient and Accurate 3D Medical Image Segmentation
Figure 2 for UNETR++: Delving into Efficient and Accurate 3D Medical Image Segmentation
Figure 3 for UNETR++: Delving into Efficient and Accurate 3D Medical Image Segmentation
Figure 4 for UNETR++: Delving into Efficient and Accurate 3D Medical Image Segmentation
Viaarxiv icon

EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications

Add code
Jun 21, 2022
Figure 1 for EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications
Figure 2 for EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications
Figure 3 for EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications
Figure 4 for EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications
Viaarxiv icon