Picture for Rao Muhammad Anwer

Rao Muhammad Anwer

Multi-modal Generation via Cross-Modal In-Context Learning

Add code
May 28, 2024
Figure 1 for Multi-modal Generation via Cross-Modal In-Context Learning
Figure 2 for Multi-modal Generation via Cross-Modal In-Context Learning
Figure 3 for Multi-modal Generation via Cross-Modal In-Context Learning
Figure 4 for Multi-modal Generation via Cross-Modal In-Context Learning
Viaarxiv icon

Composed Video Retrieval via Enriched Context and Discriminative Embeddings

Add code
Mar 25, 2024
Figure 1 for Composed Video Retrieval via Enriched Context and Discriminative Embeddings
Figure 2 for Composed Video Retrieval via Enriched Context and Discriminative Embeddings
Figure 3 for Composed Video Retrieval via Enriched Context and Discriminative Embeddings
Figure 4 for Composed Video Retrieval via Enriched Context and Discriminative Embeddings
Viaarxiv icon

Semi-supervised Open-World Object Detection

Add code
Feb 25, 2024
Figure 1 for Semi-supervised Open-World Object Detection
Figure 2 for Semi-supervised Open-World Object Detection
Figure 3 for Semi-supervised Open-World Object Detection
Figure 4 for Semi-supervised Open-World Object Detection
Viaarxiv icon

BiMediX: Bilingual Medical Mixture of Experts LLM

Add code
Feb 20, 2024
Viaarxiv icon

Arabic Mini-ClimateGPT : A Climate Change and Sustainability Tailored Arabic LLM

Add code
Dec 14, 2023
Figure 1 for Arabic Mini-ClimateGPT : A Climate Change and Sustainability Tailored Arabic LLM
Figure 2 for Arabic Mini-ClimateGPT : A Climate Change and Sustainability Tailored Arabic LLM
Figure 3 for Arabic Mini-ClimateGPT : A Climate Change and Sustainability Tailored Arabic LLM
Figure 4 for Arabic Mini-ClimateGPT : A Climate Change and Sustainability Tailored Arabic LLM
Viaarxiv icon

SA2-Net: Scale-aware Attention Network for Microscopic Image Segmentation

Add code
Sep 28, 2023
Viaarxiv icon

3D Indoor Instance Segmentation in an Open-World

Add code
Sep 25, 2023
Figure 1 for 3D Indoor Instance Segmentation in an Open-World
Figure 2 for 3D Indoor Instance Segmentation in an Open-World
Figure 3 for 3D Indoor Instance Segmentation in an Open-World
Figure 4 for 3D Indoor Instance Segmentation in an Open-World
Viaarxiv icon

Multi-grained Temporal Prototype Learning for Few-shot Video Object Segmentation

Add code
Sep 20, 2023
Viaarxiv icon

A Spatial-Temporal Deformable Attention based Framework for Breast Lesion Detection in Videos

Add code
Sep 09, 2023
Figure 1 for A Spatial-Temporal Deformable Attention based Framework for Breast Lesion Detection in Videos
Figure 2 for A Spatial-Temporal Deformable Attention based Framework for Breast Lesion Detection in Videos
Figure 3 for A Spatial-Temporal Deformable Attention based Framework for Breast Lesion Detection in Videos
Figure 4 for A Spatial-Temporal Deformable Attention based Framework for Breast Lesion Detection in Videos
Viaarxiv icon

Foundational Models Defining a New Era in Vision: A Survey and Outlook

Add code
Jul 25, 2023
Figure 1 for Foundational Models Defining a New Era in Vision: A Survey and Outlook
Figure 2 for Foundational Models Defining a New Era in Vision: A Survey and Outlook
Figure 3 for Foundational Models Defining a New Era in Vision: A Survey and Outlook
Figure 4 for Foundational Models Defining a New Era in Vision: A Survey and Outlook
Viaarxiv icon