Rao Muhammad Anwer

DuwatBench: Bridging Language and Visual Heritage through an Arabic Calligraphy Benchmark for Multimodal Understanding

Jan 27, 2026

A Benchmark and Agentic Framework for Omni-Modal Reasoning and Tool Use in Long Videos

Dec 18, 2025

Beyond Simple Edits: Composed Video Retrieval with Dense Modifications

Aug 19, 2025

RAGNet: Large-scale Reasoning-based Affordance Segmentation Benchmark towards General Grasping

Jul 31, 2025

AI in Agriculture: A Survey of Deep Learning Techniques for Crops, Fisheries and Livestock

Jul 29, 2025

TAViS: Text-bridged Audio-Visual Segmentation with Foundation Models

Jun 13, 2025

TerraFM: A Scalable Foundation Model for Unified Multisensor Earth Observation

Jun 06, 2025

Agent-X: Evaluating Deep Multimodal Reasoning in Vision-Centric Agentic Tasks

May 30, 2025

Fann or Flop: A Multigenre, Multiera Benchmark for Arabic Poetry Understanding in LLMs

May 26, 2025

OpenSeg-R: Improving Open-Vocabulary Segmentation via Step-by-Step Visual Reasoning

May 22, 2025