Hisham Cholakkal

MediX-R1: Open Ended Medical Reinforcement Learning

Feb 26, 2026

Mobile-O: Unified Multimodal Understanding and Generation on Mobile Device

Feb 24, 2026

Audit After Segmentation: Reference-Free Mask Quality Assessment for Language-Referred Audio-Visual Segmentation

Feb 03, 2026

A Benchmark and Agentic Framework for Omni-Modal Reasoning and Tool Use in Long Videos

Dec 18, 2025

How Good are Foundation Models in Step-by-Step Embodied Reasoning?

Sep 18, 2025

AI in Agriculture: A Survey of Deep Learning Techniques for Crops, Fisheries and Livestock

Jul 29, 2025

HyRet-Change: A hybrid retentive network for remote sensing change detection

Jun 15, 2025

InceptionMamba: Efficient Multi-Stage Feature Enhancement with Selective State Space Model for Microscopic Medical Image Segmentation

Jun 13, 2025

TAViS: Text-bridged Audio-Visual Segmentation with Foundation Models

Jun 13, 2025

A Culturally-diverse Multilingual Multimodal Video Benchmark & Model

Jun 08, 2025