Picture for Salman Khan

Salman Khan

DocAtlas: Multilingual Document Understanding Across 80+ Languages

Add code
May 12, 2026
Viaarxiv icon

Agentic AI for Remote Sensing: Technical Challenges and Research Directions

Add code
Apr 27, 2026
Viaarxiv icon

Physics-Enhanced Deep Learning for Proactive Thermal Runaway Forecasting in Li-Ion Batteries

Add code
Apr 22, 2026
Viaarxiv icon

GCA Framework: A Gulf-Grounded Dataset and Agentic Pipeline for Climate Decision Support

Add code
Apr 14, 2026
Viaarxiv icon

Paper Circle: An Open-source Multi-agent Research Discovery and Analysis Framework

Add code
Apr 07, 2026
Viaarxiv icon

CoME-VL: Scaling Complementary Multi-Encoder Vision-Language Learning

Add code
Apr 03, 2026
Viaarxiv icon

The Eleventh NTIRE 2026 Efficient Super-Resolution Challenge Report

Add code
Apr 03, 2026
Viaarxiv icon

CarePilot: A Multi-Agent Framework for Long-Horizon Computer Task Automation in Healthcare

Add code
Mar 25, 2026
Viaarxiv icon

WorldCache: Content-Aware Caching for Accelerated Video World Models

Add code
Mar 23, 2026
Viaarxiv icon

From Masks to Pixels and Meaning: A New Taxonomy, Benchmark, and Metrics for VLM Image Tampering

Add code
Mar 20, 2026
Viaarxiv icon