Picture for Arshad Ali Khan

Arshad Ali Khan

GazeVLM: A Vision-Language Model for Multi-Task Gaze Understanding

Add code
Nov 09, 2025
Figure 1 for GazeVLM: A Vision-Language Model for Multi-Task Gaze Understanding
Figure 2 for GazeVLM: A Vision-Language Model for Multi-Task Gaze Understanding
Figure 3 for GazeVLM: A Vision-Language Model for Multi-Task Gaze Understanding
Figure 4 for GazeVLM: A Vision-Language Model for Multi-Task Gaze Understanding
Viaarxiv icon

Neural Brain: A Neuroscience-inspired Framework for Embodied Agents

Add code
May 14, 2025
Viaarxiv icon

Leveraging Multi-Modal Saliency and Fusion for Gaze Target Detection

Add code
Apr 27, 2025
Viaarxiv icon

Leveraging Neo4j and deep learning for traffic congestion simulation & optimization

Add code
Apr 01, 2023
Viaarxiv icon