Picture for Athul M. Mathew

Athul M. Mathew

GazeVLM: A Vision-Language Model for Multi-Task Gaze Understanding

Add code
Nov 09, 2025
Figure 1 for GazeVLM: A Vision-Language Model for Multi-Task Gaze Understanding
Figure 2 for GazeVLM: A Vision-Language Model for Multi-Task Gaze Understanding
Figure 3 for GazeVLM: A Vision-Language Model for Multi-Task Gaze Understanding
Figure 4 for GazeVLM: A Vision-Language Model for Multi-Task Gaze Understanding
Viaarxiv icon

Leveraging Multi-Modal Saliency and Fusion for Gaze Target Detection

Add code
Apr 27, 2025
Viaarxiv icon

Ego Vehicle Speed Estimation using 3D Convolution with Masked Attention

Add code
Dec 11, 2022
Viaarxiv icon