Thariq Khalid

GazeVLM: A Vision-Language Model for Multi-Task Gaze Understanding

Nov 09, 2025
Via arXiv

Leveraging Multi-Modal Saliency and Fusion for Gaze Target Detection

Apr 27, 2025
Via arXiv

PMODE: Prototypical Mask based Object Dimension Estimation

Dec 26, 2022
Via arXiv

Ego Vehicle Speed Estimation using 3D Convolution with Masked Attention

Dec 11, 2022
Via arXiv