Picture for Daniele Materia

Daniele Materia

Leveraging Gaze and Set-of-Mark in VLLMs for Human-Object Interaction Anticipation from Egocentric Videos

Add code
Apr 04, 2026
Viaarxiv icon