Garin Kessler

Perceptio: Perception Enhanced Vision Language Models via Spatial Token Generation

Mar 19, 2026
Via arXiv

Narrative Aligned Long Form Video Question Answering

Mar 19, 2026
Via arXiv

CounterVid: Counterfactual Video Generation for Mitigating Action and Temporal Hallucinations in Video-Language Models

Jan 08, 2026
Via arXiv

From Frames to Clips: Efficient Key Clip Selection for Long-Form Video Understanding

Oct 02, 2025
Via arXiv