Sarah Ostadabbas

HORNet: Task-Guided Frame Selection for Video Question Answering with Vision-Language Models

Mar 19, 2026

Motion-o: Trajectory-Grounded Video Reasoning

Mar 19, 2026

UniTrack: Differentiable Graph Representation Learning for Multi-Object Tracking

Feb 04, 2026

Structured Over Scale: Learning Spatial Reasoning from Educational Video

Jan 30, 2026

Broadening View Synthesis of Dynamic Scenes from Constrained Monocular Videos

Dec 16, 2025

K-Track: Kalman-Enhanced Tracking for Accelerating Deep Point Trackers on Edge Devices

Dec 11, 2025

Lang2Motion: Bridging Language and Motion through Joint Embedding Spaces

Dec 11, 2025

Track and Caption Any Motion: Query-Free Motion Discovery and Description in Videos

Dec 11, 2025

AdSum: Two-stream Audio-visual Summarization for Automated Video Advertisement Clipping

Oct 30, 2025

Learning Multimodal AI Algorithms for Amplifying Limited User Input into High-dimensional Control Space

May 16, 2025