Picture for Ranjay Krishna

Ranjay Krishna

VFIG: Vectorizing Complex Figures in SVG with Vision-Language Models

Add code
Mar 25, 2026
Viaarxiv icon

Counting Circuits: Mechanistic Interpretability of Visual Reasoning in Large Vision-Language Models

Add code
Mar 19, 2026
Viaarxiv icon

Unified Spatio-Temporal Token Scoring for Efficient Video VLMs

Add code
Mar 18, 2026
Viaarxiv icon

MolmoB0T: Large-Scale Simulation Enables Zero-Shot Manipulation

Add code
Mar 17, 2026
Viaarxiv icon

vla-eval: A Unified Evaluation Harness for Vision-Language-Action Models

Add code
Mar 14, 2026
Viaarxiv icon

Video-Based Reward Modeling for Computer-Use Agents

Add code
Mar 10, 2026
Viaarxiv icon

TrajTok: Learning Trajectory Tokens enables better Video Understanding

Add code
Feb 26, 2026
Viaarxiv icon

Scale Can't Overcome Pragmatics: The Impact of Reporting Bias on Vision-Language Reasoning

Add code
Feb 26, 2026
Viaarxiv icon

TOPReward: Token Probabilities as Hidden Zero-Shot Rewards for Robotics

Add code
Feb 22, 2026
Viaarxiv icon

MolmoSpaces: A Large-Scale Open Ecosystem for Robot Navigation and Manipulation

Add code
Feb 11, 2026
Viaarxiv icon