Picture for Ranjay Krishna

Ranjay Krishna

TrajTok: Learning Trajectory Tokens enables better Video Understanding

Add code
Feb 26, 2026
Viaarxiv icon

Scale Can't Overcome Pragmatics: The Impact of Reporting Bias on Vision-Language Reasoning

Add code
Feb 26, 2026
Viaarxiv icon

TOPReward: Token Probabilities as Hidden Zero-Shot Rewards for Robotics

Add code
Feb 22, 2026
Viaarxiv icon

MolmoSpaces: A Large-Scale Open Ecosystem for Robot Navigation and Manipulation

Add code
Feb 11, 2026
Viaarxiv icon

Recurrent-Depth VLA: Implicit Test-Time Compute Scaling of Vision-Language-Action Models via Latent Iterative Reasoning

Add code
Feb 08, 2026
Viaarxiv icon

Theory of Space: Can Foundation Models Construct Spatial Beliefs through Active Exploration?

Add code
Feb 04, 2026
Viaarxiv icon

VLS: Steering Pretrained Robot Policies via Vision-Language Models

Add code
Feb 03, 2026
Viaarxiv icon

AdaReasoner: Dynamic Tool Orchestration for Iterative Visual Reasoning

Add code
Jan 26, 2026
Viaarxiv icon

Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding

Add code
Jan 15, 2026
Viaarxiv icon

GenEval 2: Addressing Benchmark Drift in Text-to-Image Evaluation

Add code
Dec 18, 2025
Figure 1 for GenEval 2: Addressing Benchmark Drift in Text-to-Image Evaluation
Figure 2 for GenEval 2: Addressing Benchmark Drift in Text-to-Image Evaluation
Figure 3 for GenEval 2: Addressing Benchmark Drift in Text-to-Image Evaluation
Figure 4 for GenEval 2: Addressing Benchmark Drift in Text-to-Image Evaluation
Viaarxiv icon