Picture for Ranjay Krishna

Ranjay Krishna

Agile Deliberation: Concept Deliberation for Subjective Visual Classification

Add code
Dec 11, 2025
Figure 1 for Agile Deliberation: Concept Deliberation for Subjective Visual Classification
Figure 2 for Agile Deliberation: Concept Deliberation for Subjective Visual Classification
Figure 3 for Agile Deliberation: Concept Deliberation for Subjective Visual Classification
Figure 4 for Agile Deliberation: Concept Deliberation for Subjective Visual Classification
Viaarxiv icon

OlmoEarth: Stable Latent Image Modeling for Multimodal Earth Observation

Add code
Nov 17, 2025
Viaarxiv icon

SIMS-V: Simulated Instruction-Tuning for Spatial Video Understanding

Add code
Nov 06, 2025
Viaarxiv icon

ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning

Add code
Oct 30, 2025
Figure 1 for ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning
Figure 2 for ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning
Figure 3 for ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning
Figure 4 for ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning
Viaarxiv icon

Visual Representations inside the Language Model

Add code
Oct 06, 2025
Figure 1 for Visual Representations inside the Language Model
Figure 2 for Visual Representations inside the Language Model
Figure 3 for Visual Representations inside the Language Model
Figure 4 for Visual Representations inside the Language Model
Viaarxiv icon

FailSafe: Reasoning and Recovery from Failures in Vision-Language-Action Models

Add code
Oct 02, 2025
Viaarxiv icon

Explain Before You Answer: A Survey on Compositional Visual Reasoning

Add code
Aug 24, 2025
Figure 1 for Explain Before You Answer: A Survey on Compositional Visual Reasoning
Figure 2 for Explain Before You Answer: A Survey on Compositional Visual Reasoning
Figure 3 for Explain Before You Answer: A Survey on Compositional Visual Reasoning
Figure 4 for Explain Before You Answer: A Survey on Compositional Visual Reasoning
Viaarxiv icon

MolmoAct: Action Reasoning Models that can Reason in Space

Add code
Aug 12, 2025
Figure 1 for MolmoAct: Action Reasoning Models that can Reason in Space
Figure 2 for MolmoAct: Action Reasoning Models that can Reason in Space
Figure 3 for MolmoAct: Action Reasoning Models that can Reason in Space
Figure 4 for MolmoAct: Action Reasoning Models that can Reason in Space
Viaarxiv icon

MultiRef: Controllable Image Generation with Multiple Visual References

Add code
Aug 09, 2025
Figure 1 for MultiRef: Controllable Image Generation with Multiple Visual References
Figure 2 for MultiRef: Controllable Image Generation with Multiple Visual References
Figure 3 for MultiRef: Controllable Image Generation with Multiple Visual References
Figure 4 for MultiRef: Controllable Image Generation with Multiple Visual References
Viaarxiv icon

The Delta Learning Hypothesis: Preference Tuning on Weak Data can Yield Strong Gains

Add code
Jul 08, 2025
Viaarxiv icon