Picture for Suraj Nair

Suraj Nair

Emergence of Human to Robot Transfer in Vision-Language-Action Models

Add code
Dec 27, 2025
Viaarxiv icon

$π^{*}_{0.6}$: a VLA That Learns From Experience

Add code
Nov 19, 2025
Viaarxiv icon

A Representation Sharpening Framework for Zero Shot Dense Retrieval

Add code
Nov 07, 2025
Viaarxiv icon

$π_{0.5}$: a Vision-Language-Action Model with Open-World Generalization

Add code
Apr 22, 2025
Figure 1 for $π_{0.5}$: a Vision-Language-Action Model with Open-World Generalization
Figure 2 for $π_{0.5}$: a Vision-Language-Action Model with Open-World Generalization
Figure 3 for $π_{0.5}$: a Vision-Language-Action Model with Open-World Generalization
Figure 4 for $π_{0.5}$: a Vision-Language-Action Model with Open-World Generalization
Viaarxiv icon

FAST: Efficient Action Tokenization for Vision-Language-Action Models

Add code
Jan 16, 2025
Figure 1 for FAST: Efficient Action Tokenization for Vision-Language-Action Models
Figure 2 for FAST: Efficient Action Tokenization for Vision-Language-Action Models
Figure 3 for FAST: Efficient Action Tokenization for Vision-Language-Action Models
Figure 4 for FAST: Efficient Action Tokenization for Vision-Language-Action Models
Viaarxiv icon

$π_0$: A Vision-Language-Action Flow Model for General Robot Control

Add code
Oct 31, 2024
Figure 1 for $π_0$: A Vision-Language-Action Flow Model for General Robot Control
Figure 2 for $π_0$: A Vision-Language-Action Flow Model for General Robot Control
Figure 3 for $π_0$: A Vision-Language-Action Flow Model for General Robot Control
Figure 4 for $π_0$: A Vision-Language-Action Flow Model for General Robot Control
Viaarxiv icon

GHIL-Glue: Hierarchical Control with Filtered Subgoal Images

Add code
Oct 26, 2024
Figure 1 for GHIL-Glue: Hierarchical Control with Filtered Subgoal Images
Figure 2 for GHIL-Glue: Hierarchical Control with Filtered Subgoal Images
Figure 3 for GHIL-Glue: Hierarchical Control with Filtered Subgoal Images
Figure 4 for GHIL-Glue: Hierarchical Control with Filtered Subgoal Images
Viaarxiv icon

OpenVLA: An Open-Source Vision-Language-Action Model

Add code
Jun 13, 2024
Figure 1 for OpenVLA: An Open-Source Vision-Language-Action Model
Figure 2 for OpenVLA: An Open-Source Vision-Language-Action Model
Figure 3 for OpenVLA: An Open-Source Vision-Language-Action Model
Figure 4 for OpenVLA: An Open-Source Vision-Language-Action Model
Viaarxiv icon

Efficiency-Effectiveness Tradeoff of Probabilistic Structured Queries for Cross-Language Information Retrieval

Add code
Apr 29, 2024
Figure 1 for Efficiency-Effectiveness Tradeoff of Probabilistic Structured Queries for Cross-Language Information Retrieval
Figure 2 for Efficiency-Effectiveness Tradeoff of Probabilistic Structured Queries for Cross-Language Information Retrieval
Figure 3 for Efficiency-Effectiveness Tradeoff of Probabilistic Structured Queries for Cross-Language Information Retrieval
Figure 4 for Efficiency-Effectiveness Tradeoff of Probabilistic Structured Queries for Cross-Language Information Retrieval
Viaarxiv icon

DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset

Add code
Mar 19, 2024
Figure 1 for DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset
Figure 2 for DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset
Figure 3 for DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset
Figure 4 for DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset
Viaarxiv icon