Picture for Karl Pertsch

Karl Pertsch

Robust Finetuning of Vision-Language-Action Robot Policies via Parameter Merging

Add code
Dec 18, 2025
Viaarxiv icon

PolaRiS: Scalable Real-to-Sim Evaluations for Generalist Robot Policies

Add code
Dec 18, 2025
Viaarxiv icon

$π^{*}_{0.6}$: a VLA That Learns From Experience

Add code
Nov 19, 2025
Viaarxiv icon

Knowledge Insulating Vision-Language-Action Models: Train Fast, Run Fast, Generalize Better

Add code
May 29, 2025
Viaarxiv icon

Training Strategies for Efficient Embodied Reasoning

Add code
May 13, 2025
Viaarxiv icon

$π_{0.5}$: a Vision-Language-Action Model with Open-World Generalization

Add code
Apr 22, 2025
Figure 1 for $π_{0.5}$: a Vision-Language-Action Model with Open-World Generalization
Figure 2 for $π_{0.5}$: a Vision-Language-Action Model with Open-World Generalization
Figure 3 for $π_{0.5}$: a Vision-Language-Action Model with Open-World Generalization
Figure 4 for $π_{0.5}$: a Vision-Language-Action Model with Open-World Generalization
Viaarxiv icon

AutoEval: Autonomous Evaluation of Generalist Robot Manipulation Policies in the Real World

Add code
Mar 31, 2025
Viaarxiv icon

Hi Robot: Open-Ended Instruction Following with Hierarchical Vision-Language-Action Models

Add code
Feb 26, 2025
Viaarxiv icon

FAST: Efficient Action Tokenization for Vision-Language-Action Models

Add code
Jan 16, 2025
Figure 1 for FAST: Efficient Action Tokenization for Vision-Language-Action Models
Figure 2 for FAST: Efficient Action Tokenization for Vision-Language-Action Models
Figure 3 for FAST: Efficient Action Tokenization for Vision-Language-Action Models
Figure 4 for FAST: Efficient Action Tokenization for Vision-Language-Action Models
Viaarxiv icon

$π_0$: A Vision-Language-Action Flow Model for General Robot Control

Add code
Oct 31, 2024
Figure 1 for $π_0$: A Vision-Language-Action Flow Model for General Robot Control
Figure 2 for $π_0$: A Vision-Language-Action Flow Model for General Robot Control
Figure 3 for $π_0$: A Vision-Language-Action Flow Model for General Robot Control
Figure 4 for $π_0$: A Vision-Language-Action Flow Model for General Robot Control
Viaarxiv icon