Picture for Peter Stone

Peter Stone

UT Austin, Sony AI

The Trajectory Alignment Coefficient in Two Acts: From Reward Tuning to Reward Learning

Add code
Jan 23, 2026
Viaarxiv icon

Harmful Traits of AI Companions

Add code
Nov 18, 2025
Figure 1 for Harmful Traits of AI Companions
Viaarxiv icon

Terrain Costmap Generation via Scaled Preference Conditioning

Add code
Nov 14, 2025
Figure 1 for Terrain Costmap Generation via Scaled Preference Conditioning
Figure 2 for Terrain Costmap Generation via Scaled Preference Conditioning
Figure 3 for Terrain Costmap Generation via Scaled Preference Conditioning
Figure 4 for Terrain Costmap Generation via Scaled Preference Conditioning
Viaarxiv icon

Out-of-Distribution Generalization with a SPARC: Racing 100 Unseen Vehicles with a Single Policy

Add code
Nov 12, 2025
Figure 1 for Out-of-Distribution Generalization with a SPARC: Racing 100 Unseen Vehicles with a Single Policy
Figure 2 for Out-of-Distribution Generalization with a SPARC: Racing 100 Unseen Vehicles with a Single Policy
Figure 3 for Out-of-Distribution Generalization with a SPARC: Racing 100 Unseen Vehicles with a Single Policy
Figure 4 for Out-of-Distribution Generalization with a SPARC: Racing 100 Unseen Vehicles with a Single Policy
Viaarxiv icon

LLM-GROP: Visually Grounded Robot Task and Motion Planning with Large Language Models

Add code
Nov 11, 2025
Viaarxiv icon

SocialNav-SUB: Benchmarking VLMs for Scene Understanding in Social Robot Navigation

Add code
Sep 10, 2025
Viaarxiv icon

A Benchmark for Generalizing Across Diverse Team Strategies in Competitive Pokémon

Add code
Jun 12, 2025
Viaarxiv icon

SLAC: Simulation-Pretrained Latent Action Space for Whole-Body Real-World RL

Add code
Jun 07, 2025
Viaarxiv icon

ROTATE: Regret-driven Open-ended Training for Ad Hoc Teamwork

Add code
May 29, 2025
Viaarxiv icon

Towards Natural Language Communication for Cooperative Autonomous Driving via Self-Play

Add code
May 23, 2025
Viaarxiv icon