Picture for Peter Stone

Peter Stone

UT Austin, Sony AI

AI-Assisted Peer Review at Scale: The AAAI-26 AI Review Pilot

Add code
Apr 15, 2026
Viaarxiv icon

GUIDE: Reinforcement Learning for Behavioral Action Support in Type 1 Diabetes

Add code
Apr 01, 2026
Viaarxiv icon

ExpertGen: Scalable Sim-to-Real Expert Policy Learning from Imperfect Behavior Priors

Add code
Mar 16, 2026
Viaarxiv icon

Simple Recipe Works: Vision-Language-Action Models are Natural Continual Learners with Reinforcement Learning

Add code
Mar 12, 2026
Viaarxiv icon

Large-Language-Model-Guided State Estimation for Partially Observable Task and Motion Planning

Add code
Mar 04, 2026
Viaarxiv icon

Factored Latent Action World Models

Add code
Feb 18, 2026
Viaarxiv icon

The Trajectory Alignment Coefficient in Two Acts: From Reward Tuning to Reward Learning

Add code
Jan 23, 2026
Viaarxiv icon

Harmful Traits of AI Companions

Add code
Nov 18, 2025
Figure 1 for Harmful Traits of AI Companions
Viaarxiv icon

Terrain Costmap Generation via Scaled Preference Conditioning

Add code
Nov 14, 2025
Figure 1 for Terrain Costmap Generation via Scaled Preference Conditioning
Figure 2 for Terrain Costmap Generation via Scaled Preference Conditioning
Figure 3 for Terrain Costmap Generation via Scaled Preference Conditioning
Figure 4 for Terrain Costmap Generation via Scaled Preference Conditioning
Viaarxiv icon

Out-of-Distribution Generalization with a SPARC: Racing 100 Unseen Vehicles with a Single Policy

Add code
Nov 12, 2025
Figure 1 for Out-of-Distribution Generalization with a SPARC: Racing 100 Unseen Vehicles with a Single Policy
Figure 2 for Out-of-Distribution Generalization with a SPARC: Racing 100 Unseen Vehicles with a Single Policy
Figure 3 for Out-of-Distribution Generalization with a SPARC: Racing 100 Unseen Vehicles with a Single Policy
Figure 4 for Out-of-Distribution Generalization with a SPARC: Racing 100 Unseen Vehicles with a Single Policy
Viaarxiv icon