Alert button
Picture for Peter Stone

Peter Stone

Alert button

Sony AI, The University of Texas at Austin

Value Function Decomposition for Iterative Design of Reinforcement Learning Agents

Add code
Bookmark button
Alert button
Jun 24, 2022
James MacGlashan, Evan Archer, Alisa Devlic, Takuma Seno, Craig Sherstan, Peter R. Wurman, Peter Stone

Figure 1 for Value Function Decomposition for Iterative Design of Reinforcement Learning Agents
Figure 2 for Value Function Decomposition for Iterative Design of Reinforcement Learning Agents
Figure 3 for Value Function Decomposition for Iterative Design of Reinforcement Learning Agents
Figure 4 for Value Function Decomposition for Iterative Design of Reinforcement Learning Agents
Viaarxiv icon

High-Speed Accurate Robot Control using Learned Forward Kinodynamics and Non-linear Least Squares Optimization

Add code
Bookmark button
Alert button
Jun 16, 2022
Pranav Atreya, Haresh Karnan, Kavan Singh Sikand, Xuesu Xiao, Garrett Warnell, Sadegh Rabiee, Peter Stone, Joydeep Biswas

Figure 1 for High-Speed Accurate Robot Control using Learned Forward Kinodynamics and Non-linear Least Squares Optimization
Figure 2 for High-Speed Accurate Robot Control using Learned Forward Kinodynamics and Non-linear Least Squares Optimization
Figure 3 for High-Speed Accurate Robot Control using Learned Forward Kinodynamics and Non-linear Least Squares Optimization
Figure 4 for High-Speed Accurate Robot Control using Learned Forward Kinodynamics and Non-linear Least Squares Optimization
Viaarxiv icon

Models of human preference for learning reward functions

Add code
Bookmark button
Alert button
Jun 05, 2022
W. Bradley Knox, Stephane Hatgis-Kessell, Serena Booth, Scott Niekum, Peter Stone, Alessandro Allievi

Figure 1 for Models of human preference for learning reward functions
Figure 2 for Models of human preference for learning reward functions
Figure 3 for Models of human preference for learning reward functions
Figure 4 for Models of human preference for learning reward functions
Viaarxiv icon

DM$^2$: Distributed Multi-Agent Reinforcement Learning for Distribution Matching

Add code
Bookmark button
Alert button
Jun 01, 2022
Caroline Wang, Ishan Durugkar, Elad Liebman, Peter Stone

Figure 1 for DM$^2$: Distributed Multi-Agent Reinforcement Learning for Distribution Matching
Figure 2 for DM$^2$: Distributed Multi-Agent Reinforcement Learning for Distribution Matching
Figure 3 for DM$^2$: Distributed Multi-Agent Reinforcement Learning for Distribution Matching
Figure 4 for DM$^2$: Distributed Multi-Agent Reinforcement Learning for Distribution Matching
Viaarxiv icon

COOPERNAUT: End-to-End Driving with Cooperative Perception for Networked Vehicles

Add code
Bookmark button
Alert button
May 04, 2022
Jiaxun Cui, Hang Qiu, Dian Chen, Peter Stone, Yuke Zhu

Figure 1 for COOPERNAUT: End-to-End Driving with Cooperative Perception for Networked Vehicles
Figure 2 for COOPERNAUT: End-to-End Driving with Cooperative Perception for Networked Vehicles
Figure 3 for COOPERNAUT: End-to-End Driving with Cooperative Perception for Networked Vehicles
Figure 4 for COOPERNAUT: End-to-End Driving with Cooperative Perception for Networked Vehicles
Viaarxiv icon

Effective Mutation Rate Adaptation through Group Elite Selection

Add code
Bookmark button
Alert button
Apr 11, 2022
Akarsh Kumar, Bo Liu, Risto Miikkulainen, Peter Stone

Figure 1 for Effective Mutation Rate Adaptation through Group Elite Selection
Figure 2 for Effective Mutation Rate Adaptation through Group Elite Selection
Figure 3 for Effective Mutation Rate Adaptation through Group Elite Selection
Figure 4 for Effective Mutation Rate Adaptation through Group Elite Selection
Viaarxiv icon

VI-IKD: High-Speed Accurate Off-Road Navigation using Learned Visual-Inertial Inverse Kinodynamics

Add code
Bookmark button
Alert button
Mar 30, 2022
Haresh Karnan, Kavan Singh Sikand, Pranav Atreya, Sadegh Rabiee, Xuesu Xiao, Garrett Warnell, Peter Stone, Joydeep Biswas

Figure 1 for VI-IKD: High-Speed Accurate Off-Road Navigation using Learned Visual-Inertial Inverse Kinodynamics
Figure 2 for VI-IKD: High-Speed Accurate Off-Road Navigation using Learned Visual-Inertial Inverse Kinodynamics
Figure 3 for VI-IKD: High-Speed Accurate Off-Road Navigation using Learned Visual-Inertial Inverse Kinodynamics
Figure 4 for VI-IKD: High-Speed Accurate Off-Road Navigation using Learned Visual-Inertial Inverse Kinodynamics
Viaarxiv icon

Socially Compliant Navigation Dataset (SCAND): A Large-Scale Dataset of Demonstrations for Social Navigation

Add code
Bookmark button
Alert button
Mar 28, 2022
Haresh Karnan, Anirudh Nair, Xuesu Xiao, Garrett Warnell, Soeren Pirk, Alexander Toshev, Justin Hart, Joydeep Biswas, Peter Stone

Figure 1 for Socially Compliant Navigation Dataset (SCAND): A Large-Scale Dataset of Demonstrations for Social Navigation
Figure 2 for Socially Compliant Navigation Dataset (SCAND): A Large-Scale Dataset of Demonstrations for Social Navigation
Figure 3 for Socially Compliant Navigation Dataset (SCAND): A Large-Scale Dataset of Demonstrations for Social Navigation
Figure 4 for Socially Compliant Navigation Dataset (SCAND): A Large-Scale Dataset of Demonstrations for Social Navigation
Viaarxiv icon

Continual Learning and Private Unlearning

Add code
Bookmark button
Alert button
Mar 24, 2022
Bo Liu, Qiang Liu, Peter Stone

Figure 1 for Continual Learning and Private Unlearning
Figure 2 for Continual Learning and Private Unlearning
Viaarxiv icon

Visually Grounded Task and Motion Planning for Mobile Manipulation

Add code
Bookmark button
Alert button
Feb 24, 2022
Xiaohan Zhang, Yifeng Zhu, Yan Ding, Yuke Zhu, Peter Stone, Shiqi Zhang

Figure 1 for Visually Grounded Task and Motion Planning for Mobile Manipulation
Figure 2 for Visually Grounded Task and Motion Planning for Mobile Manipulation
Figure 3 for Visually Grounded Task and Motion Planning for Mobile Manipulation
Figure 4 for Visually Grounded Task and Motion Planning for Mobile Manipulation
Viaarxiv icon