Picture for Pratap Tokekar

Pratap Tokekar

University of Maryland, College Park

Towards Efficient Risk-Sensitive Policy Gradient: An Iteration Complexity Analysis

Add code
Mar 13, 2024
Figure 1 for Towards Efficient Risk-Sensitive Policy Gradient: An Iteration Complexity Analysis
Figure 2 for Towards Efficient Risk-Sensitive Policy Gradient: An Iteration Complexity Analysis
Figure 3 for Towards Efficient Risk-Sensitive Policy Gradient: An Iteration Complexity Analysis
Figure 4 for Towards Efficient Risk-Sensitive Policy Gradient: An Iteration Complexity Analysis
Viaarxiv icon

Beyond Joint Demonstrations: Personalized Expert Guidance for Efficient Multi-Agent Reinforcement Learning

Add code
Mar 13, 2024
Figure 1 for Beyond Joint Demonstrations: Personalized Expert Guidance for Efficient Multi-Agent Reinforcement Learning
Figure 2 for Beyond Joint Demonstrations: Personalized Expert Guidance for Efficient Multi-Agent Reinforcement Learning
Figure 3 for Beyond Joint Demonstrations: Personalized Expert Guidance for Efficient Multi-Agent Reinforcement Learning
Figure 4 for Beyond Joint Demonstrations: Personalized Expert Guidance for Efficient Multi-Agent Reinforcement Learning
Viaarxiv icon

REBEL: A Regularization-Based Solution for Reward Overoptimization in Reinforcement Learning from Human Feedback

Add code
Dec 22, 2023
Figure 1 for REBEL: A Regularization-Based Solution for Reward Overoptimization in Reinforcement Learning from Human Feedback
Figure 2 for REBEL: A Regularization-Based Solution for Reward Overoptimization in Reinforcement Learning from Human Feedback
Figure 3 for REBEL: A Regularization-Based Solution for Reward Overoptimization in Reinforcement Learning from Human Feedback
Figure 4 for REBEL: A Regularization-Based Solution for Reward Overoptimization in Reinforcement Learning from Human Feedback
Viaarxiv icon

Enhancing Multi-Agent Coordination through Common Operating Picture Integration

Add code
Nov 08, 2023
Figure 1 for Enhancing Multi-Agent Coordination through Common Operating Picture Integration
Figure 2 for Enhancing Multi-Agent Coordination through Common Operating Picture Integration
Figure 3 for Enhancing Multi-Agent Coordination through Common Operating Picture Integration
Figure 4 for Enhancing Multi-Agent Coordination through Common Operating Picture Integration
Viaarxiv icon

AG-CVG: Coverage Planning with a Mobile Recharging UGV and an Energy-Constrained UAV

Add code
Oct 11, 2023
Figure 1 for AG-CVG: Coverage Planning with a Mobile Recharging UGV and an Energy-Constrained UAV
Figure 2 for AG-CVG: Coverage Planning with a Mobile Recharging UGV and an Energy-Constrained UAV
Figure 3 for AG-CVG: Coverage Planning with a Mobile Recharging UGV and an Energy-Constrained UAV
Figure 4 for AG-CVG: Coverage Planning with a Mobile Recharging UGV and an Energy-Constrained UAV
Viaarxiv icon

Pre-Trained Masked Image Model for Mobile Robot Navigation

Add code
Oct 10, 2023
Viaarxiv icon

D2M2N: Decentralized Differentiable Memory-Enabled Mapping and Navigation for Multiple Robots

Add code
Oct 10, 2023
Figure 1 for D2M2N: Decentralized Differentiable Memory-Enabled Mapping and Navigation for Multiple Robots
Figure 2 for D2M2N: Decentralized Differentiable Memory-Enabled Mapping and Navigation for Multiple Robots
Figure 3 for D2M2N: Decentralized Differentiable Memory-Enabled Mapping and Navigation for Multiple Robots
Figure 4 for D2M2N: Decentralized Differentiable Memory-Enabled Mapping and Navigation for Multiple Robots
Viaarxiv icon

Energy-Aware Route Planning for a Battery-Constrained Robot with Multiple Charging Depots

Add code
Oct 02, 2023
Figure 1 for Energy-Aware Route Planning for a Battery-Constrained Robot with Multiple Charging Depots
Figure 2 for Energy-Aware Route Planning for a Battery-Constrained Robot with Multiple Charging Depots
Figure 3 for Energy-Aware Route Planning for a Battery-Constrained Robot with Multiple Charging Depots
Figure 4 for Energy-Aware Route Planning for a Battery-Constrained Robot with Multiple Charging Depots
Viaarxiv icon

Decision-Oriented Intervention Cost Prediction for Multi-robot Persistent Monitoring

Add code
Oct 02, 2023
Figure 1 for Decision-Oriented Intervention Cost Prediction for Multi-robot Persistent Monitoring
Figure 2 for Decision-Oriented Intervention Cost Prediction for Multi-robot Persistent Monitoring
Figure 3 for Decision-Oriented Intervention Cost Prediction for Multi-robot Persistent Monitoring
Figure 4 for Decision-Oriented Intervention Cost Prediction for Multi-robot Persistent Monitoring
Viaarxiv icon

LANCAR: Leveraging Language for Context-Aware Robot Locomotion in Unstructured Environments

Add code
Sep 30, 2023
Figure 1 for LANCAR: Leveraging Language for Context-Aware Robot Locomotion in Unstructured Environments
Figure 2 for LANCAR: Leveraging Language for Context-Aware Robot Locomotion in Unstructured Environments
Figure 3 for LANCAR: Leveraging Language for Context-Aware Robot Locomotion in Unstructured Environments
Figure 4 for LANCAR: Leveraging Language for Context-Aware Robot Locomotion in Unstructured Environments
Viaarxiv icon