Beshah Ayalew

Shared learning of powertrain control policies for vehicle fleets

Apr 27, 2024

Lindsey Kerbel, Beshah Ayalew, Andrej Ivanco

Abstract: Emerging data-driven approaches, such as deep reinforcement learning (DRL), aim at in-the-field learning of powertrain control policies that optimize fuel economy and other performance metrics. They have shown great potential in this regard for individual vehicles on specific routes or drive cycles. However, for fleets of vehicles that must service a distribution of routes, DRL approaches struggle with learning stability issues that result in high variances and challenge their practical deployment. In this paper, we present a novel framework for shared learning among a fleet of vehicles through the use of a distilled group policy as the knowledge sharing mechanism for the policy learning computations at each vehicle. We detail the mathematical formulation that makes this possible. Several scenarios are considered to analyze the functionality, performance, and computational scalability of the framework with fleet size. Comparisons of the cumulative performance of fleets using our proposed shared learning approach with a baseline of individual learning agents and with another state-of-the-art approach that uses a centralized learner show clear advantages for our approach. For example, we find a fleet average asymptotic improvement of 8.5 percent in fuel economy compared to the baseline while also improving on the metrics of acceleration error and shifting frequency for fleets serving a distribution of suburban routes. Furthermore, we include demonstrative results that show how the framework reduces variance within a fleet and how it helps individual agents adapt better to new routes.

* Elsevier Applied Energy Volume 365, 1 July 2024, 123217
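
The abstract above does not spell out how the group policy is distilled, so the following is only a minimal sketch of one common form of policy distillation, under stated assumptions: each vehicle holds a discrete action distribution (for example over gears), and the group policy is the distribution that minimizes the summed KL divergence from the agents' policies, which for discrete distributions is their arithmetic mean. The function names, the three-vehicle fleet, and the numbers are hypothetical and are not taken from the paper.

```python
import numpy as np

def distill_group_policy(agent_policies):
    """Return the distribution pi_0 that minimizes sum_i KL(pi_i || pi_0).

    For discrete distributions this minimizer is the arithmetic mean.
    agent_policies: array of shape (n_agents, n_actions), each row sums to 1.
    """
    policies = np.asarray(agent_policies, dtype=float)
    return policies.mean(axis=0)

def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for discrete distributions, with a small eps for stability."""
    p = np.asarray(p, dtype=float)
    q = np.asarray(q, dtype=float)
    return float(np.sum(p * (np.log(p + eps) - np.log(q + eps))))

# Hypothetical fleet: three vehicles, three discrete gear actions.
fleet = np.array([[0.7, 0.2, 0.1],
                  [0.5, 0.3, 0.2],
                  [0.6, 0.1, 0.3]])
group = distill_group_policy(fleet)

# Each agent could add this term as a regularizer that pulls its own
# policy toward the shared group policy (an assumption of this sketch).
penalties = [kl_divergence(p, group) for p in fleet]
```

In a scheme of this kind, only the group policy needs to be exchanged between vehicles, while each vehicle keeps learning from its own routes.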

Safe Reinforcement Learning for an Energy-Efficient Driver Assistance System

Jan 03, 2023

Habtamu Hailemichael, Beshah Ayalew, Lindsey Kerbel, Andrej Ivanco, Keith Loiselle

Abstract: Reinforcement learning (RL)-based driver assistance systems seek to improve fuel consumption via continual improvement of powertrain control actions considering experiential data from the field. However, the need to explore diverse experiences in order to learn optimal policies often limits the application of RL techniques in safety-critical systems like vehicle control. In this paper, an exponential control barrier function (ECBF) is derived and utilized to filter unsafe actions proposed by an RL-based driver assistance system. The RL agent freely explores and optimizes the performance objectives while unsafe actions are projected to the closest actions in the safe domain. The reward is structured so that the driver's acceleration requests are met in a manner that boosts fuel economy and doesn't compromise comfort. The optimal gear and traction torque control actions that maximize the cumulative reward are computed via the Maximum a Posteriori Policy Optimization (MPO) algorithm configured for a hybrid action space. The proposed safe-RL scheme is trained and evaluated in car-following scenarios, where it is shown to effectively avoid collisions during both training and evaluation while delivering the expected fuel economy improvements for the driver assistance system.

* IFAC-PapersOnLine, Volume 55, Issue 37, 2022, Pages 615-620
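
To make the filtering idea in the abstract above concrete, here is a minimal sketch of an ECBF-style safety filter for car following. It assumes a simplified double-integrator model in which the ego acceleration is the control input and the barrier is h = gap - gap_min, so the ECBF condition h'' + k1 h' + k0 h >= 0 becomes an upper bound on ego acceleration. With a scalar action, projecting onto the safe set reduces to a clip. The model, the gains k0 and k1, the limits, and the function name are all assumptions for illustration, not the formulation derived in the paper.

```python
def ecbf_safe_acceleration(a_rl, gap, v_ego, v_lead, a_lead=0.0,
                           gap_min=5.0, k0=1.0, k1=2.0,
                           a_min=-6.0, a_max=3.0):
    """Project an RL acceleration command onto an ECBF-safe interval.

    Barrier: h = gap - gap_min (relative degree 2 w.r.t. acceleration).
    h_dot = v_lead - v_ego, h_ddot = a_lead - a_ego.
    ECBF condition: h_ddot + k1 * h_dot + k0 * h >= 0, which gives
    a_ego <= a_lead + k1 * h_dot + k0 * h.
    The gains k0=1, k1=2 place both poles of s^2 + k1*s + k0 at -1.
    """
    h = gap - gap_min
    h_dot = v_lead - v_ego
    a_upper = a_lead + k1 * h_dot + k0 * h   # ECBF bound on ego acceleration
    a_upper = min(a_upper, a_max)            # respect the actuator upper limit
    a_safe = min(a_rl, a_upper)              # closest safe action in 1D
    # If the ECBF bound is below a_min, braking capacity is insufficient to
    # satisfy it; this sketch then simply returns the hardest braking allowed.
    return max(a_safe, a_min)

# Large gap: the RL command passes through unchanged.
far = ecbf_safe_acceleration(2.0, gap=50.0, v_ego=20.0, v_lead=20.0)
# Small gap while closing in: the command is replaced by braking.
near = ecbf_safe_acceleration(2.0, gap=6.0, v_ego=20.0, v_lead=18.0)
```

Because the filter only modifies commands that violate the bound, the agent's exploration is left untouched whenever the proposed action is already safe.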

Residual Policy Learning for Powertrain Control

Dec 15, 2022

Lindsey Kerbel, Beshah Ayalew, Andrej Ivanco, Keith Loiselle

Abstract: Eco-driving strategies have been shown to provide significant reductions in fuel consumption. This paper outlines an active driver assistance approach that uses a residual policy learning (RPL) agent trained to provide residual actions to default powertrain controllers while balancing fuel consumption against other driver-accommodation objectives. Using previous experiences, our RPL agent learns improved traction torque and gear shifting residual policies to adapt the operation of the powertrain to variations and uncertainties in the environment. For comparison, we consider a traditional reinforcement learning (RL) agent trained from scratch. Both agents employ the off-policy Maximum a Posteriori Policy Optimization algorithm with an actor-critic architecture. Implementing both agents on a simulated commercial vehicle in various car-following scenarios, we find that the RPL agent quickly learns policies that are significantly better than a baseline source policy but, in some measures, not as good as those eventually reached by the RL agent trained from scratch.

* IFAC Papers Online Vol 55 (2022)
* 10th IFAC Symposium on Advances in Automotive Control AAC 2022
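
The core idea of residual policy learning in the abstract above, a learned correction added to a default controller's output, can be sketched as follows. The proportional baseline controller, the torque limits, and the residual bound are hypothetical placeholders chosen for illustration; the paper's default powertrain controllers and action scaling are not described here.

```python
import numpy as np

def default_torque_controller(accel_request, max_torque=2000.0):
    """Hypothetical baseline: torque proportional to a pedal request in [0, 1]."""
    return float(np.clip(accel_request, 0.0, 1.0)) * max_torque

def residual_action(base_torque, residual, max_torque=2000.0,
                    residual_limit=300.0):
    """Combine the default controller output with a bounded learned residual."""
    bounded = float(np.clip(residual, -residual_limit, residual_limit))
    # The final command stays within the actuator's torque range.
    return float(np.clip(base_torque + bounded, 0.0, max_torque))

base = default_torque_controller(0.5)       # baseline command
trimmed = residual_action(base, -120.0)     # learned agent asks for less torque
capped = residual_action(base, 1000.0)      # oversized residual is bounded
```

Bounding the residual keeps early, poorly trained corrections from moving the command far from the default controller, which is one reason such agents can start from reasonable behavior.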

Probabilistic Constraint Tightening Techniques for Trajectory Planning with Predictive Control

Dec 15, 2022

Nathan Goulet, Qian Wang, Beshah Ayalew

Abstract: In order for automated mobile vehicles to navigate in the real world with minimal collision risks, it is necessary for their planning algorithms to consider uncertainties from measurements and environmental disturbances. In this paper, we consider analytical solutions for a conservative approximation of the mutual probability of collision between two robotic vehicles in the presence of such uncertainties. Therein, we present two methods, which we call unitary scaling and principal axes rotation, for decoupling the bivariate integral required for efficient approximation of the probability of collision between two vehicles including orientation effects. We compare the conservatism of these methods analytically and numerically. By closing a control loop through a model predictive guidance scheme, we observe through Monte-Carlo simulations that directly implementing collision avoidance constraints from the conservative approximations remains infeasible for real-time planning. We then propose and implement a convexification approach based on the tightened collision constraints that significantly improves the computational efficiency and robustness of the predictive guidance scheme.

* Journal of the Franklin Institute, Volume 359, Issue 12, August 2022, Pages 6142-6172
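
As a generic illustration of what decoupling a bivariate collision integral can look like, the sketch below bounds the probability that a Gaussian relative position falls inside a circular collision region. It rotates into the principal axes of the covariance, where the two components are independent, and replaces the disc by its bounding square so the integral factors into two one-dimensional Gaussian integrals. This is a simplified stand-in under stated assumptions (circular region, positive-definite covariance, no vehicle orientation) and is not claimed to be the paper's unitary scaling or principal axes rotation method; all function names are hypothetical.

```python
import math
import numpy as np

def normal_cdf(x):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

def collision_probability_upper_bound(mean, cov, radius):
    """Conservative bound on P(||r|| <= radius) for r ~ N(mean, cov) in 2D.

    The disc is rotation invariant, so in the eigenbasis of cov it is still
    a disc and is contained in the square [-radius, radius]^2. In that basis
    the components are independent, so the probability of the square is a
    product of two 1D integrals.
    """
    eigvals, eigvecs = np.linalg.eigh(np.asarray(cov, dtype=float))
    rotated_mean = eigvecs.T @ np.asarray(mean, dtype=float)
    bound = 1.0
    for m, var in zip(rotated_mean, eigvals):
        s = math.sqrt(var)
        bound *= normal_cdf((radius - m) / s) - normal_cdf((-radius - m) / s)
    return bound

def collision_probability_monte_carlo(mean, cov, radius, n=200_000, seed=0):
    """Sampling estimate of the same probability, for comparison only."""
    rng = np.random.default_rng(seed)
    samples = rng.multivariate_normal(mean, cov, size=n)
    return float(np.mean(np.linalg.norm(samples, axis=1) <= radius))

bound = collision_probability_upper_bound([0.0, 0.0], np.eye(2), 1.0)
exact = 1.0 - math.exp(-0.5)   # closed form for a unit disc, identity covariance
```

The bound is cheap to evaluate but always overestimates the true probability, which is the sense in which such approximations are conservative.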

Driver Assistance Eco-driving and Transmission Control with Deep Reinforcement Learning

Dec 15, 2022

Lindsey Kerbel, Beshah Ayalew, Andrej Ivanco, Keith Loiselle

Abstract: With the growing need to reduce energy consumption and greenhouse gas emissions, eco-driving strategies provide a significant opportunity for additional fuel savings on top of other technological solutions being pursued in the transportation sector. In this paper, a model-free deep reinforcement learning (RL) control agent is proposed for active eco-driving assistance that trades off fuel consumption against other driver-accommodation objectives and learns optimal traction torque and transmission shifting policies from experience. The training scheme for the proposed RL agent uses an off-policy actor-critic architecture that iteratively performs policy evaluation with a multi-step return and policy improvement with the Maximum a Posteriori Policy Optimization algorithm for hybrid action spaces. The proposed eco-driving RL agent is implemented on a commercial vehicle in car-following traffic. It shows superior performance in minimizing fuel consumption compared to a baseline controller that has full knowledge of fuel-efficiency tables.

* 2022 American Control Conference
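
The multi-step return mentioned in the abstract above can be illustrated with the standard n-step target: the discounted sum of n observed rewards plus a discounted bootstrap from a value estimate at the final state. This sketch shows the plain, uncorrected form; off-policy methods often add importance-weighting corrections, and the abstract does not say which variant is used. The function name and the numbers are illustrative assumptions.

```python
def n_step_return(rewards, bootstrap_value, gamma=0.99):
    """Compute r_0 + gamma*r_1 + ... + gamma^(n-1)*r_(n-1) + gamma^n * V(s_n).

    rewards: the n rewards observed along the trajectory segment.
    bootstrap_value: the critic's value estimate at the state after step n.
    """
    g = bootstrap_value
    # Accumulate backwards so each step applies one factor of gamma.
    for r in reversed(rewards):
        g = r + gamma * g
    return g

# Three steps of reward 1 and a bootstrap value of 10 with gamma = 0.5:
# 1 + 0.5 + 0.25 + 0.125 * 10 = 3.0
target = n_step_return([1.0, 1.0, 1.0], bootstrap_value=10.0, gamma=0.5)
```

Using several real rewards before bootstrapping reduces reliance on an inaccurate critic early in training, at the cost of higher variance in the target.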
