Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Alessandro Palmas

Reinforcement Learning for Decision-Level Interception Prioritization in Drone Swarm Defense

Aug 01, 2025

Alessandro Palmas

Abstract:The growing threat of low-cost kamikaze drone swarms poses a critical challenge to modern defense systems demanding rapid and strategic decision-making to prioritize interceptions across multiple effectors and high-value target zones. In this work, we present a case study demonstrating the practical advantages of reinforcement learning in addressing this challenge. We introduce a high-fidelity simulation environment that captures realistic operational constraints, within which a decision-level reinforcement learning agent learns to coordinate multiple effectors for optimal interception prioritization. Operating in a discrete action space, the agent selects which drone to engage per effector based on observed state features such as positions, classes, and effector status. We evaluate the learned policy against a handcrafted rule-based baseline across hundreds of simulated attack scenarios. The reinforcement learning based policy consistently achieves lower average damage and higher defensive efficiency in protecting critical zones. This case study highlights the potential of reinforcement learning as a strategic layer within defense architectures, enhancing resilience without displacing existing control systems. All code and simulation assets are publicly released for full reproducibility, and a video demonstration illustrates the policy's qualitative behavior.

* 11 pages, 10 figures

Via

Access Paper or Ask Questions

Deep Learning Computer Vision Algorithms for Real-time UAVs On-board Camera Image Processing

Nov 02, 2022

Alessandro Palmas, Pietro Andronico

Abstract:This paper describes how advanced deep learning based computer vision algorithms are applied to enable real-time on-board sensor processing for small UAVs. Four use cases are considered: target detection, classification and localization, road segmentation for autonomous navigation in GNSS-denied zones, human body segmentation, and human action recognition. All algorithms have been developed using state-of-the-art image processing methods based on deep neural networks. Acquisition campaigns have been carried out to collect custom datasets reflecting typical operational scenarios, where the peculiar point of view of a multi-rotor UAV is replicated. Algorithms architectures and trained models performances are reported, showing high levels of both accuracy and inference speed. Output examples and on-field videos are presented, demonstrating models operation when deployed on a GPU-powered commercial embedded device (NVIDIA Jetson Xavier) mounted on board of a custom quad-rotor, paving the way to enabling high level autonomy.

* 10 pages, 12 figures, NATO AVT-353 Research Workshop "Artificial Intelligence in Cockpits for UAVs", Turin, Italy, 26 April 2022

Via

Access Paper or Ask Questions

DIAMBRA Arena: a New Reinforcement Learning Platform for Research and Experimentation

Oct 19, 2022

Alessandro Palmas

Figure 1 for DIAMBRA Arena: a New Reinforcement Learning Platform for Research and Experimentation

Figure 2 for DIAMBRA Arena: a New Reinforcement Learning Platform for Research and Experimentation

Figure 3 for DIAMBRA Arena: a New Reinforcement Learning Platform for Research and Experimentation

Figure 4 for DIAMBRA Arena: a New Reinforcement Learning Platform for Research and Experimentation

Abstract:The recent advances in reinforcement learning have led to effective methods able to obtain above human-level performances in very complex environments. However, once solved, these environments become less valuable, and new challenges with different or more complex scenarios are needed to support research advances. This work presents DIAMBRA Arena, a new platform for reinforcement learning research and experimentation, featuring a collection of high-quality environments exposing a Python API fully compliant with OpenAI Gym standard. They are episodic tasks with discrete actions and observations composed by raw pixels plus additional numerical values, all supporting both single player and two players mode, allowing to work on standard reinforcement learning, competitive multi-agent, human-agent competition, self-play, human-in-the-loop training and imitation learning. Software capabilities are demonstrated by successfully training multiple deep reinforcement learning agents with proximal policy optimization obtaining human-like behavior. Results confirm the utility of DIAMBRA Arena as a reinforcement learning research tool, providing environments designed to study some of the most challenging topics in the field.

* 14 pages, 7 figures

Via

Access Paper or Ask Questions