Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Francisco Aguilera-Martínez

Differential Privacy in Machine Learning: From Symbolic AI to LLMs

Jun 13, 2025

Francisco Aguilera-Martínez, Fernando Berzal

Abstract:Machine learning models should not reveal particular information that is not otherwise accessible. Differential privacy provides a formal framework to mitigate privacy risks by ensuring that the inclusion or exclusion of any single data point does not significantly alter the output of an algorithm, thus limiting the exposure of private information. This survey paper explores the foundational definitions of differential privacy, reviews its original formulations and tracing its evolution through key research contributions. It then provides an in-depth examination of how DP has been integrated into machine learning models, analyzing existing proposals and methods to preserve privacy when training ML models. Finally, it describes how DP-based ML techniques can be evaluated in practice. %Finally, it discusses the broader implications of DP, highlighting its potential for public benefit, its real-world applications, and the challenges it faces, including vulnerabilities to adversarial attacks. By offering a comprehensive overview of differential privacy in machine learning, this work aims to contribute to the ongoing development of secure and responsible AI systems.

* arXiv admin note: text overlap with arXiv:2303.00654 by other authors

Via

Access Paper or Ask Questions

LLM Security: Vulnerabilities, Attacks, Defenses, and Countermeasures

May 02, 2025

Francisco Aguilera-Martínez, Fernando Berzal

Figure 1 for LLM Security: Vulnerabilities, Attacks, Defenses, and Countermeasures

Figure 2 for LLM Security: Vulnerabilities, Attacks, Defenses, and Countermeasures

Abstract:As large language models (LLMs) continue to evolve, it is critical to assess the security threats and vulnerabilities that may arise both during their training phase and after models have been deployed. This survey seeks to define and categorize the various attacks targeting LLMs, distinguishing between those that occur during the training phase and those that affect already trained models. A thorough analysis of these attacks is presented, alongside an exploration of defense mechanisms designed to mitigate such threats. Defenses are classified into two primary categories: prevention-based and detection-based defenses. Furthermore, our survey summarizes possible attacks and their corresponding defense strategies. It also provides an evaluation of the effectiveness of the known defense mechanisms for the different security threats. Our survey aims to offer a structured framework for securing LLMs, while also identifying areas that require further research to improve and strengthen defenses against emerging security challenges.

Via

Access Paper or Ask Questions

Differential Privacy Regularization: Protecting Training Data Through Loss Function Regularization

Sep 25, 2024

Francisco Aguilera-Martínez, Fernando Berzal

Abstract:Training machine learning models based on neural networks requires large datasets, which may contain sensitive information. The models, however, should not expose private information from these datasets. Differentially private SGD [DP-SGD] requires the modification of the standard stochastic gradient descent [SGD] algorithm for training new models. In this short paper, a novel regularization strategy is proposed to achieve the same goal in a more efficient manner.

Via

Access Paper or Ask Questions