Andrea Ponte

Empirical Quantification of Spurious Correlations in Malware Detection

Jun 11, 2025

Bianca Perasso, Ludovico Lozza, Andrea Ponte, Luca Demetrio, Luca Oneto, Fabio Roli

Abstract: End-to-end deep learning exhibits unmatched performance for detecting malware, but this performance is achieved by exploiting spurious correlations: features that are highly relevant at inference time but that domain knowledge shows to be useless. While previous work highlighted that deep networks mainly focus on metadata, none investigated the phenomenon further or quantified its impact on the decision. In this work, we deepen our understanding of how spurious correlations affect deep learning for malware detection by highlighting how much models rely on empty spaces left by the compiler, which diminishes the relevance of the compiled code. Through our seminal analysis on a small-scale balanced dataset, we introduce a ranking of two end-to-end models to better understand which is more suitable to be put in production.

SLIFER: Investigating Performance and Robustness of Malware Detection Pipelines

May 23, 2024

Andrea Ponte, Dmitrijs Trizna, Luca Demetrio, Battista Biggio, Fabio Roli

Abstract: As a result of decades of research, Windows malware detection is approached through a plethora of techniques. However, there is an ongoing mismatch between academia, which pursues optimal performance in terms of detection rate and low false alarms, and the requirements of real-world scenarios. In particular, academia focuses on combining static and dynamic analysis within a single model or an ensemble of models, falling into several pitfalls such as (i) triggering dynamic analysis without considering the computational burden it requires; (ii) discarding impossible-to-analyse samples; and (iii) analysing robustness against adversarial attacks without considering that malware detectors are complemented with additional non-machine-learning components. Thus, in this paper we propose SLIFER, a novel Windows malware detection pipeline that sequentially leverages both static and dynamic analysis, interrupting computations as soon as one module triggers an alarm and requiring dynamic analysis only when needed. Contrary to the state of the art, we investigate how to deal with samples that resist analysis, showing how much they impact performance and concluding that it is better to flag them as legitimate so as not to drastically increase false alarms. Lastly, we perform a robustness evaluation of SLIFER leveraging content-injection attacks, and we show that, counter-intuitively, attacks are blocked more by YARA rules than by dynamic analysis, due to byte artifacts created while optimizing the adversarial strategy.
