Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Hikmat Farhat

High-resolution Image-based Malware Classification using Multiple Instance Learning

Nov 21, 2023

Tim Peters, Hikmat Farhat

Abstract:This paper proposes a novel method of classifying malware into families using high-resolution greyscale images and multiple instance learning to overcome adversarial binary enlargement. Current methods of visualisation-based malware classification largely rely on lossy transformations of inputs such as resizing to handle the large, variable-sized images. Through empirical analysis and experimentation, it is shown that these approaches cause crucial information loss that can be exploited. The proposed solution divides the images into patches and uses embedding-based multiple instance learning with a convolutional neural network and an attention aggregation function for classification. The implementation is evaluated on the Microsoft Malware Classification dataset and achieves accuracies of up to $96.6\%$ on adversarially enlarged samples compared to the baseline of $22.8\%$. The Python code is available online at https://github.com/timppeters/MIL-Malware-Images .

* 14 pages, 13 figures, 2 tables

Via

Access Paper or Ask Questions

Malware Classification Using Transfer Learning

Jul 29, 2021

Hikmat Farhat, Veronica Rammouz

Figure 1 for Malware Classification Using Transfer Learning

Figure 2 for Malware Classification Using Transfer Learning

Figure 3 for Malware Classification Using Transfer Learning

Figure 4 for Malware Classification Using Transfer Learning

Abstract:With the rapid growth of the number of devices on the Internet, malware poses a threat not only to the affected devices but also their ability to use said devices to launch attacks on the Internet ecosystem. Rapid malware classification is an important tools to combat that threat. One of the successful approaches to classification is based on malware images and deep learning. While many deep learning architectures are very accurate they usually take a long time to train. In this work we perform experiments on multiple well known, pre-trained, deep network architectures in the context of transfer learning. We show that almost all them classify malware accurately with a very short training period.

Via

Access Paper or Ask Questions