Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:The TrojAI Software Framework: An OpenSource tool for Embedding Trojans into Deep Learning Models

Mar 13, 2020

Kiran Karra, Chace Ashcraft, Neil Fendley

Figure 1 for The TrojAI Software Framework: An OpenSource tool for Embedding Trojans into Deep Learning Models

Figure 2 for The TrojAI Software Framework: An OpenSource tool for Embedding Trojans into Deep Learning Models

Figure 3 for The TrojAI Software Framework: An OpenSource tool for Embedding Trojans into Deep Learning Models

Figure 4 for The TrojAI Software Framework: An OpenSource tool for Embedding Trojans into Deep Learning Models

Share this with someone who'll enjoy it:

Abstract:In this paper, we introduce the TrojAI software framework, an open source set of Python tools capable of generating triggered (poisoned) datasets and associated deep learning (DL) models with trojans at scale. We utilize the developed framework to generate a large set of trojaned MNIST classifiers, as well as demonstrate the capability to produce a trojaned reinforcement-learning model using vector observations. Results on MNIST show that the nature of the trigger, training batch size, and dataset poisoning percentage all affect successful embedding of trojans. We test Neural Cleanse against the trojaned MNIST models and successfully detect anomalies in the trained models approximately $18\%$ of the time. Our experiments and workflow indicate that the TrojAI software framework will enable researchers to easily understand the effects of various configurations of the dataset and training hyperparameters on the generated trojaned deep learning model, and can be used to rapidly and comprehensively test new trojan detection methods.

* 8 pages, 16 figures

View paper on

Share this with someone who'll enjoy it:

Title:The TrojAI Software Framework: An OpenSource tool for Embedding Trojans into Deep Learning Models

Paper and Code