Alert button
Picture for Emmanuel Ekwedike

Emmanuel Ekwedike

Alert button

TOP: Backdoor Detection in Neural Networks via Transferability of Perturbation

Mar 18, 2021
Todd Huster, Emmanuel Ekwedike

Figure 1 for TOP: Backdoor Detection in Neural Networks via Transferability of Perturbation
Figure 2 for TOP: Backdoor Detection in Neural Networks via Transferability of Perturbation
Figure 3 for TOP: Backdoor Detection in Neural Networks via Transferability of Perturbation
Figure 4 for TOP: Backdoor Detection in Neural Networks via Transferability of Perturbation

Deep neural networks (DNNs) are vulnerable to "backdoor" poisoning attacks, in which an adversary implants a secret trigger into an otherwise normally functioning model. Detection of backdoors in trained models without access to the training data or example triggers is an important open problem. In this paper, we identify an interesting property of these models: adversarial perturbations transfer from image to image more readily in poisoned models than in clean models. This holds for a variety of model and trigger types, including triggers that are not linearly separable from clean data. We use this feature to detect poisoned models in the TrojAI benchmark, as well as additional models.

Viaarxiv icon

Feedback-Based Tree Search for Reinforcement Learning

May 15, 2018
Daniel R. Jiang, Emmanuel Ekwedike, Han Liu

Figure 1 for Feedback-Based Tree Search for Reinforcement Learning
Figure 2 for Feedback-Based Tree Search for Reinforcement Learning
Figure 3 for Feedback-Based Tree Search for Reinforcement Learning
Figure 4 for Feedback-Based Tree Search for Reinforcement Learning

Inspired by recent successes of Monte-Carlo tree search (MCTS) in a number of artificial intelligence (AI) application domains, we propose a model-based reinforcement learning (RL) technique that iteratively applies MCTS on batches of small, finite-horizon versions of the original infinite-horizon Markov decision process. The terminal condition of the finite-horizon problems, or the leaf-node evaluator of the decision tree generated by MCTS, is specified using a combination of an estimated value function and an estimated policy function. The recommendations generated by the MCTS procedure are then provided as feedback in order to refine, through classification and regression, the leaf-node evaluator for the next iteration. We provide the first sample complexity bounds for a tree search-based RL algorithm. In addition, we show that a deep neural network implementation of the technique can create a competitive AI agent for the popular multi-player online battle arena (MOBA) game King of Glory.

* 19 pages, to be presented at ICML 2018 
Viaarxiv icon