Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Dmitry Kazhdan

Now You See Me : Concept-based Model Extraction

Oct 25, 2020

Dmitry Kazhdan, Botty Dimanov, Mateja Jamnik, Pietro Liò, Adrian Weller

Figure 1 for Now You See Me : Concept-based Model Extraction

Figure 2 for Now You See Me : Concept-based Model Extraction

Figure 3 for Now You See Me : Concept-based Model Extraction

Figure 4 for Now You See Me : Concept-based Model Extraction

Abstract:Deep Neural Networks (DNNs) have achieved remarkable performance on a range of tasks. A key step to further empowering DNN-based approaches is improving their explainability. In this work we present CME: a concept-based model extraction framework, used for analysing DNN models via concept-based extracted models. Using two case studies (dSprites, and Caltech UCSD Birds), we demonstrate how CME can be used to (i) analyse the concept information learned by a DNN model (ii) analyse how a DNN uses this concept information when predicting output labels (iii) identify key concept information that can further improve DNN predictive performance (for one of the case studies, we showed how model accuracy can be improved by over 14%, using only 30% of the available concepts).

* Presented at the AIMLAI workshop at the 29th ACM International Conference on Information and Knowledge Management (CIKM 2020)

Via

Access Paper or Ask Questions

MARLeME: A Multi-Agent Reinforcement Learning Model Extraction Library

Apr 16, 2020

Dmitry Kazhdan, Zohreh Shams, Pietro Liò

Figure 1 for MARLeME: A Multi-Agent Reinforcement Learning Model Extraction Library

Figure 2 for MARLeME: A Multi-Agent Reinforcement Learning Model Extraction Library

Figure 3 for MARLeME: A Multi-Agent Reinforcement Learning Model Extraction Library

Figure 4 for MARLeME: A Multi-Agent Reinforcement Learning Model Extraction Library

Abstract:Multi-Agent Reinforcement Learning (MARL) encompasses a powerful class of methodologies that have been applied in a wide range of fields. An effective way to further empower these methodologies is to develop libraries and tools that could expand their interpretability and explainability. In this work, we introduce MARLeME: a MARL model extraction library, designed to improve explainability of MARL systems by approximating them with symbolic models. Symbolic models offer a high degree of interpretability, well-defined properties, and verifiable behaviour. Consequently, they can be used to inspect and better understand the underlying MARL system and corresponding MARL agents, as well as to replace all/some of the agents that are particularly safety and security critical.

* Presented at the KR2ML workshop at the 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada

Via

Access Paper or Ask Questions