Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Austin Clyde

Division of Data Science and Learning, Argonne National Laboratory, Argonne, IL, USA, Department of Computer Science, The University of Chicago, Chicago, IL, USA

A Systematic Approach to Featurization for Cancer Drug Sensitivity Predictions with Deep Learning

May 04, 2020

Austin Clyde, Tom Brettin, Alexander Partin, Maulik Shaulik, Hyunseung Yoo, Yvonne Evrard, Yitan Zhu, Fangfang Xia, Rick Stevens

Figure 1 for A Systematic Approach to Featurization for Cancer Drug Sensitivity Predictions with Deep Learning

Figure 2 for A Systematic Approach to Featurization for Cancer Drug Sensitivity Predictions with Deep Learning

Figure 3 for A Systematic Approach to Featurization for Cancer Drug Sensitivity Predictions with Deep Learning

Figure 4 for A Systematic Approach to Featurization for Cancer Drug Sensitivity Predictions with Deep Learning

Abstract:By combining various cancer cell line (CCL) drug screening panels, the size of the data has grown significantly to begin understanding how advances in deep learning can advance drug response predictions. In this paper we train >35,000 neural network models, sweeping over common featurization techniques. We found the RNA-seq to be highly redundant and informative even with subsets larger than 128 features. We found the inclusion of single nucleotide polymorphisms (SNPs) coded as count matrices improved model performance significantly, and no substantial difference in model performance with respect to molecular featurization between the common open source MOrdred descriptors and Dragon7 descriptors. Alongside this analysis, we outline data integration between CCL screening datasets and present evidence that new metrics and imbalanced data techniques, as well as advances in data standardization, need to be developed.

Via

Access Paper or Ask Questions