Yike Guo

Privacy Preservation in Federated Learning: Insights from the GDPR Perspective

Nov 10, 2020
Nguyen Truong, Kai Sun, Siyao Wang, Florian Guitton, Yike Guo

Along with the rapid growth of AI and machine learning-based applications and services, data privacy and security have become a critical challenge. Conventionally, data is collected and aggregated in a data centre on which machine learning models are trained. This centralised approach exposes personal data to severe risks of leakage, misuse, and abuse. Furthermore, in the era of the Internet of Things and big data, in which data is essentially distributed, transferring a vast amount of data to a data centre for processing is cumbersome. This is not only because of the difficulty of transferring and sharing data across data sources but also because of the challenge of complying with rigorous data protection regulations and complicated administrative procedures, such as the EU General Data Protection Regulation (GDPR). In this respect, federated learning (FL) emerges as a prospective solution that facilitates distributed collaborative learning without disclosing original training data whilst naturally complying with the GDPR. Recent research, however, has demonstrated that retaining data and computation on-device in FL is not sufficient to guarantee privacy. This is because the ML model parameters exchanged between parties in an FL system still carry sensitive information, which can be exploited in some privacy attacks. Therefore, FL systems should be equipped with efficient privacy-preserving techniques to comply with the GDPR. This article systematically surveys the state-of-the-art privacy-preserving techniques that can be employed in FL, as well as how these techniques mitigate data security and privacy risks. Furthermore, we provide insights into the challenges, along with prospective approaches following the GDPR regulatory guidelines that an FL system should implement to comply with the GDPR.
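
As a rough illustration of the parameter exchange the abstract refers to, the sketch below shows sample-weighted averaging of client model parameters, the aggregation step commonly used in FL. It is a minimal sketch, not the paper's method: the function name `federated_average` and the toy inputs are assumptions, and, as the abstract itself notes, plain averaging like this does not protect the exchanged parameters on its own.

```python
def federated_average(client_params, client_sizes):
    """Average client parameter vectors, weighted by local sample counts.

    client_params: list of equal-length lists of floats (one per client).
    client_sizes:  number of local training samples held by each client.
    Both names are hypothetical and used only for this illustration.
    """
    if len(client_params) != len(client_sizes) or not client_params:
        raise ValueError("need one sample count per client")
    total = sum(client_sizes)
    dim = len(client_params[0])
    # Each coordinate of the global model is the weighted mean of the
    # corresponding coordinates of the client models.
    return [
        sum(params[d] * size for params, size in zip(client_params, client_sizes)) / total
        for d in range(dim)
    ]


if __name__ == "__main__":
    # Two toy clients; only parameters leave the clients, never raw data.
    print(federated_average([[1.0, 2.0], [3.0, 4.0]], [1, 3]))
```

Only the parameter vectors are sent to the aggregator here, which is exactly the channel the surveyed privacy attacks target.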

* 20 pages, 8 figures, submitted to Elsevier FGCS journal

Suggestive Annotation of Brain Tumour Images with Gradient-guided Sampling

Jul 03, 2020
Chengliang Dai, Shuo Wang, Yuanhan Mo, Kaichen Zhou, Elsa Angelini, Yike Guo, Wenjia Bai

Machine learning has been widely adopted for medical image analysis in recent years given its promising performance in image segmentation and classification tasks. As a data-driven science, the success of machine learning, in particular supervised learning, largely depends on the availability of manually annotated datasets. For medical imaging applications, such annotated datasets are not easy to acquire. It takes a substantial amount of time and resources to curate an annotated medical image set. In this paper, we propose an efficient annotation framework for brain tumour images that is able to suggest informative sample images for human experts to annotate. Our experiments show that training a segmentation model with only 19% suggestively annotated patient scans from the BraTS 2019 dataset can achieve performance comparable to training a model on the full dataset for the whole tumour segmentation task. This demonstrates a promising way to save manual annotation cost and improve data efficiency in medical imaging applications.

* Paper accepted by MICCAI 2020

Deep Generative Model-based Quality Control for Cardiac MRI Segmentation

Jun 23, 2020
Shuo Wang, Giacomo Tarroni, Chen Qin, Yuanhan Mo, Chengliang Dai, Chen Chen, Ben Glocker, Yike Guo, Daniel Rueckert, Wenjia Bai

In recent years, convolutional neural networks have demonstrated promising performance in a variety of medical image segmentation tasks. However, when a trained segmentation model is deployed in real clinical settings, the model may not perform optimally. A major challenge is the poor-quality segmentations that may be generated due to degraded image quality or domain shift. There is a timely need to develop an automated quality control method that can detect poor segmentations and provide feedback to clinicians. Here we propose a novel deep generative model-based framework for quality control of cardiac MRI segmentation. It first learns a manifold of good-quality image-segmentation pairs using a generative model. The quality of a given test segmentation is then assessed by evaluating the difference from its projection onto the good-quality manifold. In particular, the projection is refined through iterative search in the latent space. The proposed method achieves high prediction accuracy on two publicly available cardiac MRI datasets. Moreover, it shows better generalisation ability than traditional regression-based methods. Our approach provides real-time and model-agnostic quality control for cardiac MRI segmentation, which has the potential to be integrated into clinical image analysis workflows.

* The paper is accepted to MICCAI 2020

An Epidemiological Modelling Approach for Covid19 via Data Assimilation

Apr 25, 2020
Philip Nadler, Shuo Wang, Rossella Arcucci, Xian Yang, Yike Guo

The global 2019-nCoV pandemic requires the evaluation of policy interventions to mitigate the future social and economic costs of quarantine measures worldwide. We propose an epidemiological model for forecasting and policy evaluation which incorporates new data in real time through variational data assimilation. We analyze and discuss infection rates in China, the US and Italy. In particular, we develop a custom compartmental SIR model, named the SITR model, fit to variables related to the epidemic in Chinese cities. We compare and discuss the results of the model, which is updated as new observations become available. A hybrid data assimilation approach is applied to make results robust to initial conditions. We use the model to do inference on infection numbers as well as parameters such as the disease transmissibility rate or the rate of recovery. The parameterisation of the model is parsimonious and extendable, allowing for the incorporation of additional data and parameters of interest. This allows for scalability and the extension of the model to other locations or the adoption of novel data sources.
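
For readers unfamiliar with compartmental models, the sketch below integrates the classic three-compartment SIR equations with forward Euler steps. It is an illustration only: it is the textbook SIR model, not the paper's SITR variant, and it includes no data assimilation. The function name `simulate_sir`, the step size, and the parameter values are assumptions chosen for the example; `beta` plays the role of the transmissibility rate and `gamma` the recovery rate mentioned in the abstract.

```python
def simulate_sir(beta, gamma, s0, i0, r0, days, dt=0.1):
    """Integrate the classic SIR equations with forward Euler steps.

    dS/dt = -beta * S * I / N
    dI/dt =  beta * S * I / N - gamma * I
    dR/dt =  gamma * I
    Returns one (S, I, R) tuple per day, including day 0.
    """
    n = s0 + i0 + r0
    s, i, r = float(s0), float(i0), float(r0)
    trajectory = [(s, i, r)]
    steps_per_day = int(round(1 / dt))
    for _ in range(days):
        for _ in range(steps_per_day):
            new_infections = beta * s * i / n * dt
            new_recoveries = gamma * i * dt
            s -= new_infections
            i += new_infections - new_recoveries
            r += new_recoveries
        trajectory.append((s, i, r))
    return trajectory


if __name__ == "__main__":
    # Hypothetical values: 1000 people, 10 initially infected.
    for day, (s, i, r) in enumerate(simulate_sir(0.3, 0.1, 990, 10, 0, days=5)):
        print(day, round(s, 1), round(i, 1), round(r, 1))
```

Fitting `beta` and `gamma` to observed case counts as new data arrive is the part that the paper addresses with data assimilation and that this sketch leaves out.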

* 8 pages

Efficient Deep Representation Learning by Adaptive Latent Space Sampling

Apr 12, 2020
Yuanhan Mo, Shuo Wang, Chengliang Dai, Rui Zhou, Zhongzhao Teng, Wenjia Bai, Yike Guo

Supervised deep learning requires a large number of annotated training samples (e.g. class labels for classification tasks, pixel- or voxel-wise label maps for segmentation tasks), which are expensive and time-consuming to obtain. During the training of a deep neural network, the annotated samples are fed into the network in mini-batches, where they are often regarded as being of equal importance. However, some of the samples may become less informative during training, as the magnitude of the gradient starts to vanish for these samples. Meanwhile, other samples of higher utility or hardness may be in greater demand for the training process to proceed and may require more exploitation. To address the challenges of expensive annotations and loss of sample informativeness, here we propose a novel training framework which adaptively selects informative samples that are fed to the training process. The adaptive selection or sampling is performed based on a hardness-aware strategy in the latent space constructed by a generative model. To evaluate the proposed training framework, we perform experiments on three different datasets, including MNIST and CIFAR-10 for image classification tasks and a medical image dataset, IVUS, for a biophysical simulation task. On all three datasets, the proposed framework outperforms a random sampling method, which demonstrates the effectiveness of the proposed framework.
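
The idea of preferring harder samples can be illustrated with a much simpler stand-in than the paper's method. The sketch below draws training indices with probability proportional to each sample's current loss. This is an assumption made for illustration: the paper performs its selection in the latent space of a generative model, which is not reproduced here, and the name `sample_by_hardness` is hypothetical.

```python
import random


def sample_by_hardness(losses, k, rng=None):
    """Draw k sample indices with probability proportional to per-sample loss.

    losses: non-negative loss values, one per training sample.
    rng:    optional random.Random instance for reproducibility.
    Falls back to uniform sampling when every loss is zero.
    """
    rng = rng or random.Random()
    indices = range(len(losses))
    if sum(losses) <= 0:
        return [rng.randrange(len(losses)) for _ in range(k)]
    # random.choices samples with replacement according to the weights.
    return rng.choices(indices, weights=losses, k=k)


if __name__ == "__main__":
    # Sample 2 has a much larger loss, so it should dominate the batch.
    print(sample_by_hardness([0.1, 0.1, 5.0, 0.2], k=8, rng=random.Random(0)))
```

Samples whose loss has collapsed towards zero are rarely drawn, which mirrors the abstract's observation that such samples contribute little gradient.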

Regularizing Deep Multi-Task Networks using Orthogonal Gradients

Dec 14, 2019
Mihai Suteu, Yike Guo

Deep neural networks are a promising approach towards multi-task learning because of their capability to leverage knowledge across domains and learn general purpose representations. Nevertheless, they can fail to live up to these promises as tasks often compete for a model's limited resources, potentially leading to lower overall performance. In this work we tackle the issue of interfering tasks through a comprehensive analysis of their training, derived from looking at the interaction between gradients within their shared parameters. Our empirical results show that well-performing models have low variance in the angles between task gradients and that popular regularization methods implicitly reduce this measure. Based on this observation, we propose a novel gradient regularization term that minimizes task interference by enforcing near orthogonal gradients. Updating the shared parameters using this property encourages task specific decoders to optimize different parts of the feature extractor, thus reducing competition. We evaluate our method with classification and regression tasks on the multiDigitMNIST, NYUv2 and SUN RGB-D datasets where we obtain competitive results.
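
One plausible way to express a penalty that favours near-orthogonal task gradients is the mean squared cosine similarity between every pair of task gradients on the shared parameters. The sketch below is an assumption about the general shape of such a term, not the exact regularizer from the paper; the function names are hypothetical and gradients are represented as plain lists of floats.

```python
import math
from itertools import combinations


def cosine_similarity(g1, g2):
    """Cosine of the angle between two gradient vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(g1, g2))
    norm1 = math.sqrt(sum(a * a for a in g1))
    norm2 = math.sqrt(sum(b * b for b in g2))
    if norm1 == 0.0 or norm2 == 0.0:
        return 0.0
    return dot / (norm1 * norm2)


def orthogonality_penalty(task_gradients):
    """Mean squared cosine similarity over all pairs of task gradients.

    The value is 0 when all gradients are mutually orthogonal and 1 when
    they are all parallel or anti-parallel.
    """
    pairs = list(combinations(task_gradients, 2))
    if not pairs:
        return 0.0
    return sum(cosine_similarity(a, b) ** 2 for a, b in pairs) / len(pairs)


if __name__ == "__main__":
    print(orthogonality_penalty([[1.0, 0.0], [0.0, 1.0]]))  # orthogonal tasks
    print(orthogonality_penalty([[1.0, 0.0], [2.0, 0.0]]))  # fully aligned tasks
```

In a training loop, a term like this would be scaled by a coefficient and added to the sum of task losses, so that minimizing it pushes task gradients on the shared layers apart.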

* 11 pages, 5 figures

Biologically inspired architectures for sample-efficient deep reinforcement learning

Nov 25, 2019
Pierre H. Richemond, Arinbjörn Kolbeinsson, Yike Guo

Deep reinforcement learning pays a heavy price in terms of sample efficiency and overparameterization of the neural networks used for function approximation. In this work, we use tensor factorization to learn more compact representations for reinforcement learning policies. We show empirically that in the low-data regime, it is possible to learn online policies with 2 to 10 times fewer total coefficients, with little to no loss of performance. We also leverage progress in second-order optimization, and use the theory of wavelet scattering to further reduce the number of learned coefficients, by forgoing learning the topmost convolutional layer filters altogether. We evaluate our results on the Atari suite against recent baseline algorithms that represent the state of the art in data efficiency, and get comparable results with an order of magnitude gain in weight parsimony.
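
To see how a factorization can cut the coefficient count, consider the simplest case, a low-rank matrix factorization: a dense m x n weight matrix is replaced by the product of an m x r and an r x n matrix. The sketch below only counts parameters under that assumption. The paper uses tensor factorizations, which generalize this idea, and the layer sizes and rank here are hypothetical.

```python
def dense_params(rows, cols):
    """Number of coefficients in a dense rows x cols weight matrix."""
    return rows * cols


def low_rank_params(rows, cols, rank):
    """Coefficients in a rank-r factorization W ~ U @ V, U: rows x r, V: r x cols."""
    return rank * (rows + cols)


def compression_ratio(rows, cols, rank):
    """How many times fewer coefficients the factorized layer stores."""
    return dense_params(rows, cols) / low_rank_params(rows, cols, rank)


if __name__ == "__main__":
    # Hypothetical 512 x 512 layer factorized at rank 32.
    print(dense_params(512, 512), low_rank_params(512, 512, 32))
    print(compression_ratio(512, 512, 32))
```

The ratio grows as the rank shrinks, at the cost of restricting which weight matrices the layer can represent.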

* Deep Reinforcement Learning Workshop, NeurIPS 2019, Vancouver, Canada

Automatic Brain Tumour Segmentation and Biophysics-Guided Survival Prediction

Nov 19, 2019
Shuo Wang, Chengliang Dai, Yuanhan Mo, Elsa Angelini, Yike Guo, Wenjia Bai

Gliomas are the most common malignant brain tumours with intrinsic heterogeneity. Accurate segmentation of gliomas and their sub-regions on multi-parametric magnetic resonance images (mpMRI) is of great clinical importance, which defines tumour size, shape and appearance and provides abundant information for preoperative diagnosis, treatment planning and survival prediction. Recent developments on deep learning have significantly improved the performance of automated medical image segmentation. In this paper, we compare several state-of-the-art convolutional neural network models for brain tumour image segmentation. Based on the ensembled segmentation, we present a biophysics-guided prognostic model for patient overall survival prediction which outperforms a data-driven radiomics approach. Our method won the second place of the MICCAI 2019 BraTS Challenge for the overall survival prediction.

* MICCAI BraTS 2019 Challenge

Unsupervised Annotation of Phenotypic Abnormalities via Semantic Latent Representations on Electronic Health Records

Nov 10, 2019
Jingqing Zhang, Xiaoyu Zhang, Kai Sun, Xian Yang, Chengliang Dai, Yike Guo

The extraction of phenotype information, which is naturally contained in electronic health records (EHRs), has been found to be useful in various clinical informatics applications such as disease diagnosis. However, due to imprecise descriptions, the lack of gold standards and the demand for efficiency, annotating phenotypic abnormalities on millions of EHR narratives is still challenging. In this work, we propose a novel unsupervised deep learning framework to annotate the phenotypic abnormalities from EHRs via semantic latent representations. The proposed framework takes advantage of the Human Phenotype Ontology (HPO), which is a knowledge base of phenotypic abnormalities, to standardize the annotation results. Experiments have been conducted on 52,722 EHRs from the MIMIC-III dataset. Quantitative and qualitative analyses have shown that the proposed framework achieves state-of-the-art annotation performance and computational efficiency compared with other methods.

* Accepted by BIBM 2019 (Regular)
