Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Zhuo Sun

Multi-task deep learning for large-scale building detail extraction from high-resolution satellite imagery

Oct 29, 2023

Zhen Qian, Min Chen, Zhuo Sun, Fan Zhang, Qingsong Xu, Jinzhao Guo, Zhiwei Xie, Zhixin Zhang

Figure 1 for Multi-task deep learning for large-scale building detail extraction from high-resolution satellite imagery

Figure 2 for Multi-task deep learning for large-scale building detail extraction from high-resolution satellite imagery

Figure 3 for Multi-task deep learning for large-scale building detail extraction from high-resolution satellite imagery

Figure 4 for Multi-task deep learning for large-scale building detail extraction from high-resolution satellite imagery

Abstract:Understanding urban dynamics and promoting sustainable development requires comprehensive insights about buildings. While geospatial artificial intelligence has advanced the extraction of such details from Earth observational data, existing methods often suffer from computational inefficiencies and inconsistencies when compiling unified building-related datasets for practical applications. To bridge this gap, we introduce the Multi-task Building Refiner (MT-BR), an adaptable neural network tailored for simultaneous extraction of spatial and attributional building details from high-resolution satellite imagery, exemplified by building rooftops, urban functional types, and roof architectural types. Notably, MT-BR can be fine-tuned to incorporate additional building details, extending its applicability. For large-scale applications, we devise a novel spatial sampling scheme that strategically selects limited but representative image samples. This process optimizes both the spatial distribution of samples and the urban environmental characteristics they contain, thus enhancing extraction effectiveness while curtailing data preparation expenditures. We further enhance MT-BR's predictive performance and generalization capabilities through the integration of advanced augmentation techniques. Our quantitative results highlight the efficacy of the proposed methods. Specifically, networks trained with datasets curated via our sampling method demonstrate improved predictive accuracy relative to those using alternative sampling approaches, with no alterations to network architecture. Moreover, MT-BR consistently outperforms other state-of-the-art methods in extracting building details across various metrics. The real-world practicality is also demonstrated in an application across Shanghai, generating a unified dataset that encompasses both the spatial and attributional details of buildings.

Via

Access Paper or Ask Questions

Meta-learning Control Variates: Variance Reduction with Limited Data

Mar 15, 2023

Zhuo Sun, Chris J. Oates, François-Xavier Briol

Figure 1 for Meta-learning Control Variates: Variance Reduction with Limited Data

Figure 2 for Meta-learning Control Variates: Variance Reduction with Limited Data

Figure 3 for Meta-learning Control Variates: Variance Reduction with Limited Data

Figure 4 for Meta-learning Control Variates: Variance Reduction with Limited Data

Abstract:Control variates can be a powerful tool to reduce the variance of Monte Carlo estimators, but constructing effective control variates can be challenging when the number of samples is small. In this paper, we show that when a large number of related integrals need to be computed, it is possible to leverage the similarity between these integration tasks to improve performance even when the number of samples per task is very small. Our approach, called meta learning CVs (Meta-CVs), can be used for up to hundreds or thousands of tasks. Our empirical assessment indicates that Meta-CVs can lead to significant variance reduction in such settings, and our theoretical analysis establishes general conditions under which Meta-CVs can be successfully trained.

Via

Access Paper or Ask Questions

Whittle Index Based Scheduling Policy for Minimizing the Cost of Age of Information

Sep 13, 2021

Zhifeng Tang, Zhuo Sun, Nan Yang, Xiangyun Zhou

Figure 1 for Whittle Index Based Scheduling Policy for Minimizing the Cost of Age of Information

Figure 2 for Whittle Index Based Scheduling Policy for Minimizing the Cost of Age of Information

Figure 3 for Whittle Index Based Scheduling Policy for Minimizing the Cost of Age of Information

Figure 4 for Whittle Index Based Scheduling Policy for Minimizing the Cost of Age of Information

Abstract:We design a new scheduling policy to minimize the general non-decreasing cost function of age of information (AoI) in a multiuser system. In this system, the base station stochastically generates time-sensitive packets and transmits them to corresponding user equipments via an unreliable channel. We first formulate the transmission scheduling problem as an average cost constrained Markov decision process problem. Through introducing the service charge, we derive the closed-form expression for the Whittle index, based on which we design the scheduling policy. Using numerical results, we demonstrate the performance gain of our designed scheduling policy compared to the existing policies, such as the optimal policy, the on-demand Whittle index policy, and the age greedy policy.

Via

Access Paper or Ask Questions

Dual MINE-based Neural Secure Communications under Gaussian Wiretap Channel

Feb 25, 2021

Jingjing Li, Zhuo Sun, Lei Zhang, Hongyu Zhu

Figure 1 for Dual MINE-based Neural Secure Communications under Gaussian Wiretap Channel

Figure 2 for Dual MINE-based Neural Secure Communications under Gaussian Wiretap Channel

Figure 3 for Dual MINE-based Neural Secure Communications under Gaussian Wiretap Channel

Figure 4 for Dual MINE-based Neural Secure Communications under Gaussian Wiretap Channel

Abstract:Recently, some researches are devoted to the topic of end-to-end learning a physical layer secure communication system based on autoencoder under Gaussian wiretap channel. However, in those works, the reliability and security of the encoder model were learned through necessary decoding outputs of not only legitimate receiver but also the eavesdropper. In fact, the assumption of known eavesdropper's decoder or its output is not practical. To address this issue, in this paper we propose a dual mutual information neural estimation (MINE) based neural secure communications model. The security constraints of this method is constructed only with the input and output signal samples of the legal and eavesdropper channels and benefit that training the encoder is completely independent of the decoder. Moreover, since the design of secure coding does not rely on the eavesdropper's decoding results, the security performance would not be affected by the eavesdropper's decoding means. Numerical results show that the performance of our model is guaranteed whether the eavesdropper learns the decoder himself or uses the legal decoder.

* 6 pages, 6 figures, ICC 2021

Via

Access Paper or Ask Questions

BSNet: Bi-Similarity Network for Few-shot Fine-grained Image Classification

Nov 29, 2020

Xiaoxu Li, Jijie Wu, Zhuo Sun, Zhanyu Ma, Jie Cao, Jing-Hao Xue

Figure 1 for BSNet: Bi-Similarity Network for Few-shot Fine-grained Image Classification

Figure 2 for BSNet: Bi-Similarity Network for Few-shot Fine-grained Image Classification

Figure 3 for BSNet: Bi-Similarity Network for Few-shot Fine-grained Image Classification

Figure 4 for BSNet: Bi-Similarity Network for Few-shot Fine-grained Image Classification

Abstract:Few-shot learning for fine-grained image classification has gained recent attention in computer vision. Among the approaches for few-shot learning, due to the simplicity and effectiveness, metric-based methods are favorably state-of-the-art on many tasks. Most of the metric-based methods assume a single similarity measure and thus obtain a single feature space. However, if samples can simultaneously be well classified via two distinct similarity measures, the samples within a class can distribute more compactly in a smaller feature space, producing more discriminative feature maps. Motivated by this, we propose a so-called \textit{Bi-Similarity Network} (\textit{BSNet}) that consists of a single embedding module and a bi-similarity module of two similarity measures. After the support images and the query images pass through the convolution-based embedding module, the bi-similarity module learns feature maps according to two similarity measures of diverse characteristics. In this way, the model is enabled to learn more discriminative and less similarity-biased features from few shots of fine-grained images, such that the model generalization ability can be significantly improved. Through extensive experiments by slightly modifying established metric/similarity based networks, we show that the proposed approach produces a substantial improvement on several fine-grained image benchmark datasets. Codes are available at: https://github.com/spraise/BSNet

* IEEE TIP 2020

Via

Access Paper or Ask Questions

A Concise Review of Recent Few-shot Meta-learning Methods

May 22, 2020

Xiaoxu Li, Zhuo Sun, Jing-Hao Xue, Zhanyu Ma

Figure 1 for A Concise Review of Recent Few-shot Meta-learning Methods

Abstract:Few-shot meta-learning has been recently reviving with expectations to mimic humanity's fast adaption to new concepts based on prior knowledge. In this short communication, we give a concise review on recent representative methods in few-shot meta-learning, which are categorized into four branches according to their technical characteristics. We conclude this review with some vital current challenges and future prospects in few-shot meta-learning.

* 7 pages

Via

Access Paper or Ask Questions

CAMEL: A Weakly Supervised Learning Framework for Histopathology Image Segmentation

Aug 28, 2019

Gang Xu, Zhigang Song, Zhuo Sun, Calvin Ku, Zhe Yang, Cancheng Liu, Shuhao Wang, Jianpeng Ma, Wei Xu

Figure 1 for CAMEL: A Weakly Supervised Learning Framework for Histopathology Image Segmentation

Figure 2 for CAMEL: A Weakly Supervised Learning Framework for Histopathology Image Segmentation

Figure 3 for CAMEL: A Weakly Supervised Learning Framework for Histopathology Image Segmentation

Figure 4 for CAMEL: A Weakly Supervised Learning Framework for Histopathology Image Segmentation

Abstract:Histopathology image analysis plays a critical role in cancer diagnosis and treatment. To automatically segment the cancerous regions, fully supervised segmentation algorithms require labor-intensive and time-consuming labeling at the pixel level. In this research, we propose CAMEL, a weakly supervised learning framework for histopathology image segmentation using only image-level labels. Using multiple instance learning (MIL)-based label enrichment, CAMEL splits the image into latticed instances and automatically generates instance-level labels. After label enrichment, the instance-level labels are further assigned to the corresponding pixels, producing the approximate pixel-level labels and making fully supervised training of segmentation models possible. CAMEL achieves comparable performance with the fully supervised approaches in both instance-level classification and pixel-level segmentation on CAMELYON16 and a colorectal adenoma dataset. Moreover, the generality of the automatic labeling methodology may benefit future weakly supervised learning studies for histopathology image analysis.

* 10 pages, 9 figures, accepted by ICCV 2019

Via

Access Paper or Ask Questions