Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

"Information": models, code, and papers

Outlier-based Autism Detection using Longitudinal Structural MRI

Feb 21, 2022
Devika K, Venkata Ramana Murthy Oruganti, Dwarikanath Mahapatra, Ramanathan Subramanian

Figure 1 for Outlier-based Autism Detection using Longitudinal Structural MRI

Figure 2 for Outlier-based Autism Detection using Longitudinal Structural MRI

Figure 3 for Outlier-based Autism Detection using Longitudinal Structural MRI

Figure 4 for Outlier-based Autism Detection using Longitudinal Structural MRI

Diagnosis of Autism Spectrum Disorder (ASD) using clinical evaluation (cognitive tests) is challenging due to wide variations amongst individuals. Since no effective treatment exists, prompt and reliable ASD diagnosis can enable the effective preparation of treatment regimens. This paper proposes structural Magnetic Resonance Imaging (sMRI)-based ASD diagnosis via an outlier detection approach. To learn Spatio-temporal patterns in structural brain connectivity, a Generative Adversarial Network (GAN) is trained exclusively with sMRI scans of healthy subjects. Given a stack of three adjacent slices as input, the GAN generator reconstructs the next three adjacent slices; the GAN discriminator then identifies ASD sMRI scan reconstructions as outliers. This model is compared against two other baselines -- a simpler UNet and a sophisticated Self-Attention GAN. Axial, Coronal, and Sagittal sMRI slices from the multi-site ABIDE II dataset are used for evaluation. Extensive experiments reveal that our ASD detection framework performs comparably with the state-of-the-art with far fewer training data. Furthermore, longitudinal data (two scans per subject over time) achieve 17-28% higher accuracy than cross-sectional data (one scan per subject). Among other findings, metrics employed for model training as well as reconstruction loss computation impact detection performance, and the coronal modality is found to best encode structural information for ASD detection.

Via

Access Paper or Ask Questions

Single Trajectory Nonparametric Learning of Nonlinear Dynamics

Feb 16, 2022
Ingvar Ziemann, Henrik Sandberg, Nikolai Matni

Figure 1 for Single Trajectory Nonparametric Learning of Nonlinear Dynamics

Given a single trajectory of a dynamical system, we analyze the performance of the nonparametric least squares estimator (LSE). More precisely, we give nonasymptotic expected $l^2$-distance bounds between the LSE and the true regression function, where expectation is evaluated on a fresh, counterfactual, trajectory. We leverage recently developed information-theoretic methods to establish the optimality of the LSE for nonparametric hypotheses classes in terms of supremum norm metric entropy and a subgaussian parameter. Next, we relate this subgaussian parameter to the stability of the underlying process using notions from dynamical systems theory. When combined, these developments lead to rate-optimal error bounds that scale as $T^{-1/(2+q)}$ for suitably stable processes and hypothesis classes with metric entropy growth of order $\delta^{-q}$. Here, $T$ is the length of the observed trajectory, $\delta \in \mathbb{R}_+$ is the packing granularity and $q\in (0,2)$ is a complexity term. Finally, we specialize our results to a number of scenarios of practical interest, such as Lipschitz dynamics, generalized linear models, and dynamics described by functions in certain classes of Reproducing Kernel Hilbert Spaces (RKHS).

Via

Access Paper or Ask Questions

One-shot Text Field Labeling using Attention and Belief Propagation for Structure Information Extraction

Sep 09, 2020
Mengli Cheng, Minghui Qiu, Xing Shi, Jun Huang, Wei Lin

Figure 1 for One-shot Text Field Labeling using Attention and Belief Propagation for Structure Information Extraction

Figure 2 for One-shot Text Field Labeling using Attention and Belief Propagation for Structure Information Extraction

Figure 3 for One-shot Text Field Labeling using Attention and Belief Propagation for Structure Information Extraction

Figure 4 for One-shot Text Field Labeling using Attention and Belief Propagation for Structure Information Extraction

Structured information extraction from document images usually consists of three steps: text detection, text recognition, and text field labeling. While text detection and text recognition have been heavily studied and improved a lot in literature, text field labeling is less explored and still faces many challenges. Existing learning based methods for text labeling task usually require a large amount of labeled examples to train a specific model for each type of document. However, collecting large amounts of document images and labeling them is difficult and sometimes impossible due to privacy issues. Deploying separate models for each type of document also consumes a lot of resources. Facing these challenges, we explore one-shot learning for the text field labeling task. Existing one-shot learning methods for the task are mostly rule-based and have difficulty in labeling fields in crowded regions with few landmarks and fields consisting of multiple separate text regions. To alleviate these problems, we proposed a novel deep end-to-end trainable approach for one-shot text field labeling, which makes use of attention mechanism to transfer the layout information between document images. We further applied conditional random field on the transferred layout information for the refinement of field labeling. We collected and annotated a real-world one-shot field labeling dataset with a large variety of document types and conducted extensive experiments to examine the effectiveness of the proposed model. To stimulate research in this direction, the collected dataset and the one-shot model will be released1.

* ACMMM 2020
* 9 pages

Via

Access Paper or Ask Questions

Large-Scale Inventory Optimization: A Recurrent-Neural-Networks-Inspired Simulation Approach

Jan 15, 2022
Tan Wan, L. Jeff Hong

Figure 1 for Large-Scale Inventory Optimization: A Recurrent-Neural-Networks-Inspired Simulation Approach

Figure 2 for Large-Scale Inventory Optimization: A Recurrent-Neural-Networks-Inspired Simulation Approach

Figure 3 for Large-Scale Inventory Optimization: A Recurrent-Neural-Networks-Inspired Simulation Approach

Figure 4 for Large-Scale Inventory Optimization: A Recurrent-Neural-Networks-Inspired Simulation Approach

Many large-scale production networks include thousands types of final products and tens to hundreds thousands types of raw materials and intermediate products. These networks face complicated inventory management decisions, which are often too complicated for inventory models and too large for simulation models. In this paper, by combing efficient computational tools of recurrent neural networks (RNN) and the structural information of production networks, we propose a RNN inspired simulation approach that may be thousands times faster than existing simulation approach and is capable of solving large-scale inventory optimization problems in a reasonable amount of time.

Via

Access Paper or Ask Questions

Estimating Causal Effects of Multi-Aspect Online Reviews with Multi-Modal Proxies

Jan 02, 2022
Lu Cheng, Ruocheng Guo, Huan Liu

Figure 1 for Estimating Causal Effects of Multi-Aspect Online Reviews with Multi-Modal Proxies

Figure 2 for Estimating Causal Effects of Multi-Aspect Online Reviews with Multi-Modal Proxies

Figure 3 for Estimating Causal Effects of Multi-Aspect Online Reviews with Multi-Modal Proxies

Figure 4 for Estimating Causal Effects of Multi-Aspect Online Reviews with Multi-Modal Proxies

Online reviews enable consumers to engage with companies and provide important feedback. Due to the complexity of the high-dimensional text, these reviews are often simplified as a single numerical score, e.g., ratings or sentiment scores. This work empirically examines the causal effects of user-generated online reviews on a granular level: we consider multiple aspects, e.g., the Food and Service of a restaurant. Understanding consumers' opinions toward different aspects can help evaluate business performance in detail and strategize business operations effectively. Specifically, we aim to answer interventional questions such as What will the restaurant popularity be if the quality w.r.t. its aspect Service is increased by 10%? The defining challenge of causal inference with observational data is the presence of "confounder", which might not be observed or measured, e.g., consumers' preference to food type, rendering the estimated effects biased and high-variance. To address this challenge, we have recourse to the multi-modal proxies such as the consumer profile information and interactions between consumers and businesses. We show how to effectively leverage the rich information to identify and estimate causal effects of multiple aspects embedded in online reviews. Empirical evaluations on synthetic and real-world data corroborate the efficacy and shed light on the actionable insight of the proposed approach.

* 10 pages, 6 figures, accepted to WSDM22

Via

Access Paper or Ask Questions

Adaptive Resonance Theory-based Topological Clustering with a Divisive Hierarchical Structure Capable of Continual Learning

Jan 26, 2022
Naoki Masuyama, Narito Amako, Yuna Yamada, Yusuke Nojima, Hisao Ishibuchi

Figure 1 for Adaptive Resonance Theory-based Topological Clustering with a Divisive Hierarchical Structure Capable of Continual Learning

Figure 2 for Adaptive Resonance Theory-based Topological Clustering with a Divisive Hierarchical Structure Capable of Continual Learning

Figure 3 for Adaptive Resonance Theory-based Topological Clustering with a Divisive Hierarchical Structure Capable of Continual Learning

Figure 4 for Adaptive Resonance Theory-based Topological Clustering with a Divisive Hierarchical Structure Capable of Continual Learning

Thanks to an ability for handling the plasticity-stability dilemma, Adaptive Resonance Theory (ART) is considered as an effective approach for realizing continual learning. In general, however, the clustering performance of ART-based algorithms strongly depends on a similarity threshold, i.e., a vigilance parameter, which is data-dependent and specified by hand. This paper proposes an ART-based topological clustering algorithm with a mechanism that automatically estimates a similarity threshold from a distribution of data points. In addition, for the improving information extraction performance, a divisive hierarchical clustering algorithm capable of continual learning is proposed by introducing a hierarchical structure to the proposed algorithm. Simulation experiments show that the proposed algorithm shows the comparative clustering performance compared with recently proposed hierarchical clustering algorithms.

Via

Access Paper or Ask Questions

Generating Chinese Poetry from Images via Concrete and Abstract Information

Mar 24, 2020
Yusen Liu, Dayiheng Liu, Jiancheng Lv, Yongsheng Sang

Figure 1 for Generating Chinese Poetry from Images via Concrete and Abstract Information

Figure 2 for Generating Chinese Poetry from Images via Concrete and Abstract Information

Figure 3 for Generating Chinese Poetry from Images via Concrete and Abstract Information

Figure 4 for Generating Chinese Poetry from Images via Concrete and Abstract Information

In recent years, the automatic generation of classical Chinese poetry has made great progress. Besides focusing on improving the quality of the generated poetry, there is a new topic about generating poetry from an image. However, the existing methods for this topic still have the problem of topic drift and semantic inconsistency, and the image-poem pairs dataset is hard to be built when training these models. In this paper, we extract and integrate the Concrete and Abstract information from images to address those issues. We proposed an infilling-based Chinese poetry generation model which can infill the Concrete keywords into each line of poems in an explicit way, and an abstract information embedding to integrate the Abstract information into generated poems. In addition, we use non-parallel data during training and construct separate image datasets and poem datasets to train the different components in our framework. Both automatic and human evaluation results show that our approach can generate poems which have better consistency with images without losing the quality.

* Accepted by the 2020 International Joint Conference on Neural Networks (IJCNN 2020)

Via

Access Paper or Ask Questions

TSAX is Trending

Dec 24, 2021
Muhammad Marwan Muhammad Fuad

Time series mining is an important branch of data mining, as time series data is ubiquitous and has many applications in several domains. The main task in time series mining is classification. Time series representation methods play an important role in time series classification and other time series mining tasks. One of the most popular representation methods of time series data is the Symbolic Aggregate approXimation (SAX). The secret behind its popularity is its simplicity and efficiency. SAX has however one major drawback, which is its inability to represent trend information. Several methods have been proposed to enable SAX to capture trend information, but this comes at the expense of complex processing, preprocessing, or post-processing procedures. In this paper we present a new modification of SAX that we call Trending SAX (TSAX), which only adds minimal complexity to SAX, but substantially improves its performance in time series classification. This is validated experimentally on 50 datasets. The results show the superior performance of our method, as it gives a smaller classification error on 39 datasets compared with SAX.

* Lecture Notes in Computer Science 12743, Springer 2021
* 21st International Conference on Computational Science (ICCS 2021)

Via

Access Paper or Ask Questions

Efficient Video Segmentation Models with Per-frame Inference

Feb 24, 2022
Yifan Liu, Chunhua Shen, Changqian Yu, Jingdong Wang

Figure 1 for Efficient Video Segmentation Models with Per-frame Inference

Figure 2 for Efficient Video Segmentation Models with Per-frame Inference

Figure 3 for Efficient Video Segmentation Models with Per-frame Inference

Figure 4 for Efficient Video Segmentation Models with Per-frame Inference

Most existing real-time deep models trained with each frame independently may produce inconsistent results across the temporal axis when tested on a video sequence. A few methods take the correlations in the video sequence into account,e.g., by propagating the results to the neighboring frames using optical flow or extracting frame representations using multi-frame information, which may lead to inaccurate results or unbalanced latency. In this work, we focus on improving the temporal consistency without introducing computation overhead in inference. To this end, we perform inference at each frame. Temporal consistency is achieved by learning from video frames with extra constraints during the training phase. introduced for inference. We propose several techniques to learn from the video sequence, including a temporal consistency loss and online/offline knowledge distillation methods. On the task of semantic video segmentation, weighing among accuracy, temporal smoothness, and efficiency, our proposed method outperforms keyframe-based methods and a few baseline methods that are trained with each frame independently, on datasets including Cityscapes, Camvid, and 300VW-Mask. We further apply our training method to video instance segmentation on YouTubeVISand develop an application of portrait matting in video sequences, by segmenting temporally consistent instance-level trimaps across frames. Experiments show superior qualitative and quantitative results. Code is available at: https://git.io/vidseg.

* arXiv admin note: text overlap with arXiv:2002.11433

Via

Access Paper or Ask Questions

Random Orthogonalization for Federated Learning in Massive MIMO Systems

Jan 29, 2022
Xizixiang Wei, Cong Shen, Jing Yang, H. Vincent Poor

Figure 1 for Random Orthogonalization for Federated Learning in Massive MIMO Systems

Figure 2 for Random Orthogonalization for Federated Learning in Massive MIMO Systems

Figure 3 for Random Orthogonalization for Federated Learning in Massive MIMO Systems

Figure 4 for Random Orthogonalization for Federated Learning in Massive MIMO Systems

We propose a novel uplink communication method, coined random orthogonalization, for federated learning (FL) in a massive multiple-input and multiple-output (MIMO) wireless system. The key novelty of random orthogonalization comes from the tight coupling of FL model aggregation and two unique characteristics of massive MIMO - channel hardening and favorable propagation. As a result, random orthogonalization can achieve natural over-the-air model aggregation without requiring transmitter side channel state information, while significantly reducing the channel estimation overhead at the receiver. Theoretical analyses with respect to both communication and machine learning performances are carried out. In particular, an explicit relationship among the convergence rate, the number of clients and the number of antennas is established. Experimental results validate the effectiveness and efficiency of random orthogonalization for FL in massive MIMO.

* 6 pages, 1 table, 4 figures, accepted to International Conference on Communications (ICC) 2022

Via

Access Paper or Ask Questions