Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

"Time": models, code, and papers

Transform-Based Multilinear Dynamical System for Tensor Time Series Analysis

Nov 18, 2018
Weijun Lu, Xiao-Yang Liu, Qingwei Wu, Yue Sun, Anwar Walid

Figure 1 for Transform-Based Multilinear Dynamical System for Tensor Time Series Analysis

Figure 2 for Transform-Based Multilinear Dynamical System for Tensor Time Series Analysis

Figure 3 for Transform-Based Multilinear Dynamical System for Tensor Time Series Analysis

Figure 4 for Transform-Based Multilinear Dynamical System for Tensor Time Series Analysis

We propose a novel multilinear dynamical system (MLDS) in a transform domain, named $\mathcal{L}$-MLDS, to model tensor time series. With transformations applied to a tensor data, the latent multidimensional correlations among the frontal slices are built, and thus resulting in the computational independence in the transform domain. This allows the exact separability of the multi-dimensional problem into multiple smaller LDS problems. To estimate the system parameters, we utilize the expectation-maximization (EM) algorithm to determine the parameters of each LDS. Further, $\mathcal{L}$-MLDSs significantly reduce the model parameters and allows parallel processing. Our general $\mathcal{L}$-MLDS model is implemented based on different transforms: discrete Fourier transform, discrete cosine transform and discrete wavelet transform. Due to the nonlinearity of these transformations, $\mathcal{L}$-MLDS is able to capture the nonlinear correlations within the data unlike the MLDS \cite{rogers2013multilinear} which assumes multi-way linear correlations. Using four real datasets, the proposed $\mathcal{L}$-MLDS is shown to achieve much higher prediction accuracy than the state-of-the-art MLDS and LDS with an equal number of parameters under different noise models. In particular, the relative errors are reduced by $50\% \sim 99\%$. Simultaneously, $\mathcal{L}$-MLDS achieves an exponential improvement in the model's training time than MLDS.

Via

Access Paper or Ask Questions

Investigating the Evolvability of Web Page Load Time

Feb 22, 2018
Brendan Cody-Kenny, Umberto Manganiello, John Farrelly, Adrian Ronayne, Eoghan Considine, Thomas McGuire, Michael O'Neill

Figure 1 for Investigating the Evolvability of Web Page Load Time

Figure 2 for Investigating the Evolvability of Web Page Load Time

Client-side Javascript execution environments (browsers) allow anonymous functions and event-based programming concepts such as callbacks. We investigate whether a mutate-and-test approach can be used to optimise web page load time in these environments. First, we characterise a web page load issue in a benchmark web page and derive performance metrics from page load event traces. We parse Javascript source code to an AST and make changes to method calls which appear in a web page load event trace. We present an operator based solely on code deletion and evaluate an existing "community-contributed" performance optimising code transform. By exploring Javascript code changes and exploiting combinations of non-destructive changes, we can optimise page load time by 41% in our benchmark web page.

* 8 Pages, to appear in EvoSET 2018

Via

Access Paper or Ask Questions

Initializing LSTM internal states via manifold learning

Apr 27, 2021
Felix P. Kemeth, Tom Bertalan, Nikolaos Evangelou, Tianqi Cui, Saurabh Malani, Ioannis G. Kevrekidis

Figure 1 for Initializing LSTM internal states via manifold learning

Figure 2 for Initializing LSTM internal states via manifold learning

Figure 3 for Initializing LSTM internal states via manifold learning

Figure 4 for Initializing LSTM internal states via manifold learning

We present an approach, based on learning an intrinsic data manifold, for the initialization of the internal state values of LSTM recurrent neural networks, ensuring consistency with the initial observed input data. Exploiting the generalized synchronization concept, we argue that the converged, "mature" internal states constitute a function on this learned manifold. The dimension of this manifold then dictates the length of observed input time series data required for consistent initialization. We illustrate our approach through a partially observed chemical model system, where initializing the internal LSTM states in this fashion yields visibly improved performance. Finally, we show that learning this data manifold enables the transformation of partially observed dynamics into fully observed ones, facilitating alternative identification paths for nonlinear dynamical systems.

Via

Access Paper or Ask Questions

End-to-End Multihop Retrieval for Compositional Question Answering over Long Documents

Jun 01, 2021
Haitian Sun, William W. Cohen, Ruslan Salakhutdinov

Figure 1 for End-to-End Multihop Retrieval for Compositional Question Answering over Long Documents

Figure 2 for End-to-End Multihop Retrieval for Compositional Question Answering over Long Documents

Figure 3 for End-to-End Multihop Retrieval for Compositional Question Answering over Long Documents

Figure 4 for End-to-End Multihop Retrieval for Compositional Question Answering over Long Documents

Answering complex questions from long documents requires aggregating multiple pieces of evidence and then predicting the answers. In this paper, we propose a multi-hop retrieval method, DocHopper, to answer compositional questions over long documents. At each step, DocHopper retrieves a paragraph or sentence embedding from the document, mixes the retrieved result with the query, and updates the query for the next step. In contrast to many other retrieval-based methods (e.g., RAG or REALM) the query is not augmented with a token sequence: instead, it is augmented by "numerically" combining it with another neural representation. This means that model is end-to-end differentiable. We demonstrate that utilizing document structure in this was can largely improve question-answering and retrieval performance on long documents. We experimented with DocHopper on three different QA tasks that require reading long documents to answer compositional questions: discourse entailment reasoning, factual QA with table and text, and information seeking QA from academic papers. DocHopper outperforms all baseline models and achieves state-of-the-art results on all datasets. Additionally, DocHopper is efficient at inference time, being 3~10 times faster than the baselines.

Via

Access Paper or Ask Questions

Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling

Mar 31, 2021
Isaac Elias, Heiga Zen, Jonathan Shen, Yu Zhang, Jia Ye, RJ Ryan, Yonghui Wu

Figure 1 for Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling

Figure 2 for Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling

Figure 3 for Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling

Figure 4 for Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling

This paper introduces Parallel Tacotron 2, a non-autoregressive neural text-to-speech model with a fully differentiable duration model which does not require supervised duration signals. The duration model is based on a novel attention mechanism and an iterative reconstruction loss based on Soft Dynamic Time Warping, this model can learn token-frame alignments as well as token durations automatically. Experimental results show that Parallel Tacotron 2 outperforms baselines in subjective naturalness in several diverse multi speaker evaluations. Its duration control capability is also demonstrated.

* Submitted to INTERSPEECH 2021

Via

Access Paper or Ask Questions

Language Models are Few-Shot Butlers

Apr 16, 2021
Vincent Micheli, François Fleuret

Figure 1 for Language Models are Few-Shot Butlers

Figure 2 for Language Models are Few-Shot Butlers

Figure 3 for Language Models are Few-Shot Butlers

Pretrained language models demonstrate strong performance in most NLP tasks when fine-tuned on small task-specific datasets. Hence, these autoregressive models constitute ideal agents to operate in text-based environments where language understanding and generative capabilities are essential. Nonetheless, collecting expert demonstrations in such environments is a time-consuming endeavour. We introduce a two-stage procedure to learn from a small set of demonstrations and further improve by interacting with an environment. We show that language models fine-tuned with only 1.2% of the expert demonstrations and a simple reinforcement learning algorithm achieve a 51% absolute improvement in success rate over existing methods in the ALFWorld environment.

Via

Access Paper or Ask Questions

FEAR: A Simple Lightweight Method to Rank Architectures

Jun 07, 2021
Debadeepta Dey, Shital Shah, Sebastien Bubeck

Figure 1 for FEAR: A Simple Lightweight Method to Rank Architectures

Figure 2 for FEAR: A Simple Lightweight Method to Rank Architectures

Figure 3 for FEAR: A Simple Lightweight Method to Rank Architectures

Figure 4 for FEAR: A Simple Lightweight Method to Rank Architectures

The fundamental problem in Neural Architecture Search (NAS) is to efficiently find high-performing architectures from a given search space. We propose a simple but powerful method which we call FEAR, for ranking architectures in any search space. FEAR leverages the viewpoint that neural networks are powerful non-linear feature extractors. First, we train different architectures in the search space to the same training or validation error. Then, we compare the usefulness of the features extracted by each architecture. We do so with a quick training keeping most of the architecture frozen. This gives fast estimates of the relative performance. We validate FEAR on Natsbench topology search space on three different datasets against competing baselines and show strong ranking correlation especially compared to recently proposed zero-cost methods. FEAR particularly excels at ranking high-performance architectures in the search space. When used in the inner loop of discrete search algorithms like random search, FEAR can cut down the search time by approximately 2.4X without losing accuracy. We additionally empirically study very recently proposed zero-cost measures for ranking and find that they breakdown in ranking performance as training proceeds and also that data-agnostic ranking scores which ignore the dataset do not generalize across dissimilar datasets.

* 31 pages, 8 figures

Via

Access Paper or Ask Questions

Model identification for ARMA time series through convolutional neural networks

Apr 12, 2018
Wai Hoh Tang, Adrian Röllin

Figure 1 for Model identification for ARMA time series through convolutional neural networks

Figure 2 for Model identification for ARMA time series through convolutional neural networks

Figure 3 for Model identification for ARMA time series through convolutional neural networks

Figure 4 for Model identification for ARMA time series through convolutional neural networks

In this paper, we use convolutional neural networks to address the problem of model identification for autoregressive moving average time series models. We compare the performance of several neural network architectures, trained on simulated time series, with likelihood based methods, in particular the Akaike and Bayesian information criteria. We find that our neural networks can significantly outperform these likelihood based methods in terms of accuracy and, by orders of magnitude, in terms of speed.

* 22 pages, 15 figures, 10 tables

Via

Access Paper or Ask Questions

Modeling preference time in middle distance triathlons

Jul 03, 2017
Iztok Fister, Andres Iglesias, Suash Deb, Dušan Fister, Iztok Fister Jr

Figure 1 for Modeling preference time in middle distance triathlons

Figure 2 for Modeling preference time in middle distance triathlons

Figure 3 for Modeling preference time in middle distance triathlons

Figure 4 for Modeling preference time in middle distance triathlons

Modeling preference time in triathlons means predicting the intermediate times of particular sports disciplines by a given overall finish time in a specific triathlon course for the athlete with the known personal best result. This is a hard task for athletes and sport trainers due to a lot of different factors that need to be taken into account, e.g., athlete's abilities, health, mental preparations and even their current sports form. So far, this process was calculated manually without any specific software tools or using the artificial intelligence. This paper presents the new solution for modeling preference time in middle distance triathlons based on particle swarm optimization algorithm and archive of existing sports results. Initial results are presented, which suggest the usefulness of proposed approach, while remarks for future improvements and use are also emphasized.

* ISCBI 2017

Via

Access Paper or Ask Questions

Robustifying $\ell_\infty$ Adversarial Training to the Union of Perturbation Models

Jun 07, 2021
Ameya D. Patil, Michael Tuttle, Alexander G. Schwing, Naresh R. Shanbhag

$Figure 1 for Robustifying $\ell_\infty$ Adversarial Training to the Union of Perturbation Models$

$Figure 2 for Robustifying $\ell_\infty$ Adversarial Training to the Union of Perturbation Models$

$Figure 3 for Robustifying $\ell_\infty$ Adversarial Training to the Union of Perturbation Models$

$Figure 4 for Robustifying $\ell_\infty$ Adversarial Training to the Union of Perturbation Models$

Classical adversarial training (AT) frameworks are designed to achieve high adversarial accuracy against a single attack type, typically $\ell_\infty$ norm-bounded perturbations. Recent extensions in AT have focused on defending against the union of multiple perturbations but this benefit is obtained at the expense of a significant (up to $10\times$) increase in training complexity over single-attack $\ell_\infty$ AT. In this work, we expand the capabilities of widely popular single-attack $\ell_\infty$ AT frameworks to provide robustness to the union of ($\ell_\infty, \ell_2, \ell_1$) perturbations while preserving their training efficiency. Our technique, referred to as Shaped Noise Augmented Processing (SNAP), exploits a well-established byproduct of single-attack AT frameworks -- the reduction in the curvature of the decision boundary of networks. SNAP prepends a given deep net with a shaped noise augmentation layer whose distribution is learned along with network parameters using any standard single-attack AT. As a result, SNAP enhances adversarial accuracy of ResNet-18 on CIFAR-10 against the union of ($\ell_\infty, \ell_2, \ell_1$) perturbations by 14%-to-20% for four state-of-the-art (SOTA) single-attack $\ell_\infty$ AT frameworks, and, for the first time, establishes a benchmark for ResNet-50 and ResNet-101 on ImageNet.

Via

Access Paper or Ask Questions