Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Andrzej Cichocki

L3C-Stereo: Lossless Compression for Stereo Images

Aug 21, 2021

Zihao Huang, Zhe Sun, Feng Duan, Andrzej Cichocki, Peiying Ruan, Chao Li

Figure 1 for L3C-Stereo: Lossless Compression for Stereo Images

Figure 2 for L3C-Stereo: Lossless Compression for Stereo Images

Figure 3 for L3C-Stereo: Lossless Compression for Stereo Images

Figure 4 for L3C-Stereo: Lossless Compression for Stereo Images

Abstract:A large number of autonomous driving tasks need high-definition stereo images, which requires a large amount of storage space. Efficiently executing lossless compression has become a practical problem. Commonly, it is hard to make accurate probability estimates for each pixel. To tackle this, we propose L3C-Stereo, a multi-scale lossless compression model consisting of two main modules: the warping module and the probability estimation module. The warping module takes advantage of two view feature maps from the same domain to generate a disparity map, which is used to reconstruct the right view so as to improve the confidence of the probability estimate of the right view. The probability estimation module provides pixel-wise logistic mixture distributions for adaptive arithmetic coding. In the experiments, our method outperforms the hand-crafted compression methods and the learning-based method on all three datasets used. Then, we show that a better maximum disparity can lead to a better compression effect. Furthermore, thanks to a compression property of our model, it naturally generates a disparity map of an acceptable quality for the subsequent stereo tasks.

Via

Access Paper or Ask Questions

Meta-Solver for Neural Ordinary Differential Equations

Mar 15, 2021

Julia Gusak, Alexandr Katrutsa, Talgat Daulbaev, Andrzej Cichocki, Ivan Oseledets

Figure 1 for Meta-Solver for Neural Ordinary Differential Equations

Figure 2 for Meta-Solver for Neural Ordinary Differential Equations

Figure 3 for Meta-Solver for Neural Ordinary Differential Equations

Figure 4 for Meta-Solver for Neural Ordinary Differential Equations

Abstract:A conventional approach to train neural ordinary differential equations (ODEs) is to fix an ODE solver and then learn the neural network's weights to optimize a target loss function. However, such an approach is tailored for a specific discretization method and its properties, which may not be optimal for the selected application and yield the overfitting to the given solver. In our paper, we investigate how the variability in solvers' space can improve neural ODEs performance. We consider a family of Runge-Kutta methods that are parameterized by no more than two scalar variables. Based on the solvers' properties, we propose an approach to decrease neural ODEs overfitting to the pre-defined solver, along with a criterion to evaluate such behaviour. Moreover, we show that the right choice of solver parameterization can significantly affect neural ODEs models in terms of robustness to adversarial attacks. Recently it was shown that neural ODEs demonstrate superiority over conventional CNNs in terms of robustness. Our work demonstrates that the model robustness can be further improved by optimizing solver choice for a given task. The source code to reproduce our experiments is available at https://github.com/juliagusak/neural-ode-metasolver.

Via

Access Paper or Ask Questions

Improving EEG Decoding via Clustering-based Multi-task Feature Learning

Dec 12, 2020

Yu Zhang, Tao Zhou, Wei Wu, Hua Xie, Hongru Zhu, Guoxu Zhou, Andrzej Cichocki

Figure 1 for Improving EEG Decoding via Clustering-based Multi-task Feature Learning

Figure 2 for Improving EEG Decoding via Clustering-based Multi-task Feature Learning

Figure 3 for Improving EEG Decoding via Clustering-based Multi-task Feature Learning

Figure 4 for Improving EEG Decoding via Clustering-based Multi-task Feature Learning

Abstract:Accurate electroencephalogram (EEG) pattern decoding for specific mental tasks is one of the key steps for the development of brain-computer interface (BCI), which is quite challenging due to the considerably low signal-to-noise ratio of EEG collected at the brain scalp. Machine learning provides a promising technique to optimize EEG patterns toward better decoding accuracy. However, existing algorithms do not effectively explore the underlying data structure capturing the true EEG sample distribution, and hence can only yield a suboptimal decoding accuracy. To uncover the intrinsic distribution structure of EEG data, we propose a clustering-based multi-task feature learning algorithm for improved EEG pattern decoding. Specifically, we perform affinity propagation-based clustering to explore the subclasses (i.e., clusters) in each of the original classes, and then assign each subclass a unique label based on a one-versus-all encoding strategy. With the encoded label matrix, we devise a novel multi-task learning algorithm by exploiting the subclass relationship to jointly optimize the EEG pattern features from the uncovered subclasses. We then train a linear support vector machine with the optimized features for EEG pattern decoding. Extensive experimental studies are conducted on three EEG datasets to validate the effectiveness of our algorithm in comparison with other state-of-the-art approaches. The improved experimental results demonstrate the outstanding superiority of our algorithm, suggesting its prominent performance for EEG pattern decoding in BCI applications.

Via

Access Paper or Ask Questions

Deep Learning in EEG: Advance of the Last Ten-Year Critical Period

Nov 22, 2020

Shu Gong, Kaibo Xing, Andrzej Cichocki, Junhua Li

Figure 1 for Deep Learning in EEG: Advance of the Last Ten-Year Critical Period

Figure 2 for Deep Learning in EEG: Advance of the Last Ten-Year Critical Period

Figure 3 for Deep Learning in EEG: Advance of the Last Ten-Year Critical Period

Figure 4 for Deep Learning in EEG: Advance of the Last Ten-Year Critical Period

Abstract:Deep learning has achieved excellent performance in a wide range of domains, especially in speech recognition and computer vision. Relatively less work has been done for EEG, but there is still significant progress attained in the last decade. Due to the lack of a comprehensive survey for deep learning in EEG, we attempt to summarize recent progress to provide an overview, as well as perspectives for future developments. We first briefly mention the artifacts removal for EEG signal and then introduce deep learning models that have been utilized in EEG processing and classification. Subsequently, the applications of deep learning in EEG are reviewed by categorizing them into groups such as brain-computer interface, disease detection, and emotion recognition. They are followed by the discussion, in which the pros and cons of deep learning are presented and future directions and challenges for deep learning in EEG are proposed. We hope that this paper could serve as a summary of past work for deep learning in EEG and the beginning of further developments and achievements of EEG studies based on deep learning.

Via

Access Paper or Ask Questions

On Multiple Intelligences and Learning Styles for Artificial Intelligence Systems: Future Research Trends in AI with a Human Face?

Aug 30, 2020

Andrzej Cichocki

Figure 1 for On Multiple Intelligences and Learning Styles for Artificial Intelligence Systems: Future Research Trends in AI with a Human Face?

Abstract:This article discusses recent trends and concepts in developing new kinds of artificial intelligence (AI) systems which relate to complex facets and different types of human intelligence, especially social, emotional, attentional and ethical intelligence, which to date have been under-discussed. We describe various aspects of multiple human intelligence and learning styles, which may impact on a variety of AI problem domains. Using the concept of multiple intelligence rather than a single type of intelligence, we categorize and provide working definitions of various AI depending on their cognitive skills or capacities. Future AI systems will be able not only to communicate with human actors and each other, but also to efficiently exchange knowledge with abilities of cooperation, collaboration and even co-creating something new and valuable and have meta-learning capacities. Multi-agent systems such as these can be used to solve problems that would be difficult to solve by any individual intelligent agent.

* 19 Figures, 27 pages

Via

Access Paper or Ask Questions

Stable Low-rank Tensor Decomposition for Compression of Convolutional Neural Network

Aug 12, 2020

Anh-Huy Phan, Konstantin Sobolev, Konstantin Sozykin, Dmitry Ermilov, Julia Gusak, Petr Tichavsky, Valeriy Glukhov, Ivan Oseledets, Andrzej Cichocki

Figure 1 for Stable Low-rank Tensor Decomposition for Compression of Convolutional Neural Network

Figure 2 for Stable Low-rank Tensor Decomposition for Compression of Convolutional Neural Network

Figure 3 for Stable Low-rank Tensor Decomposition for Compression of Convolutional Neural Network

Figure 4 for Stable Low-rank Tensor Decomposition for Compression of Convolutional Neural Network

Abstract:Most state of the art deep neural networks are overparameterized and exhibit a high computational cost. A straightforward approach to this problem is to replace convolutional kernels with its low-rank tensor approximations, whereas the Canonical Polyadic tensor Decomposition is one of the most suited models. However, fitting the convolutional tensors by numerical optimization algorithms often encounters diverging components, i.e., extremely large rank-one tensors but canceling each other. Such degeneracy often causes the non-interpretable result and numerical instability for the neural network fine-tuning. This paper is the first study on degeneracy in the tensor decomposition of convolutional kernels. We present a novel method, which can stabilize the low-rank approximation of convolutional kernels and ensure efficient compression while preserving the high-quality performance of the neural networks. We evaluate our approach on popular CNN architectures for image classification and show that our method results in much lower accuracy degradation and provides consistent performance.

* This paper is accepted to ECCV2020

Via

Access Paper or Ask Questions

Towards Understanding Normalization in Neural ODEs

Apr 27, 2020

Julia Gusak, Larisa Markeeva, Talgat Daulbaev, Alexandr Katrutsa, Andrzej Cichocki, Ivan Oseledets

Figure 1 for Towards Understanding Normalization in Neural ODEs

Abstract:Normalization is an important and vastly investigated technique in deep learning. However, its role for Ordinary Differential Equation based networks (neural ODEs) is still poorly understood. This paper investigates how different normalization techniques affect the performance of neural ODEs. Particularly, we show that it is possible to achieve 93% accuracy in the CIFAR-10 classification task, and to the best of our knowledge, this is the highest reported accuracy among neural ODEs tested on this problem.

Via

Access Paper or Ask Questions

Interpolated Adjoint Method for Neural ODEs

Mar 11, 2020

Talgat Daulbaev, Alexandr Katrutsa, Larisa Markeeva, Julia Gusak, Andrzej Cichocki, Ivan Oseledets

Figure 1 for Interpolated Adjoint Method for Neural ODEs

Figure 2 for Interpolated Adjoint Method for Neural ODEs

Figure 3 for Interpolated Adjoint Method for Neural ODEs

Figure 4 for Interpolated Adjoint Method for Neural ODEs

Abstract:In this paper, we propose a method, which allows us to alleviate or completely avoid the notorious problem of numerical instability and stiffness of the adjoint method for training neural ODE. On the backward pass, we propose to use the machinery of smooth function interpolation to restore the trajectory obtained during the forward integration. We show the viability of our approach, both in theory and practice.

Via

Access Paper or Ask Questions

Using Reinforcement Learning in the Algorithmic Trading Problem

Feb 26, 2020

Evgeny Ponomarev, Ivan Oseledets, Andrzej Cichocki

Figure 1 for Using Reinforcement Learning in the Algorithmic Trading Problem

Figure 2 for Using Reinforcement Learning in the Algorithmic Trading Problem

Figure 3 for Using Reinforcement Learning in the Algorithmic Trading Problem

Figure 4 for Using Reinforcement Learning in the Algorithmic Trading Problem

Abstract:The development of reinforced learning methods has extended application to many areas including algorithmic trading. In this paper trading on the stock exchange is interpreted into a game with a Markov property consisting of states, actions, and rewards. A system for trading the fixed volume of a financial instrument is proposed and experimentally tested; this is based on the asynchronous advantage actor-critic method with the use of several neural network architectures. The application of recurrent layers in this approach is investigated. The experiments were performed on real anonymized data. The best architecture demonstrated a trading strategy for the RTS Index futures (MOEX:RTSI) with a profitability of 66% per annum accounting for commission. The project source code is available via the following link: http://github.com/evgps/a3c_trading.

* ISSN 1064-2269, Journal of Communications Technology and Electronics, 2019, Vol. 64, No. 12, pp. 1450-1457

Via

Access Paper or Ask Questions

Block Hankel Tensor ARIMA for Multiple Short Time Series Forecasting

Feb 25, 2020

Qiquan Shi, Jiaming Yin, Jiajun Cai, Andrzej Cichocki, Tatsuya Yokota, Lei Chen, Mingxuan Yuan, Jia Zeng

Figure 1 for Block Hankel Tensor ARIMA for Multiple Short Time Series Forecasting

Figure 2 for Block Hankel Tensor ARIMA for Multiple Short Time Series Forecasting

Figure 3 for Block Hankel Tensor ARIMA for Multiple Short Time Series Forecasting

Figure 4 for Block Hankel Tensor ARIMA for Multiple Short Time Series Forecasting

Abstract:This work proposes a novel approach for multiple time series forecasting. At first, multi-way delay embedding transform (MDT) is employed to represent time series as low-rank block Hankel tensors (BHT). Then, the higher-order tensors are projected to compressed core tensors by applying Tucker decomposition. At the same time, the generalized tensor Autoregressive Integrated Moving Average (ARIMA) is explicitly used on consecutive core tensors to predict future samples. In this manner, the proposed approach tactically incorporates the unique advantages of MDT tensorization (to exploit mutual correlations) and tensor ARIMA coupled with low-rank Tucker decomposition into a unified framework. This framework exploits the low-rank structure of block Hankel tensors in the embedded space and captures the intrinsic correlations among multiple TS, which thus can improve the forecasting results, especially for multiple short time series. Experiments conducted on three public datasets and two industrial datasets verify that the proposed BHT-ARIMA effectively improves forecasting accuracy and reduces computational cost compared with the state-of-the-art methods.

* Accepted by AAAI 2020

Via

Access Paper or Ask Questions