"Time": models, code, and papers

Asynchronous Decentralized Federated Learning for Collaborative Fault Diagnosis of PV Stations

Feb 28, 2022
Qi Liu, Bo Yang, Zhaojian Wang, Dafeng Zhu, Xinyi Wang, Kai Ma, Xinping Guan

Due to the different losses caused by various photovoltaic (PV) array faults, accurate diagnosis of fault types is becoming increasingly important. Unlike a single station, multiple PV stations together collect sufficient fault samples, but their data cannot be shared directly because of potential conflicts of interest. Therefore, federated learning can be exploited to train a collaborative fault diagnosis model. However, the modeling efficiency is seriously affected by the model update mechanism, since each PV station has a different computing capability and amount of data. Moreover, for the safe and stable operation of the PV system, the robustness of collaborative modeling must be guaranteed rather than relying on processing at a central server. To address these challenges, a novel asynchronous decentralized federated learning (ADFL) framework is proposed. Each PV station not only trains its local model but also participates in collaborative fault diagnosis by exchanging model parameters, improving generalization without losing accuracy. The global model is aggregated in a distributed manner to avoid central node failure. By designing the asynchronous update scheme, the communication overhead and training time are greatly reduced. Both experiments and numerical simulations are carried out to verify the effectiveness of the proposed method.
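The idea of aggregating a model without a central server can be illustrated with a small sketch. This is not the ADFL algorithm from the paper: it is a simplified, synchronous neighbor-averaging round, and the station names, topology, and parameter vectors are hypothetical values chosen for illustration.

```python
def average_with_neighbors(params, neighbors):
    """One decentralized round: every station replaces its parameter
    vector with the mean of its own vector and its neighbors' vectors.
    No central server is involved."""
    updated = {}
    for station, own in params.items():
        group = [own] + [params[n] for n in neighbors[station]]
        updated[station] = [sum(vals) / len(group) for vals in zip(*group)]
    return updated


# Hypothetical setup: three stations connected in a line, A - B - C.
params = {"A": [0.0, 3.0], "B": [3.0, 0.0], "C": [6.0, 3.0]}
neighbors = {"A": ["B"], "B": ["A", "C"], "C": ["B"]}

# Repeated local averaging drives all stations to a shared model.
for _ in range(50):
    params = average_with_neighbors(params, neighbors)

print(params)
```

Because each station only talks to its neighbors, losing one node does not stop the others from aggregating. An asynchronous scheme would additionally let stations update at different times instead of in lockstep rounds.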

Evaluating data augmentation for financial time series classification

Oct 28, 2020
Elizabeth Fons, Paula Dawson, Xiao-jun Zeng, John Keane, Alexandros Iosifidis

Data augmentation methods in combination with deep neural networks have been used extensively in computer vision on classification tasks, achieving great success; however, their use in time series classification is still at an early stage. This is even more so in the field of financial prediction, where data tends to be small, noisy and non-stationary. In this paper we evaluate several augmentation methods applied to stock datasets using two state-of-the-art deep learning models. The results show that several augmentation methods significantly improve financial performance when used in combination with a trading strategy. For a relatively small dataset ($\approx30K$ samples), augmentation methods achieve up to $400\%$ improvement in risk-adjusted return performance; for a larger stock dataset ($\approx300K$ samples), results show up to $40\%$ improvement.
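The abstract does not name the augmentation methods it evaluates. As a rough illustration of what time series augmentation looks like, the sketch below applies jittering and scaling, two commonly used transformations. The choice of methods, the noise levels, and the price values are all assumptions made for this example.

```python
import random


def jitter(series, sigma, rng):
    """Add independent Gaussian noise to every point of the series."""
    return [x + rng.gauss(0.0, sigma) for x in series]


def scale(series, sigma, rng):
    """Multiply the whole series by a single random factor drawn around 1."""
    factor = rng.gauss(1.0, sigma)
    return [x * factor for x in series]


rng = random.Random(0)  # fixed seed so the example is reproducible
prices = [100.0, 101.5, 99.8, 102.3, 103.1]  # hypothetical price series

# Each augmented copy keeps the length (and label) of the original sample.
augmented = [jitter(prices, 0.05, rng), scale(prices, 0.1, rng)]
print(augmented)
```

Both transformations preserve the length of the series, so the augmented copies can be added to the training set with the label of the sample they were derived from.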

Exploring Structural Sparsity in Neural Image Compression

Feb 10, 2022
Shanzhi Yin, Fanyang Meng, Wen Tan, Chao Li, Youneng Bao, Yongsheng Liang, Wei Liu

Neural image compression has reached or outperformed traditional methods (such as JPEG, BPG, WebP). However, the sophisticated network structures, with their cascaded convolution layers, impose a heavy computational burden on practical deployment. In this paper, we explore the structural sparsity in neural image compression networks to obtain real-time acceleration without any specialized hardware design or algorithm. We propose a simple plug-in adaptive binary channel masking (ABCM) to judge the importance of each convolution channel and introduce sparsity during training. During inference, the unimportant channels are pruned to obtain a slimmer network and less computation. We implement our method in three neural image compression networks with different entropy models to verify its effectiveness and generalization. The experimental results show that up to 7x computation reduction and 3x acceleration can be achieved with a negligible performance drop.
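The pruning step at inference time can be sketched in a few lines. The abstract does not describe how ABCM learns channel importance, so the sketch below simply assumes per-channel scores are already available and thresholds them at a hypothetical value of 0.5. The function names and data are illustrative, not taken from the paper.

```python
def binary_mask(scores, threshold=0.5):
    """Turn per-channel importance scores into a 0/1 mask.
    The threshold is an assumed value, not one from the paper."""
    return [1 if s > threshold else 0 for s in scores]


def prune_channels(feature_map, mask):
    """Keep only the channels whose mask bit is 1.
    feature_map is a list of channels, each a list of activations."""
    return [channel for channel, keep in zip(feature_map, mask) if keep]


scores = [0.9, 0.1, 0.7, 0.3]                       # hypothetical importances
feature_map = [[1, 2], [3, 4], [5, 6], [7, 8]]      # 4 channels, 2 values each

mask = binary_mask(scores)
pruned = prune_channels(feature_map, mask)
print(mask, pruned)
```

Dropping whole channels, rather than individual weights, is what makes the sparsity structural: the remaining network is a smaller dense network that runs faster on ordinary hardware.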

* 5 pages, 5 figures, submitted to ICIP 2022

A physics and data co-driven surrogate modeling approach for temperature field prediction on irregular geometric domain

Mar 15, 2022
Kairui Bao, Wen Yao, Xiaoya Zhang, Wei Peng, Yu Li

In the whole aircraft structural optimization loop, thermal analysis plays a very important role. However, directly applying traditional numerical analysis tools imposes a severe computational burden, especially when each optimization iteration involves repeated parameter modification followed by thermal analysis. Recently, with the fast development of deep learning, several Convolutional Neural Network (CNN) surrogate models have been introduced to overcome this obstacle. However, for temperature field prediction on irregular geometric domains (TFP-IGD), CNNs are poorly suited, since most of them were designed to process regular images. To alleviate this difficulty, we propose a novel physics and data co-driven surrogate modeling method. First, after adapting the Bezier curve in geometric parameterization, a body-fitted coordinate mapping is introduced to generate coordinate transforms between the irregular physical plane and the regular computational plane. Second, a physics-driven CNN surrogate with partial differential equation (PDE) residuals as a loss function is utilized for fast meshing (meshing surrogate); then, we present a data-driven surrogate model based on the multi-level reduced-order method, aiming to learn solutions of the temperature field in the above regular computational plane (thermal surrogate). Finally, combining the grid position information provided by the meshing surrogate with the scalar temperature field information provided by the thermal surrogate (combined model), we reach an end-to-end surrogate model from geometric parameters to temperature field prediction on an irregular geometric domain. Numerical results demonstrate that our method can significantly improve prediction accuracy on a smaller dataset while reducing the training time when compared with other CNN methods.

Enhancing Door Detection for Autonomous Mobile Robots with Environment-Specific Data Collection

Mar 08, 2022
Michele Antonazzi, Matteo Luperto, Nicola Basilico, N. Alberto Borghese

Door detection represents a fundamental capability for autonomous mobile robots employed in tasks involving indoor navigation. Recognizing the presence of a door and its status (open or closed) can induce a remarkable impact on the navigation performance, especially for dynamic settings where doors can enable or disable passages, hence changing the actual topology of the map. In this work, we address the problem of building a door detector module for an autonomous mobile robot deployed in a long-term scenario, namely operating in the same environment for a long time, thus observing the same set of doors from different points of view. First, we show how the mainstream approach for door detection, based on object recognition, falls short in considering the constrained perception setup typical of a mobile robot. Hence, we devise a method to build a dataset of images taken from a robot's perspective and we exploit it to obtain a door detector based on an established deep-learning object-recognition method. We then exploit the long-term assumption of our scenario to qualify the model on the robot working environment via fine-tuning with additional images acquired in the deployment environment. Our experimental analysis shows how this method can achieve good performance and highlights a trade-off between costs and benefits of the fine-tuning approach.

* Preprint submitted for revision at IROS 2022

Improving Ant Colony Optimization Efficiency for Solving Large TSP Instances

Mar 04, 2022
Rafał Skinderowicz

Ant Colony Optimization (ACO) is a family of nature-inspired metaheuristics often applied to finding approximate solutions to difficult optimization problems. Despite being significantly faster than exact methods, ACO algorithms can still be prohibitively slow, especially when compared to basic problem-specific heuristics. As recent research has shown, it is possible to significantly improve performance through algorithm refinements and careful parallel implementation that benefits from multi-core CPUs and dedicated accelerators. In this paper, we present a novel ACO variant, namely the Focused ACO (FACO). One of the core elements of the FACO is a mechanism for controlling the number of differences between a newly constructed solution and a selected previous solution. The mechanism results in a more focused search process, allowing it to find improvements while preserving the quality of the existing solution. An additional benefit is a more efficient integration with a problem-specific local search. A computational study based on a range of Traveling Salesman Problem (TSP) instances shows that the FACO outperforms state-of-the-art ACO algorithms when solving large TSP instances. Specifically, the FACO required less than an hour of 8-core commodity CPU time to find high-quality solutions (within 1% of the best-known results) for TSP Art Instances ranging from 100000 to 200000 nodes.
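To make "the number of differences between a newly constructed and a selected previous solution" concrete, one plausible measure for TSP tours is the number of edges in the new tour that the previous tour does not contain. The abstract does not say how FACO measures or limits these differences, so the edge-based count below is an assumption, and the tours are hypothetical.

```python
def tour_edges(tour):
    """Return the set of undirected edges of a closed tour."""
    n = len(tour)
    return {frozenset((tour[i], tour[(i + 1) % n])) for i in range(n)}


def count_new_edges(new_tour, previous_tour):
    """Count edges present in new_tour but absent from previous_tour."""
    return len(tour_edges(new_tour) - tour_edges(previous_tour))


previous = [0, 1, 2, 3, 4, 5]
candidate = [0, 1, 3, 2, 4, 5]  # cities 2 and 3 swapped

# Edges 1-3 and 2-4 are new; the remaining four edges are shared.
differences = count_new_edges(candidate, previous)
print(differences)
```

A construction procedure could stop introducing new edges once such a count reaches a chosen budget, which keeps most of the previous solution intact and leaves only a small region for local search to examine.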

* Applied Soft Computing, 2022, 108653, ISSN 1568-4946
* 28 pages, 10 figures, 5 tables

Oracle-Efficient Online Learning for Beyond Worst-Case Adversaries

Mar 08, 2022
Nika Haghtalab, Yanjun Han, Abhishek Shetty, Kunhe Yang

In this paper, we study oracle-efficient algorithms for beyond worst-case analysis of online learning. We focus on two settings. First, the smoothed analysis setting of [RST11, HRS21], where an adversary is constrained to generating samples from distributions whose density is upper bounded by $1/\sigma$ times the uniform density. Second, the setting of $K$-hint transductive learning, where the learner is given access to $K$ hints per time step that are guaranteed to include the true instance. We give the first known oracle-efficient algorithms for both settings that depend only on the VC dimension of the class and the parameters $\sigma$ and $K$ that capture the power of the adversary. In particular, we achieve oracle-efficient regret bounds of $O(\sqrt{T d \sigma^{-1/2}})$ and $O(\sqrt{T d K})$, respectively, for these settings. For the smoothed analysis setting, our results give the first oracle-efficient algorithm for online learning with smoothed adversaries [HRS21]. This contrasts with the computational separation between online learning with worst-case adversaries and offline learning established by [HK16]. Our algorithms also achieve improved bounds for the worst-case setting with small domains. In particular, we give an oracle-efficient algorithm with regret of $O(\sqrt{T (d \vert\mathcal{X}\vert)^{1/2}})$, which is a refinement of the earlier $O(\sqrt{T \vert\mathcal{X}\vert})$ bound by [DS16].

* Under submission

Noisy Low-rank Matrix Optimization: Geometry of Local Minima and Convergence Rate

Mar 08, 2022
Ziye Ma, Somayeh Sojoudi

This paper is concerned with low-rank matrix optimization, which has found a wide range of applications in machine learning. This problem in the special case of matrix sensing has been studied extensively through the notion of Restricted Isometry Property (RIP), leading to a wealth of results on the geometric landscape of the problem and the convergence rate of common algorithms. However, the existing results are not able to handle the problem with a general objective function subject to noisy data. In this paper, we address this problem by developing a mathematical framework that can deal with random corruptions to general objective functions, where the noise model is arbitrary. We prove that as long as the RIP constant of the noiseless objective is less than $1/3$, any spurious local solution of the noisy optimization problem must be close to the ground truth solution. By working through the strict saddle property, we also show that an approximate solution can be found in polynomial time. We characterize the geometry of the spurious local minima of the problem in a local region around the ground truth in the case when the RIP constant is greater than $1/3$. This paper offers the first set of results on the global and local optimization landscapes of general low-rank optimization problems under arbitrary random corruptions.
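For readers unfamiliar with the RIP constant mentioned above, the standard definition for the matrix sensing special case is recalled here. The symbols $\mathcal{A}$, $M$, $r$, and $\delta$ are notation introduced for this note, and the paper's own definition for general objectives may differ in form.

```latex
% A linear measurement operator \mathcal{A} satisfies the rank-r RIP
% with constant \delta \in [0, 1) if, for every matrix M with rank(M) <= r,
\[
  (1 - \delta)\,\|M\|_F^2 \;\le\; \|\mathcal{A}(M)\|_2^2 \;\le\; (1 + \delta)\,\|M\|_F^2 .
\]
```

A smaller $\delta$ means the measurements nearly preserve the Frobenius norm of low-rank matrices, which is why thresholds on $\delta$ such as $1/3$ translate into statements about the optimization landscape.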

ART-Point: Improving Rotation Robustness of Point Cloud Classifiers via Adversarial Rotation

Mar 08, 2022
Robin Wang, Yibo Yang, Dacheng Tao

Point cloud classifiers with rotation robustness have been widely discussed in the 3D deep learning community. Most proposed methods either use rotation invariant descriptors as inputs or try to design rotation equivariant networks. However, robust models generated by these methods have limited performance under clean aligned datasets due to modifications on the original classifiers or input space. In this study, for the first time, we show that the rotation robustness of point cloud classifiers can also be acquired via adversarial training with better performance on both rotated and clean datasets. Specifically, our proposed framework named ART-Point regards the rotation of the point cloud as an attack and improves rotation robustness by training the classifier on inputs with Adversarial RoTations. We contribute an axis-wise rotation attack that uses back-propagated gradients of the pre-trained model to effectively find the adversarial rotations. To avoid model over-fitting on adversarial inputs, we construct rotation pools that leverage the transferability of adversarial rotations among samples to increase the diversity of training data. Moreover, we propose a fast one-step optimization to efficiently reach the final robust model. Experiments show that our proposed rotation attack achieves a high success rate and ART-Point can be used on most existing classifiers to improve the rotation robustness while showing better performance on clean datasets than state-of-the-art methods.
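The basic operation behind an axis-wise rotation, rotating every point of a cloud about one coordinate axis, can be sketched as follows. This covers only the geometric building block: the gradient-based search for adversarial angles described in the abstract requires a trained classifier and is not shown. The function name and the sample cloud are hypothetical.

```python
import math


def rotate(points, axis, angle):
    """Rotate 3D points about the x, y, or z axis by `angle` radians
    (right-handed, counter-clockwise when looking down the axis)."""
    c, s = math.cos(angle), math.sin(angle)
    rotated = []
    for x, y, z in points:
        if axis == "x":
            rotated.append((x, c * y - s * z, s * y + c * z))
        elif axis == "y":
            rotated.append((c * x + s * z, y, -s * x + c * z))
        elif axis == "z":
            rotated.append((c * x - s * y, s * x + c * y, z))
        else:
            raise ValueError("axis must be 'x', 'y', or 'z'")
    return rotated


cloud = [(1.0, 0.0, 0.0), (0.0, 2.0, 0.0)]  # hypothetical two-point cloud
turned = rotate(cloud, "z", math.pi / 2)     # quarter turn about z
print(turned)
```

A rotation changes the coordinates fed to the classifier but not the shape itself, which is why a classifier that changes its prediction under rotation can be regarded as having been attacked.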

* CVPR 2022

Machine Learning Methods for Spectral Efficiency Prediction in Massive MIMO Systems

Dec 29, 2021
Evgeny Bobrov, Sergey Troshin, Nadezhda Chirkova, Ekaterina Lobacheva, Sviatoslav Panchenko, Dmitry Vetrov, Dmitry Kropotov

Channel decoding, channel detection, channel assessment, and resource management for wireless multiple-input multiple-output (MIMO) systems are all examples of problems where machine learning (ML) can be successfully applied. In this paper, we study several ML approaches to solve the problem of estimating the spectral efficiency (SE) value for a certain precoding scheme, preferably in the shortest possible time. The best results in terms of mean absolute percentage error (MAPE) are obtained with gradient boosting over sorted features, while linear models demonstrate worse prediction quality. Neural networks perform similarly to gradient boosting, but they are more resource- and time-consuming because of hyperparameter tuning and frequent retraining. We investigate the practical applicability of the proposed algorithms in a wide range of scenarios generated by the Quadriga simulator. In almost all scenarios, the MAPE achieved using gradient boosting and neural networks is less than 10%.
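The MAPE metric used above is read here in its usual sense: the mean of the absolute percentage errors. The sketch below shows the computation; the spectral efficiency values are made up for illustration, and the paper may compute the metric with slightly different conventions.

```python
def mape(actual, predicted):
    """Mean of |actual - predicted| / |actual|, expressed in percent.
    Assumes no actual value is zero."""
    errors = [abs((a - p) / a) for a, p in zip(actual, predicted)]
    return 100.0 * sum(errors) / len(errors)


actual = [10.0, 20.0, 40.0]      # hypothetical true SE values
predicted = [11.0, 18.0, 40.0]   # hypothetical model outputs

# Per-sample errors are 10%, 10%, and 0%, so the mean is about 6.67%.
score = mape(actual, predicted)
print(score)
```

Because each error is divided by the true value, MAPE is scale-free, which makes a threshold such as 10% comparable across scenarios with very different spectral efficiency ranges.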

* To appear in Optimization Methods & Software, 22 pages, 10 figures, 2 tables
