Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

"Information": models, code, and papers

StockBabble: A Conversational Financial Agent to support Stock Market Investors

Jun 15, 2021
Suraj Sharma, Joseph Brennan, Jason R. C. Nurse

Figure 1 for StockBabble: A Conversational Financial Agent to support Stock Market Investors

Figure 2 for StockBabble: A Conversational Financial Agent to support Stock Market Investors

Figure 3 for StockBabble: A Conversational Financial Agent to support Stock Market Investors

We introduce StockBabble, a conversational agent designed to support understanding and engagement with the stock market. StockBabble's value and novelty is in its ability to empower retail investors -- many of which may be new to investing -- and supplement their informational needs using a user-friendly agent. Users have the ability to query information on companies to retrieve a general and financial overview of a stock, including accessing the latest news and trading recommendations. They can also request charts which contain live prices and technical investment indicators, and add shares to a personal portfolio to allow performance monitoring over time. To evaluate our agent's potential, we conducted a user study with 15 participants. In total, 73% (11/15) of respondents said that they felt more confident in investing after using StockBabble, and all 15 would consider recommending it to others. These results are encouraging and suggest a wider appeal for such agents. Moreover, we believe this research can help to inform the design and development of future intelligent, financial personal assistants.

* CUI 2021 - 3rd Conference on Conversational User Interfaces

Via

Access Paper or Ask Questions

CRFL: Certifiably Robust Federated Learning against Backdoor Attacks

Jun 15, 2021
Chulin Xie, Minghao Chen, Pin-Yu Chen, Bo Li

Figure 1 for CRFL: Certifiably Robust Federated Learning against Backdoor Attacks

Figure 2 for CRFL: Certifiably Robust Federated Learning against Backdoor Attacks

Figure 3 for CRFL: Certifiably Robust Federated Learning against Backdoor Attacks

Figure 4 for CRFL: Certifiably Robust Federated Learning against Backdoor Attacks

Federated Learning (FL) as a distributed learning paradigm that aggregates information from diverse clients to train a shared global model, has demonstrated great success. However, malicious clients can perform poisoning attacks and model replacement to introduce backdoors into the trained global model. Although there have been intensive studies designing robust aggregation methods and empirical robust federated training protocols against backdoors, existing approaches lack robustness certification. This paper provides the first general framework, Certifiably Robust Federated Learning (CRFL), to train certifiably robust FL models against backdoors. Our method exploits clipping and smoothing on model parameters to control the global model smoothness, which yields a sample-wise robustness certification on backdoors with limited magnitude. Our certification also specifies the relation to federated learning parameters, such as poisoning ratio on instance level, number of attackers, and training iterations. Practically, we conduct comprehensive experiments across a range of federated datasets, and provide the first benchmark for certified robustness against backdoor attacks in federated learning. Our code is available at https://github.com/AI-secure/CRFL.

* ICML 2021

Via

Access Paper or Ask Questions

Augmented Tensor Decomposition with Stochastic Optimization

Jun 22, 2021
Chaoqi Yang, Cheng Qian, Navjot Singh, Cao Xiao, M Brandon Westover, Edgar Solomonik, Jimeng Sun

Figure 1 for Augmented Tensor Decomposition with Stochastic Optimization

Figure 2 for Augmented Tensor Decomposition with Stochastic Optimization

Figure 3 for Augmented Tensor Decomposition with Stochastic Optimization

Figure 4 for Augmented Tensor Decomposition with Stochastic Optimization

Tensor decompositions are powerful tools for dimensionality reduction and feature interpretation of multidimensional data such as signals. Existing tensor decomposition objectives (e.g., Frobenius norm) are designed for fitting raw data under statistical assumptions, which may not align with downstream classification tasks. Also, real-world tensor data are usually high-ordered and have large dimensions with millions or billions of entries. Thus, it is expensive to decompose the whole tensor with traditional algorithms. In practice, raw tensor data also contains redundant information while data augmentation techniques may be used to smooth out noise in samples. This paper addresses the above challenges by proposing augmented tensor decomposition (ATD), which effectively incorporates data augmentations to boost downstream classification. To reduce the memory footprint of the decomposition, we propose a stochastic algorithm that updates the factor matrices in a batch fashion. We evaluate ATD on multiple signal datasets. It shows comparable or better performance (e.g., up to 15% in accuracy) over self-supervised and autoencoder baselines with less than 5% of model parameters, achieves 0.6% ~ 1.3% accuracy gain over other tensor-based baselines, and reduces the memory footprint by 9X when compared to standard tensor decomposition algorithms.

* This paper proposes augmented tensor decomposition framework and use stochastic algorithm for optimization

Via

Access Paper or Ask Questions

Graph-based Label Propagation for Semi-Supervised Speaker Identification

Jun 15, 2021
Long Chen, Venkatesh Ravichandran, Andreas Stolcke

Figure 1 for Graph-based Label Propagation for Semi-Supervised Speaker Identification

Figure 2 for Graph-based Label Propagation for Semi-Supervised Speaker Identification

Figure 3 for Graph-based Label Propagation for Semi-Supervised Speaker Identification

Figure 4 for Graph-based Label Propagation for Semi-Supervised Speaker Identification

Speaker identification in the household scenario (e.g., for smart speakers) is typically based on only a few enrollment utterances but a much larger set of unlabeled data, suggesting semisupervised learning to improve speaker profiles. We propose a graph-based semi-supervised learning approach for speaker identification in the household scenario, to leverage the unlabeled speech samples. In contrast to most of the works in speaker recognition that focus on speaker-discriminative embeddings, this work focuses on speaker label inference (scoring). Given a pre-trained embedding extractor, graph-based learning allows us to integrate information about both labeled and unlabeled utterances. Considering each utterance as a graph node, we represent pairwise utterance similarity scores as edge weights. Graphs are constructed per household, and speaker identities are propagated to unlabeled nodes to optimize a global consistency criterion. We show in experiments on the VoxCeleb dataset that this approach makes effective use of unlabeled data and improves speaker identification accuracy compared to two state-of-the-art scoring methods as well as their semi-supervised variants based on pseudo-labels.

* To appear in Interspeech 2021

Via

Access Paper or Ask Questions

Error Loss Networks

Jun 07, 2021
Badong Chen, Yunfei Zheng, Pengju Ren

A novel model called error loss network (ELN) is proposed to build an error loss function for supervised learning. The ELN is in structure similar to a RBF neural network, but its input is an error sample and output is a loss corresponding to that error sample. That means the nonlinear input-output mapper of ELN creates an error loss function. The proposed ELN provides a unified model for a large class of error loss functions, which includes some information theoretic learning (ITL) loss functions as special cases. The activation function, weight parameters and network size of the ELN can be predetermined or learned from the error samples. On this basis, we propose a new machine learning paradigm where the learning process is divided into two stages: first, learning a loss function using an ELN; second, using the learned loss function to continue to perform the learning. Experimental results are presented to demonstrate the desirable performance of the new method.

Via

Access Paper or Ask Questions

What Makes for Hierarchical Vision Transformer?

Jul 05, 2021
Yuxin Fang, Xinggang Wang, Rui Wu, Jianwei Niu, Wenyu Liu

Figure 1 for What Makes for Hierarchical Vision Transformer?

Figure 2 for What Makes for Hierarchical Vision Transformer?

Figure 3 for What Makes for Hierarchical Vision Transformer?

Figure 4 for What Makes for Hierarchical Vision Transformer?

Recent studies show that hierarchical Vision Transformer with interleaved non-overlapped intra window self-attention \& shifted window self-attention is able to achieve state-of-the-art performance in various visual recognition tasks and challenges CNN's dense sliding window paradigm. Most follow-up works try to replace shifted window operation with other kinds of cross window communication while treating self-attention as the de-facto standard for intra window information aggregation. In this short preprint, we question whether self-attention is the only choice for hierarchical Vision Transformer to attain strong performance, and what makes for hierarchical Vision Transformer? We replace self-attention layers in Swin Transformer and Shuffle Transformer with simple linear mapping and keep other components unchanged. The resulting architecture with 25.4M parameters and 4.2G FLOPs achieves 80.5\% Top-1 accuracy, compared to 81.3\% for Swin Transformer with 28.3M parameters and 4.5G FLOPs. We also experiment with other alternatives to self-attention for context aggregation inside each non-overlapped window, which all give similar competitive results under the same architecture. Our study reveals that the \textbf{macro architecture} of Swin model families (i.e., interleaved intra window \& cross window communications), other than specific aggregation layers or specific means of cross window communication, may be more responsible for its strong performance and is the real challenger to CNN's dense sliding window paradigm.

* Preprint. Work in progress

Via

Access Paper or Ask Questions

CAIM: Cooperative Angle of Arrival Estimation using the Ising Method

Apr 27, 2021
Shiva Akbari, Shahrokh Valaee

Figure 1 for CAIM: Cooperative Angle of Arrival Estimation using the Ising Method

Figure 2 for CAIM: Cooperative Angle of Arrival Estimation using the Ising Method

Figure 3 for CAIM: Cooperative Angle of Arrival Estimation using the Ising Method

Figure 4 for CAIM: Cooperative Angle of Arrival Estimation using the Ising Method

This paper proposes a cooperative angle-of-arrival(AoA) estimation, taking advantage of co-processing channel state information (CSI) from a group of access points that receive signals of the same source. Since received signals are sparse, we use Compressive Sensing (CS) to address the AoA estimation problem. We formulate this problem as a penalized l0-norm minimization, reformulate it as an Ising energy problem, and solve it using Markov Chain Monte Carlo (MCMC). Simulation results show that our proposed method outperforms the existing methods in the literature.

Via

Access Paper or Ask Questions

Probabilistic Inference for Structural Health Monitoring: New Modes of Learning from Data

Mar 02, 2021
Lawrence A. Bull, Paul Gardner, Timothy J. Rogers, Elizabeth J. Cross, Nikolaos Dervilis, Keith Worden

Figure 1 for Probabilistic Inference for Structural Health Monitoring: New Modes of Learning from Data

Figure 2 for Probabilistic Inference for Structural Health Monitoring: New Modes of Learning from Data

Figure 3 for Probabilistic Inference for Structural Health Monitoring: New Modes of Learning from Data

Figure 4 for Probabilistic Inference for Structural Health Monitoring: New Modes of Learning from Data

In data-driven SHM, the signals recorded from systems in operation can be noisy and incomplete. Data corresponding to each of the operational, environmental, and damage states are rarely available a priori; furthermore, labelling to describe the measurements is often unavailable. In consequence, the algorithms used to implement SHM should be robust and adaptive, while accommodating for missing information in the training-data -- such that new information can be included if it becomes available. By reviewing novel techniques for statistical learning (introduced in previous work), it is argued that probabilistic algorithms offer a natural solution to the modelling of SHM data in practice. In three case-studies, probabilistic methods are adapted for applications to SHM signals -- including semi-supervised learning, active learning, and multi-task learning.

* ASCE-ASME Journal of Risk and Uncertainty in Engineering Systems, Part A: Civil Engineering 7.1 (2021): 03120003
* This material may be downloaded for personal use only. Any other use requires prior permission of the American Society of Civil Engineers. This material may be found at https://doi.org/10.1061/AJRUA6.0001106

Via

Access Paper or Ask Questions

Graph Similarity Description: How Are These Graphs Similar?

May 29, 2021
Corinna Coupette, Jilles Vreeken

Figure 1 for Graph Similarity Description: How Are These Graphs Similar?

Figure 2 for Graph Similarity Description: How Are These Graphs Similar?

Figure 3 for Graph Similarity Description: How Are These Graphs Similar?

Figure 4 for Graph Similarity Description: How Are These Graphs Similar?

How do social networks differ across platforms? How do information networks change over time? Answering questions like these requires us to compare two or more graphs. This task is commonly treated as a measurement problem, but numerical answers give limited insight. Here, we argue that if the goal is to gain understanding, we should treat graph similarity assessment as a description problem instead. We formalize this problem as a model selection task using the Minimum Description Length principle, capturing the similarity of the input graphs in a common model and the differences between them in transformations to individual models. To discover good models, we propose Momo, which breaks the problem into two parts and introduces efficient algorithms for each. Through an extensive set of experiments on a wide range of synthetic and real-world graphs, we confirm that Momo works well in practice.

* 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2021); 9+2 pages, 9+4 figures

Via

Access Paper or Ask Questions

Deep Neural Network Modeling of Unknown Partial Differential Equations in Nodal Space

Jun 07, 2021
Zhen Chen, Victor Churchill, Kailiang Wu, Dongbin Xiu

Figure 1 for Deep Neural Network Modeling of Unknown Partial Differential Equations in Nodal Space

Figure 2 for Deep Neural Network Modeling of Unknown Partial Differential Equations in Nodal Space

Figure 3 for Deep Neural Network Modeling of Unknown Partial Differential Equations in Nodal Space

Figure 4 for Deep Neural Network Modeling of Unknown Partial Differential Equations in Nodal Space

We present a numerical framework for deep neural network (DNN) modeling of unknown time-dependent partial differential equations (PDE) using their trajectory data. Unlike the recent work of [Wu and Xiu, J. Comput. Phys. 2020], where the learning takes place in modal/Fourier space, the current method conducts the learning and modeling in physical space and uses measurement data as nodal values. We present a DNN structure that has a direct correspondence to the evolution operator of the underlying PDE, thus establishing the existence of the DNN model. The DNN model also does not require any geometric information of the data nodes. Consequently, a trained DNN defines a predictive model for the underlying unknown PDE over structureless grids. A set of examples, including linear and nonlinear scalar PDE, system of PDEs, in both one dimension and two dimensions, over structured and unstructured grids, are presented to demonstrate the effectiveness of the proposed DNN modeling. Extension to other equations such as differential-integral equations is also discussed.

Via

Access Paper or Ask Questions