Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Raymond K. W. Wong

Matrix Completion under Low-Rank Missing Mechanism

Dec 19, 2018
Xiaojun Mao, Raymond K. W. Wong, Song Xi Chen

Figure 1 for Matrix Completion under Low-Rank Missing Mechanism

Figure 2 for Matrix Completion under Low-Rank Missing Mechanism

Figure 3 for Matrix Completion under Low-Rank Missing Mechanism

Figure 4 for Matrix Completion under Low-Rank Missing Mechanism

This paper investigates the problem of matrix completion from corrupted data, when a low-rank missing mechanism is considered. The better recovery of missing mechanism often helps completing the unobserved entries of the high-dimensional target matrix. Instead of the widely used uniform risk function, we weight the observations by inverse probabilities of observation, which are estimated through a specifically designed high-dimensional estimation procedure. Asymptotic convergence rates of the proposed estimators for both the observation probabilities and the target matrix are studied. The empirical performance of the proposed methodology is illustrated via both numerical experiments and a real data application.

* 31 pages, 2 figures

Via

Access Paper or Ask Questions

Autoencoders Learn Generative Linear Models

Jun 02, 2018
Thanh V. Nguyen, Raymond K. W. Wong, Chinmay Hegde

Figure 1 for Autoencoders Learn Generative Linear Models

Figure 2 for Autoencoders Learn Generative Linear Models

Recent progress in learning theory has led to the emergence of provable algorithms for training certain families of neural networks. Under the assumption that the training data is sampled from a suitable generative model, the weights of the trained networks obtained by these algorithms recover (either exactly or approximately) the generative model parameters. However, the large majority of these results are only applicable to supervised learning architectures. In this paper, we complement this line of work by providing a series of results for unsupervised learning with neural networks. Specifically, we study the familiar setting of shallow autoencoder architectures with shared weights. We focus on three generative models for the data: (i) the mixture-of-gaussians model, (ii) the sparse coding model, and (iii) the non-negative sparsity model. All three models are widely studied in the machine learning literature. For each of these models, we rigorously prove that under suitable choices of hyperparameters, architectures, and initialization, the autoencoder weights learned by gradient descent % -based training can successfully recover the parameters of the corresponding model. To our knowledge, this is the first result that rigorously studies the dynamics of gradient descent for weight-sharing autoencoders. Our analysis can be viewed as theoretical evidence that shallow autoencoder modules indeed can be used as unsupervised feature training mechanisms for a wide range of datasets, and may shed insight on how to train larger stacked architectures with autoencoders as basic building blocks.

* 19 pages

Via

Access Paper or Ask Questions

Matrix Completion with Noisy Entries and Outliers

Dec 27, 2017
Raymond K. W. Wong, Thomas C. M. Lee

Figure 1 for Matrix Completion with Noisy Entries and Outliers

Figure 2 for Matrix Completion with Noisy Entries and Outliers

Figure 3 for Matrix Completion with Noisy Entries and Outliers

Figure 4 for Matrix Completion with Noisy Entries and Outliers

This paper considers the problem of matrix completion when the observed entries are noisy and contain outliers. It begins with introducing a new optimization criterion for which the recovered matrix is defined as its solution. This criterion uses the celebrated Huber function from the robust statistics literature to downweigh the effects of outliers. A practical algorithm is developed to solve the optimization involved. This algorithm is fast, straightforward to implement, and monotonic convergent. Furthermore, the proposed methodology is theoretically shown to be stable in a well defined sense. Its promising empirical performance is demonstrated via a sequence of simulation experiments, including image inpainting.

* 33 pages, 2 figures

Via

Access Paper or Ask Questions

Provably Accurate Double-Sparse Coding

Dec 12, 2017
Thanh V. Nguyen, Raymond K. W. Wong, Chinmay Hegde

Figure 1 for Provably Accurate Double-Sparse Coding

Figure 2 for Provably Accurate Double-Sparse Coding

Figure 3 for Provably Accurate Double-Sparse Coding

Sparse coding is a crucial subroutine in algorithms for various signal processing, deep learning, and other machine learning applications. The central goal is to learn an overcomplete dictionary that can sparsely represent a given input dataset. However, a key challenge is that storage, transmission, and processing of the learned dictionary can be untenably high if the data dimension is high. In this paper, we consider the double-sparsity model introduced by Rubinstein et al. (2010b) where the dictionary itself is the product of a fixed, known basis and a data-adaptive sparse component. First, we introduce a simple algorithm for double-sparse coding that can be amenable to efficient implementation via neural architectures. Second, we theoretically analyze its performance and demonstrate asymptotic sample complexity and running time benefits over existing (provable) approaches for sparse coding. To our knowledge, our work introduces the first computationally efficient algorithm for double-sparse coding that enjoys rigorous statistical guarantees. Finally, we support our analysis via several numerical experiments on simulated data, confirming that our method can indeed be useful in problem sizes encountered in practical applications.

* 40 pages. An abbreviated conference version appears at AAAI 2018

Via

Access Paper or Ask Questions