Abstract:We consider the problem of recovering a complete (i.e., square and invertible) matrix $\mathbf A_0$, from $\mathbf Y \in \mathbb{R}^{n \times p}$ with $\mathbf Y = \mathbf A_0 \mathbf X_0$, provided $\mathbf X_0$ is sufficiently sparse. This recovery problem is central to the theoretical understanding of dictionary learning, which seeks a sparse representation for a collection of input signals and finds numerous applications in modern signal processing and machine learning. We give the first efficient algorithm that provably recovers $\mathbf A_0$ when $\mathbf X_0$ has $O(n)$ nonzeros per column, under a suitable probability model for $\mathbf X_0$. Our algorithmic pipeline centers around solving a certain nonconvex optimization problem with a spherical constraint, and hence is naturally phrased in the language of manifold optimization. In a companion paper (arXiv:1511.03607), we showed that with high probability our nonconvex formulation has no "spurious" local minimizers and that around any saddle point the objective function has negative directional curvature. In this paper, we take advantage of this particular geometric structure and describe a Riemannian trust-region algorithm that provably converges to a local minimizer from arbitrary initializations. Such minimizers give excellent approximations to the rows of $\mathbf X_0$. The rows are then recovered by linear programming rounding and deflation.
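For concreteness, the rounding step mentioned at the end admits a simple linear-programming sketch: given an approximate direction $r$ from the trust-region stage, minimize $\|\mathbf q^\top \mathbf Y\|_1$ subject to $\langle r, \mathbf q \rangle = 1$. The sketch below (numpy/scipy; the solver settings and final normalization are illustrative assumptions, not the paper's exact implementation) shows one way to set up this LP.

```python
import numpy as np
from scipy.optimize import linprog

def lp_round(Y, r):
    """Sharpen an approximate direction r by solving
        minimize ||Y^T q||_1   subject to   <r, q> = 1,
    so that q^T Y reproduces (a scaled copy of) one sparse row of X0."""
    n, p = Y.shape
    # Variables: [q (n), t (p)]; minimize sum(t) with |Y^T q| <= t elementwise.
    c = np.concatenate([np.zeros(n), np.ones(p)])
    A_ub = np.block([[ Y.T, -np.eye(p)],    #  Y^T q - t <= 0
                     [-Y.T, -np.eye(p)]])   # -Y^T q - t <= 0
    b_ub = np.zeros(2 * p)
    A_eq = np.concatenate([r, np.zeros(p)]).reshape(1, -1)
    b_eq = np.array([1.0])
    bounds = [(None, None)] * n + [(0, None)] * p
    res = linprog(c, A_ub=A_ub, b_ub=b_ub, A_eq=A_eq, b_eq=b_eq,
                  bounds=bounds, method="highs")
    q = res.x[:n]
    return q / np.linalg.norm(q)
```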
Abstract:We consider the problem of recovering a complete (i.e., square and invertible) matrix $\mathbf A_0$, from $\mathbf Y \in \mathbb{R}^{n \times p}$ with $\mathbf Y = \mathbf A_0 \mathbf X_0$, provided $\mathbf X_0$ is sufficiently sparse. This recovery problem is central to the theoretical understanding of dictionary learning, which seeks a sparse representation for a collection of input signals and finds numerous applications in modern signal processing and machine learning. We give the first efficient algorithm that provably recovers $\mathbf A_0$ when $\mathbf X_0$ has $O(n)$ nonzeros per column, under a suitable probability model for $\mathbf X_0$. In contrast, prior results based on efficient algorithms either only guarantee recovery when $\mathbf X_0$ has $O(\sqrt{n})$ nonzeros per column, or require multiple rounds of SDP relaxation to work when $\mathbf X_0$ has $O(n^{1-\delta})$ nonzeros per column (for any constant $\delta \in (0, 1)$). Our algorithmic pipeline centers around solving a certain nonconvex optimization problem with a spherical constraint. In this paper, we provide a geometric characterization of the objective landscape. In particular, we show that the problem is highly structured: with high probability, (1) there are no "spurious" local minimizers; and (2) around all saddle points the objective has negative directional curvature. This distinctive structure makes the problem amenable to efficient optimization algorithms. In a companion paper (arXiv:1511.04777), we design a second-order trust-region algorithm over the sphere that provably converges to a local minimizer from arbitrary initializations, despite the presence of saddle points.
Abstract:Is it possible to find the sparsest vector (direction) in a generic subspace $\mathcal{S} \subseteq \mathbb{R}^p$ with $\mathrm{dim}(\mathcal{S})= n < p$? This problem can be considered a homogeneous variant of the sparse recovery problem, and it has connections to sparse dictionary learning, sparse PCA, and many other problems in signal processing and machine learning. In this paper, we focus on a planted sparse model for the subspace: the target sparse vector is embedded in an otherwise random subspace. Simple convex heuristics for this planted recovery problem provably break down when the fraction of nonzero entries in the target sparse vector substantially exceeds $O(1/\sqrt{n})$. In contrast, we exhibit a relatively simple nonconvex approach based on alternating directions, which provably succeeds even when the fraction of nonzero entries is $\Omega(1)$. To the best of our knowledge, this is the first practical algorithm to achieve linear scaling under the planted sparse model. Empirically, our proposed algorithm also succeeds in more challenging data models, e.g., sparse dictionary learning.
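As a rough illustration of the alternating-directions idea (not the paper's exact algorithm; the thresholding level, initialization, and iteration count below are illustrative assumptions), one can alternate between soft-thresholding the current subspace element and re-projecting the direction onto the sphere:

```python
import numpy as np

def soft_threshold(v, lam):
    """Entrywise soft-thresholding operator."""
    return np.sign(v) * np.maximum(np.abs(v) - lam, 0.0)

def planted_sparse_adm(Y, lam=0.1, n_iter=500, seed=0):
    """Alternating-directions heuristic: find q on the sphere so that Y @ q is sparse.

    Y : (p, n) matrix whose columns span the subspace.
    Returns the estimated sparse vector Y @ q and the direction q.
    """
    rng = np.random.default_rng(seed)
    p, n = Y.shape
    q = rng.standard_normal(n)
    q /= np.linalg.norm(q)
    for _ in range(n_iter):
        x = soft_threshold(Y @ q, lam)   # sparsify the current subspace element
        z = Y.T @ x
        q = z / np.linalg.norm(z)        # project the direction back onto the sphere
    return Y @ q, q
```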
Abstract:In this note, we focus on smooth nonconvex optimization problems that obey: (1) all local minimizers are also global; and (2) around any saddle point or local maximizer, the objective has a negative directional curvature. Concrete applications such as dictionary learning, generalized phase retrieval, and orthogonal tensor decomposition are known to induce such structures. We describe a second-order trust-region algorithm that provably converges to a global minimizer efficiently, without special initializations. Finally, we highlight alternatives and open problems in this direction.
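A simplified, unconstrained caricature of how properties (1)-(2) are exploited is sketched below; this is not the trust-region method of the note, it only illustrates why negative curvature at saddle points and local maximizers allows a second-order method to keep making progress:

```python
import numpy as np

def second_order_step(grad_f, hess_f, x, eta=0.1, tol=1e-6):
    """One step of a simple saddle-escaping scheme for functions obeying (1)-(2)."""
    g = grad_f(x)
    if np.linalg.norm(g) > tol:
        return x - eta * g            # gradient is informative: take a descent step
    H = hess_f(x)
    w, V = np.linalg.eigh(H)
    if w[0] < -tol:
        # At a saddle point or local maximizer the smallest Hessian eigenvalue is
        # negative, so moving along its eigenvector strictly decreases the objective.
        return x + eta * V[:, 0]
    return x                          # approximate second-order stationary point
```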
Abstract:We consider the problem of recovering a complete (i.e., square and invertible) matrix $\mathbf A_0$, from $\mathbf Y \in \mathbb R^{n \times p}$ with $\mathbf Y = \mathbf A_0 \mathbf X_0$, provided $\mathbf X_0$ is sufficiently sparse. This recovery problem is central to the theoretical understanding of dictionary learning, which seeks a sparse representation for a collection of input signals, and finds numerous applications in modern signal processing and machine learning. We give the first efficient algorithm that provably recovers $\mathbf A_0$ when $\mathbf X_0$ has $O(n)$ nonzeros per column, under a suitable probability model for $\mathbf X_0$. In contrast, prior results based on efficient algorithms provide recovery guarantees only when $\mathbf X_0$ has $O(n^{1-\delta})$ nonzeros per column, for any constant $\delta \in (0, 1)$. Our algorithmic pipeline centers around solving a certain nonconvex optimization problem with a spherical constraint, and hence is naturally phrased in the language of manifold optimization. To show this apparently hard problem is tractable, we first provide a geometric characterization of the high-dimensional objective landscape, which shows that with high probability there are no "spurious" local minima. This particular geometric structure allows us to design a Riemannian trust-region algorithm over the sphere that provably converges to a local minimizer from an arbitrary initialization, despite the presence of saddle points. The geometric approach we develop here may also shed light on other problems arising from nonconvex recovery of structured signals.
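To give a flavor of the nonconvex program with a spherical constraint, the sketch below minimizes a smooth surrogate of $\|\mathbf q^\top \mathbf Y\|_1$ over the unit sphere by projected (Riemannian) gradient steps. The log-cosh surrogate, step size, and plain first-order iteration are illustrative assumptions; the algorithm analyzed in the paper is a second-order Riemannian trust-region method.

```python
import numpy as np

def surrogate_value_grad(q, Y, mu=0.1):
    """f(q) = (1/p) * sum_k mu * log cosh(q^T y_k / mu), a smooth proxy for (1/p)*||q^T Y||_1."""
    p = Y.shape[1]
    z = (Y.T @ q) / mu
    f = mu * np.mean(np.logaddexp(z, -z) - np.log(2.0))   # numerically stable log cosh
    g = (Y @ np.tanh(z)) / p                               # Euclidean gradient
    return f, g

def sphere_gradient_descent(Y, step=0.05, n_iter=300, seed=0):
    """Minimize the surrogate over the unit sphere by projected gradient steps."""
    rng = np.random.default_rng(seed)
    q = rng.standard_normal(Y.shape[0])
    q /= np.linalg.norm(q)
    for _ in range(n_iter):
        _, g = surrogate_value_grad(q, Y)
        g_tan = g - (q @ g) * q        # project the gradient onto the tangent space at q
        q = q - step * g_tan
        q /= np.linalg.norm(q)         # retract back onto the sphere
    return q
```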
Abstract:Motivated by vision tasks such as robust face and object recognition, we consider the following general problem: given a collection of low-dimensional linear subspaces in a high-dimensional ambient (image) space, and a query point (image), efficiently determine the nearest subspace to the query in $\ell^1$ distance. In contrast to the naive exhaustive search, which entails large-scale linear programs, we show that the computational burden can be cut down significantly by a simple two-stage algorithm: (1) project the query and the database subspaces into a lower-dimensional space via a random Cauchy matrix, and solve small-scale distance evaluations (linear programs) in the projected space to locate candidate nearest subspaces; (2) with the few candidates obtained from independent repetitions of (1), return to the high-dimensional space and perform exhaustive search over the candidates only. To preserve the identity of the nearest subspace with nontrivial probability, the projection dimension typically needs to be a low-order polynomial of the subspace dimension multiplied by the logarithm of the number of subspaces (Theorem 2.1). The reduced dimensionality, and hence complexity, makes the proposed algorithm particularly relevant to vision applications such as robust face and object instance recognition, which we investigate empirically.
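The two ingredients of stage (1) admit a short sketch: an i.i.d. Cauchy projection, and the $\ell^1$ point-to-subspace distance written as a small linear program. Function names, dimensions, and solver settings below are illustrative, not the paper's implementation.

```python
import numpy as np
from scipy.optimize import linprog

def cauchy_project(M, k, seed=0):
    """Project the columns of M (points or basis vectors in R^m) down to R^k
    using an i.i.d. standard Cauchy matrix."""
    rng = np.random.default_rng(seed)
    P = rng.standard_cauchy((k, M.shape[0]))
    return P @ M

def l1_distance_to_subspace(y, B):
    """min_x ||y - B x||_1 via the standard LP reformulation:
    minimize sum(t)  subject to  -t <= y - B x <= t (elementwise)."""
    m, d = B.shape
    # Variables: [x (d), t (m)].
    c = np.concatenate([np.zeros(d), np.ones(m)])
    A_ub = np.block([[ B, -np.eye(m)],    #  B x - t <=  y
                     [-B, -np.eye(m)]])   # -B x - t <= -y
    b_ub = np.concatenate([y, -y])
    bounds = [(None, None)] * d + [(0, None)] * m
    res = linprog(c, A_ub=A_ub, b_ub=b_ub, bounds=bounds, method="highs")
    return res.fun
```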
Abstract:Motivated by vision tasks such as robust face and object recognition, we consider the following general problem: given a collection of low-dimensional linear subspaces in a high-dimensional ambient (image) space and a query point (image), efficiently determine the nearest subspace to the query in $\ell^1$ distance. We show in theory that a Cauchy random embedding of the objects into a significantly lower-dimensional space preserves the identity of the nearest subspace with constant probability. This offers the possibility of efficiently selecting several candidates for accurate search. We sketch preliminary experiments on robust face and digit recognition to corroborate our theory.
Abstract:In this work we address the subspace recovery problem. Given a set of data samples (vectors) approximately drawn from a union of multiple subspaces, our goal is to segment the samples into their respective subspaces and to correct possible errors as well. To this end, we propose a novel method termed Low-Rank Representation (LRR), which seeks the lowest-rank representation among all the candidates that can represent the data samples as linear combinations of the bases in a given dictionary. We show that LRR solves the subspace recovery problem well: when the data is clean, we prove that LRR exactly captures the true subspace structures; for data contaminated by outliers, we prove that under certain conditions LRR exactly recovers the row space of the original data and detects the outliers as well; for data corrupted by arbitrary errors, LRR can also approximately recover the row space with theoretical guarantees. Since the subspace membership is provably determined by the row space, these results further imply that LRR can perform robust subspace segmentation and error correction in an efficient way.
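For reference, the outlier-handling variant of LRR discussed above is commonly written as the convex program $\min_{Z,E} \|Z\|_* + \lambda \|E\|_{2,1}$ subject to $X = AZ + E$. The cvxpy sketch below (with an illustrative $\lambda$; the $\ell_{2,1}$ error term is the standard choice for column-wise outliers, stated here as an assumption rather than this paper's exact setup) is a minimal way to solve small instances, not a scalable solver.

```python
import cvxpy as cp
import numpy as np

def lrr(X, A, lam=0.1):
    """Low-Rank Representation with a column-sparse error term:
    minimize ||Z||_* + lam * ||E||_{2,1}  subject to  X = A Z + E."""
    d, n = X.shape
    m = A.shape[1]
    Z = cp.Variable((m, n))
    E = cp.Variable((d, n))
    objective = cp.Minimize(cp.normNuc(Z) + lam * cp.sum(cp.norm(E, axis=0)))
    constraints = [X == A @ Z + E]
    cp.Problem(objective, constraints).solve()
    return Z.value, E.value
```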
Abstract:Nuclear norm approximation is an efficient and effective strategy for learning low-rank matrices, which arise frequently in machine learning and computer vision, so nuclear norm minimization problems have been attracting much attention recently. In this paper we prove that the following Low-Rank Representation (LRR) \cite{icml_2010_lrr,lrr_extention} problem, $\min_{Z} \|Z\|_* \ \ \text{s.t.} \ \ X = AZ$, has a unique and closed-form solution, where $X$ and $A$ are given matrices. The proof is based on a lemma that yields closed-form solutions to a category of nuclear norm minimization problems.
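The closed form established in this line of work is the pseudo-inverse solution $Z^* = A^\dagger X$ (assuming $X = AZ$ is feasible). The short numerical sketch below, with arbitrary illustrative dimensions, checks feasibility and shows that a generic feasible perturbation only increases the nuclear norm.

```python
import numpy as np

def nuclear_norm(M):
    """Sum of singular values."""
    return np.linalg.svd(M, compute_uv=False).sum()

rng = np.random.default_rng(0)
A = rng.standard_normal((10, 20))          # fat A, so X = A Z has many feasible solutions
Z0 = rng.standard_normal((20, 15))
X = A @ Z0

A_pinv = np.linalg.pinv(A)
Z_star = A_pinv @ X                        # pseudo-inverse (closed-form) solution
assert np.allclose(A @ Z_star, X)          # feasible

# Perturb inside the null space of A: still feasible, but with larger nuclear norm.
W = rng.standard_normal((20, 15))
Z_other = Z_star + (np.eye(20) - A_pinv @ A) @ W
assert np.allclose(A @ Z_other, X)
print(nuclear_norm(Z_star), "<", nuclear_norm(Z_other))
```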
Abstract:In this paper we propose a vision system that performs image Super Resolution (SR) with selectivity. Conventional SR techniques, whether based on multi-image fusion or example-based construction, fail to capitalize on the intrinsic structural and semantic context in the image, and perform "blind" resolution recovery over the entire image area. By comparison, we advocate example-based selective SR, in which selectivity is exemplified in three aspects: region selectivity (SR only at object regions), source selectivity (object SR with trained object dictionaries), and refinement selectivity (object boundary refinement using matting). The proposed system takes over-segmented low-resolution images as inputs, assimilates recent learning techniques of sparse coding (SC) and grouped multi-task lasso (GMTL), and leads to a framework for joint figure-ground separation and interest-object SR. The efficiency of our framework is demonstrated in experiments with subsets of the VOC2009 and MSRC datasets. We also demonstrate several interesting vision applications that can build on our system.