Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Behnam Neyshabur

Shammie

Clustering, Hamming Embedding, Generalized LSH and the Max Norm

May 13, 2014

Behnam Neyshabur, Yury Makarychev, Nathan Srebro

Abstract:We study the convex relaxation of clustering and hamming embedding, focusing on the asymmetric case (co-clustering and asymmetric hamming embedding), understanding their relationship to LSH as studied by (Charikar 2002) and to the max-norm ball, and the differences between their symmetric and asymmetric versions.

* 17 pages

Via

Access Paper or Ask Questions

Sparse Matrix Factorization

May 13, 2014

Behnam Neyshabur, Rina Panigrahy

Abstract:We investigate the problem of factorizing a matrix into several sparse matrices and propose an algorithm for this under randomness and sparsity assumptions. This problem can be viewed as a simplification of the deep learning problem where finding a factorization corresponds to finding edges in different layers and values of hidden units. We prove that under certain assumptions for a sparse linear deep network with $n$ nodes in each layer, our algorithm is able to recover the structure of the network and values of top layer hidden units for depths up to $\tilde O(n^{1/6})$. We further discuss the relation among sparse matrix factorization, deep learning, sparse recovery and dictionary learning.

* 20 pages

Via

Access Paper or Ask Questions

The Power of Asymmetry in Binary Hashing

Nov 29, 2013

Behnam Neyshabur, Payman Yadollahpour, Yury Makarychev, Ruslan Salakhutdinov, Nathan Srebro

Figure 1 for The Power of Asymmetry in Binary Hashing

Figure 2 for The Power of Asymmetry in Binary Hashing

Figure 3 for The Power of Asymmetry in Binary Hashing

Figure 4 for The Power of Asymmetry in Binary Hashing

Abstract:When approximating binary similarity using the hamming distance between short binary hashes, we show that even if the similarity is symmetric, we can have shorter and more accurate hashes by using two distinct code maps. I.e. by approximating the similarity between $x$ and $x'$ as the hamming distance between $f(x)$ and $g(x')$, for two distinct binary codes $f,g$, rather than as the hamming distance between $f(x)$ and $f(x')$.

* Accepted to NIPS 2013, 9 pages, 5 figures

Via

Access Paper or Ask Questions