Alaa Saade

Clustering from Sparse Pairwise Measurements

May 19, 2016

Alaa Saade, Marc Lelarge, Florent Krzakala, Lenka Zdeborová

Abstract: We consider the problem of grouping items into clusters based on few random pairwise comparisons between the items. We introduce three closely related algorithms for this task: a belief propagation algorithm approximating the Bayes optimal solution, and two spectral algorithms based on the non-backtracking and Bethe Hessian operators. For the case of two symmetric clusters, we conjecture that these algorithms are asymptotically optimal in that they detect the clusters as soon as it is information theoretically possible to do so. We substantiate this claim for one of the spectral approaches we introduce.

* Proceedings of the 2016 IEEE International Symposium on Information Theory (ISIT) Pages: 780 - 784

Matrix Completion from Fewer Entries: Spectral Detectability and Rank Estimation

Jan 28, 2016

Alaa Saade, Florent Krzakala, Lenka Zdeborová

Abstract: The completion of low rank matrices from few entries is a task with many practical applications. We consider here two aspects of this problem: detectability, i.e. the ability to estimate the rank $r$ reliably from the fewest possible random entries, and performance in achieving small reconstruction error. We propose a spectral algorithm for these two tasks called MaCBetH (for Matrix Completion with the Bethe Hessian). The rank is estimated as the number of negative eigenvalues of the Bethe Hessian matrix, and the corresponding eigenvectors are used as the initial condition for the minimization of the discrepancy between the estimated matrix and the revealed entries. We analyze the performance in a random matrix setting using results from the statistical mechanics of the Hopfield neural network, and show in particular that MaCBetH efficiently detects the rank $r$ of a large $n\times m$ matrix from $C(r)r\sqrt{nm}$ entries, where $C(r)$ is a constant close to $1$. We also evaluate the corresponding root-mean-square error empirically and show that MaCBetH compares favorably to other existing approaches.

* Advances in Neural Information Processing Systems (NIPS 2015) 28, pages 1261--1269
* NIPS Conference 2015
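
To give a feel for the sample count $C(r)r\sqrt{nm}$ quoted in the abstract, the following minimal sketch evaluates it for one matrix size. The function name `entries_for_rank_detection` is hypothetical, and the default `c_r = 1.0` is only a stand-in for "a constant close to 1"; the abstract does not give the exact value of $C(r)$.

```python
import math


def entries_for_rank_detection(n: int, m: int, r: int, c_r: float = 1.0) -> float:
    """Evaluate C(r) * r * sqrt(n * m) from the abstract.

    c_r = 1.0 is an assumption standing in for "a constant close to 1".
    """
    return c_r * r * math.sqrt(n * m)


# Illustrative sizes (not taken from the paper).
n, m, r = 1000, 1000, 10
needed = entries_for_rank_detection(n, m, r)
fraction = needed / (n * m)  # share of all n * m entries that must be revealed

print(f"entries needed: {needed:.0f} ({fraction:.1%} of the matrix)")
```

Under these assumptions, a rank-10 matrix with a million entries would need on the order of ten thousand revealed entries, about 1% of the matrix, for its rank to be detectable.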

Random Projections through multiple optical scattering: Approximating kernels at the speed of light

Oct 25, 2015

Alaa Saade, Francesco Caltagirone, Igor Carron, Laurent Daudet, Angélique Drémeau, Sylvain Gigan, Florent Krzakala

Abstract: Random projections have proven extremely useful in many signal processing and machine learning applications. However, they often require either storing a very large random matrix or using a different, structured matrix to reduce the computational and memory costs. Here, we overcome this difficulty by proposing an analog optical device that performs the random projections literally at the speed of light without having to store any matrix in memory. This is achieved using the physical properties of multiple coherent scattering of coherent light in random media. We use this device on a simple task of classification with a kernel machine, and we show that, on the MNIST database, the experimental results closely match the theoretical performance of the corresponding kernel. This framework can help make kernel methods practical for applications that have large training sets and/or require real-time prediction. We discuss possible extensions of the method in terms of a class of kernels, speed, memory consumption and different problems.

* Proceedings of the 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) pages: 6215 - 6219
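
The idea of approximating a kernel with random projections can be illustrated in software with random Fourier features, a standard technique that is swapped in here for illustration; it is not the optical scheme or the specific kernel of the paper. The sketch stores the random matrix `W` explicitly, which is exactly the memory cost the abstract says the optical device avoids. All names and parameter values below are assumptions.

```python
import numpy as np


def random_fourier_features(X, n_features, gamma, rng):
    """Map X so that Z @ Z.T approximates exp(-gamma * ||x - y||^2)."""
    d = X.shape[1]
    # Frequencies drawn from N(0, 2 * gamma * I), the spectral density of the RBF kernel.
    W = rng.normal(scale=np.sqrt(2.0 * gamma), size=(d, n_features))
    b = rng.uniform(0.0, 2.0 * np.pi, size=n_features)
    return np.sqrt(2.0 / n_features) * np.cos(X @ W + b)


def rbf_kernel(X, gamma):
    """Exact RBF kernel matrix, for comparison."""
    sq = np.sum(X**2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * X @ X.T
    return np.exp(-gamma * np.maximum(d2, 0.0))


rng = np.random.default_rng(0)
X = rng.normal(size=(20, 5))  # toy data, not MNIST
gamma = 0.1

Z = random_fourier_features(X, n_features=20000, gamma=gamma, rng=rng)
max_error = np.max(np.abs(Z @ Z.T - rbf_kernel(X, gamma)))
print(f"largest kernel approximation error: {max_error:.4f}")
```

The approximation error shrinks roughly as one over the square root of the number of random features, so a tight approximation needs a large `W`; this is the storage and compute burden that motivates a hardware projection.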

Spectral Detection in the Censored Block Model

Jun 10, 2015

Alaa Saade, Florent Krzakala, Marc Lelarge, Lenka Zdeborová

Abstract: We consider the problem of partially recovering hidden binary variables from the observation of (few) censored edge weights, a problem with applications in community detection, correlation clustering and synchronization. We describe two spectral algorithms for this task based on the non-backtracking and the Bethe Hessian operators. These algorithms are shown to be asymptotically optimal for the partial recovery problem, in that they detect the hidden assignment as soon as it is information theoretically possible to do so.

* 2015 IEEE International Symposium on Information Theory (ISIT), pp. 1184-1188, 14-19 June 2015
* ISIT 2015
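
The Bethe Hessian approach described in this abstract can be sketched on synthetic data. The sketch assumes a signed version of the operator, $H(r) = (r^2 - 1)I - rJ + D$ with $r$ set to the square root of the average degree, where $J$ holds the observed $\pm 1$ edge labels and $D$ the node degrees; the abstract does not spell out this formula, so treat it as an assumption. The generator, function names and parameter values are likewise illustrative.

```python
import numpy as np


def censored_block_model(n, alpha, eps, rng):
    """Hidden signs sigma; each pair is observed with prob. alpha / n,
    and an observed label sigma_i * sigma_j is flipped with prob. eps."""
    sigma = rng.choice([-1, 1], size=n)
    observed = np.triu(rng.random((n, n)) < alpha / n, k=1)
    noise = np.where(rng.random((n, n)) < eps, -1, 1)
    J = observed * noise * np.outer(sigma, sigma)
    return sigma, J + J.T  # symmetric, zero diagonal


def bethe_hessian_signs(J):
    """Estimate the hidden signs from the eigenvector of the smallest eigenvalue."""
    n = J.shape[0]
    degrees = np.count_nonzero(J, axis=1)
    r = np.sqrt(degrees.mean())  # assumed choice of the parameter r
    H = (r**2 - 1) * np.eye(n) - r * J + np.diag(degrees)
    _, eigvecs = np.linalg.eigh(H)  # eigenvalues returned in ascending order
    return np.where(eigvecs[:, 0] >= 0, 1, -1)


rng = np.random.default_rng(1)
sigma, J = censored_block_model(n=500, alpha=10.0, eps=0.1, rng=rng)
estimate = bethe_hessian_signs(J)

# The assignment is only recoverable up to a global sign flip.
agreement = np.mean(estimate == sigma)
accuracy = max(agreement, 1.0 - agreement)
print(f"fraction of correctly recovered signs: {accuracy:.2f}")
```

A dense eigendecomposition is used for brevity; for large sparse graphs an iterative solver that computes only the lowest eigenpairs would be the natural choice.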

Spectral Clustering of Graphs with the Bethe Hessian

Sep 08, 2014

Alaa Saade, Florent Krzakala, Lenka Zdeborová

Abstract: Spectral clustering is a standard approach to label nodes on a graph by studying the (largest or smallest) eigenvalues of a symmetric real matrix such as the adjacency matrix or the Laplacian. Recently, it has been argued that using instead a more complicated, non-symmetric and higher dimensional operator, related to the non-backtracking walk on the graph, leads to improved performance in detecting clusters, and even to optimal performance for the stochastic block model. Here, we propose to use instead a simpler object, a symmetric real matrix known as the Bethe Hessian operator, or deformed Laplacian. We show that this approach combines the performance of the non-backtracking operator, thus detecting clusters all the way down to the theoretical limit in the stochastic block model, with the computational, theoretical and memory advantages of real symmetric matrices.

* Advances in Neural Information Processing Systems 27 (NIPS 2014) pp 406-414
* 8 pages, 2 figures
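
A minimal sketch of clustering with the Bethe Hessian on a two-group stochastic block model follows. It assumes the commonly stated form $H(r) = (r^2 - 1)I - rA + D$ with $r$ equal to the square root of the average degree, where $A$ is the adjacency matrix and $D$ the diagonal degree matrix; the abstract names the operator but does not give the formula. Function names and parameter values are illustrative only.

```python
import numpy as np


def stochastic_block_model(n, c_in, c_out, rng):
    """Two equal groups; edge prob. c_in / n inside a group, c_out / n across."""
    labels = np.repeat([0, 1], n // 2)
    same = labels[:, None] == labels[None, :]
    probs = np.where(same, c_in / n, c_out / n)
    upper = np.triu(rng.random((n, n)) < probs, k=1)
    return labels, (upper | upper.T).astype(float)


def bethe_hessian_clusters(A):
    """Split nodes by the sign of the eigenvector of the second-smallest eigenvalue."""
    n = A.shape[0]
    degrees = A.sum(axis=1)
    r = np.sqrt(degrees.mean())  # assumed choice of the parameter r
    H = (r**2 - 1) * np.eye(n) - r * A + np.diag(degrees)
    eigvals, eigvecs = np.linalg.eigh(H)  # ascending eigenvalues
    n_negative = int(np.sum(eigvals < 0))
    # The lowest eigenvector is roughly uniform; the next one carries the split.
    return (eigvecs[:, 1] > 0).astype(int), n_negative


rng = np.random.default_rng(2)
labels, A = stochastic_block_model(n=500, c_in=18.0, c_out=2.0, rng=rng)
clusters, n_negative = bethe_hessian_clusters(A)

# Cluster names are arbitrary, so score up to a swap of the two labels.
agreement = np.mean(clusters == labels)
accuracy = max(agreement, 1.0 - agreement)
print(f"negative eigenvalues: {n_negative}, accuracy: {accuracy:.2f}")
```

Because $H$ is real and symmetric with the same size as the adjacency matrix, standard symmetric eigensolvers apply directly, which is the practical advantage over the non-backtracking operator that the abstract points to.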
