Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Arnaud Breloy

LEME, Paris Nanterre University, Paris, France

Riemannian Change Point Detection on Manifolds with Robust Centroid Estimation

Aug 25, 2025

Xiuheng Wang, Ricardo Borsoi, Arnaud Breloy, Cédric Richard

Abstract:Non-parametric change-point detection in streaming time series data is a long-standing challenge in signal processing. Recent advancements in statistics and machine learning have increasingly addressed this problem for data residing on Riemannian manifolds. One prominent strategy involves monitoring abrupt changes in the center of mass of the time series. Implemented in a streaming fashion, this strategy, however, requires careful step size tuning when computing the updates of the center of mass. In this paper, we propose to leverage robust centroid on manifolds from M-estimation theory to address this issue. Our proposal consists of comparing two centroid estimates: the classical Karcher mean (sensitive to change) versus one defined from Huber's function (robust to change). This comparison leads to the definition of a test statistic whose performance is less sensitive to the underlying estimation method. We propose a stochastic Riemannian optimization algorithm to estimate both robust centroids efficiently. Experiments conducted on both simulated and real-world data across two representative manifolds demonstrate the superior performance of our proposed method.

Via

Access Paper or Ask Questions

Leveraging Low-rank Factorizations of Conditional Correlation Matrices in Graph Learning

Jun 12, 2025

Thu Ha Phi, Alexandre Hippert-Ferrer, Florent Bouchard, Arnaud Breloy

Abstract:This paper addresses the problem of learning an undirected graph from data gathered at each nodes. Within the graph signal processing framework, the topology of such graph can be linked to the support of the conditional correlation matrix of the data. The corresponding graph learning problem then scales to the squares of the number of variables (nodes), which is usually problematic at large dimension. To tackle this issue, we propose a graph learning framework that leverages a low-rank factorization of the conditional correlation matrix. In order to solve for the resulting optimization problems, we derive tools required to apply Riemannian optimization techniques for this particular structure. The proposal is then particularized to a low-rank constrained counterpart of the GLasso algorithm, i.e., the penalized maximum likelihood estimation of a Gaussian graphical model. Experiments on synthetic and real data evidence that a very efficient dimension-versus-performance trade-off can be achieved with this approach.

* 11 pages, 5 figures

Via

Access Paper or Ask Questions

Sparse PCA with False Discovery Rate Controlled Variable Selection

Jan 16, 2024

Jasin Machkour, Arnaud Breloy, Michael Muma, Daniel P. Palomar, Frédéric Pascal

Figure 1 for Sparse PCA with False Discovery Rate Controlled Variable Selection

Figure 2 for Sparse PCA with False Discovery Rate Controlled Variable Selection

Figure 3 for Sparse PCA with False Discovery Rate Controlled Variable Selection

Figure 4 for Sparse PCA with False Discovery Rate Controlled Variable Selection

Abstract:Sparse principal component analysis (PCA) aims at mapping large dimensional data to a linear subspace of lower dimension. By imposing loading vectors to be sparse, it performs the double duty of dimension reduction and variable selection. Sparse PCA algorithms are usually expressed as a trade-off between explained variance and sparsity of the loading vectors (i.e., number of selected variables). As a high explained variance is not necessarily synonymous with relevant information, these methods are prone to select irrelevant variables. To overcome this issue, we propose an alternative formulation of sparse PCA driven by the false discovery rate (FDR). We then leverage the Terminating-Random Experiments (T-Rex) selector to automatically determine an FDR-controlled support of the loading vectors. A major advantage of the resulting T-Rex PCA is that no sparsity parameter tuning is required. Numerical experiments and a stock market data example demonstrate a significant performance improvement.

* Published in ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), scheduled for 14-19 April 2024 in Seoul, Korea

Via

Access Paper or Ask Questions

Natural Bayesian Cramér-Rao Bound with an Application to Covariance Estimation

Nov 08, 2023

Florent Bouchard, Alexandre Renaux, Guillaume Ginolhac, Arnaud Breloy

Figure 1 for Natural Bayesian Cramér-Rao Bound with an Application to Covariance Estimation

Figure 2 for Natural Bayesian Cramér-Rao Bound with an Application to Covariance Estimation

Figure 3 for Natural Bayesian Cramér-Rao Bound with an Application to Covariance Estimation

Figure 4 for Natural Bayesian Cramér-Rao Bound with an Application to Covariance Estimation

Abstract:In this paper, we propose to develop a new Cram\'er-Rao Bound (CRB) when the parameter to estimate lies in a manifold and follows a prior distribution. This derivation leads to a natural inequality between an error criteria based on geometrical properties and this new bound. This main contribution is illustrated in the problem of covariance estimation when the data follow a Gaussian distribution and the prior distribution is an inverse Wishart. Numerical simulation shows new results where the proposed CRB allows to exhibit interesting properties of the MAP estimator which are not observed with the classical Bayesian CRB.

Via

Access Paper or Ask Questions

The Fisher-Rao geometry of CES distributions

Oct 02, 2023

Florent Bouchard, Arnaud Breloy, Antoine Collas, Alexandre Renaux, Guillaume Ginolhac

Figure 1 for The Fisher-Rao geometry of CES distributions

Figure 2 for The Fisher-Rao geometry of CES distributions

Figure 3 for The Fisher-Rao geometry of CES distributions

Figure 4 for The Fisher-Rao geometry of CES distributions

Abstract:When dealing with a parametric statistical model, a Riemannian manifold can naturally appear by endowing the parameter space with the Fisher information metric. The geometry induced on the parameters by this metric is then referred to as the Fisher-Rao information geometry. Interestingly, this yields a point of view that allows for leveragingmany tools from differential geometry. After a brief introduction about these concepts, we will present some practical uses of these geometric tools in the framework of elliptical distributions. This second part of the exposition is divided into three main axes: Riemannian optimization for covariance matrix estimation, Intrinsic Cram\'er-Rao bounds, and classification using Riemannian distances.

Via

Access Paper or Ask Questions

Entropic Wasserstein Component Analysis

Mar 09, 2023

Antoine Collas, Titouan Vayer, Rémi Flamary, Arnaud Breloy

Abstract:Dimension reduction (DR) methods provide systematic approaches for analyzing high-dimensional data. A key requirement for DR is to incorporate global dependencies among original and embedded samples while preserving clusters in the embedding space. To achieve this, we combine the principles of optimal transport (OT) and principal component analysis (PCA). Our method seeks the best linear subspace that minimizes reconstruction error using entropic OT, which naturally encodes the neighborhood information of the samples. From an algorithmic standpoint, we propose an efficient block-majorization-minimization solver over the Stiefel manifold. Our experimental results demonstrate that our approach can effectively preserve high-dimensional clusters, leading to more interpretable and effective embeddings. Python code of the algorithms and experiments is available online.

Via

Access Paper or Ask Questions

Riemannian optimization for non-centered mixture of scaled Gaussian distributions

Sep 07, 2022

Antoine Collas, Arnaud Breloy, Chengfang Ren, Guillaume Ginolhac, Jean-Philippe Ovarlez

Figure 1 for Riemannian optimization for non-centered mixture of scaled Gaussian distributions

Figure 2 for Riemannian optimization for non-centered mixture of scaled Gaussian distributions

Figure 3 for Riemannian optimization for non-centered mixture of scaled Gaussian distributions

Figure 4 for Riemannian optimization for non-centered mixture of scaled Gaussian distributions

Abstract:This paper studies the statistical model of the non-centered mixture of scaled Gaussian distributions (NC-MSG). Using the Fisher-Rao information geometry associated to this distribution, we derive a Riemannian gradient descent algorithm. This algorithm is leveraged for two minimization problems. The first one is the minimization of a regularized negative log- likelihood (NLL). The latter makes the trade-off between a white Gaussian distribution and the NC-MSG. Conditions on the regularization are given so that the existence of a minimum to this problem is guaranteed without assumptions on the samples. Then, the Kullback-Leibler (KL) divergence between two NC-MSG is derived. This divergence enables us to define a minimization problem to compute centers of mass of several NC-MSGs. The proposed Riemannian gradient descent algorithm is leveraged to solve this second minimization problem. Numerical experiments show the good performance and the speed of the Riemannian gradient descent on the two problems. Finally, a Nearest centroid classifier is implemented leveraging the KL divergence and its associated center of mass. Applied on the large scale dataset Breizhcrops, this classifier shows good accuracies as well as robustness to rigid transformations of the test set.

Via

Access Paper or Ask Questions

Robust Geometric Metric Learning

Feb 23, 2022

Antoine Collas, Arnaud Breloy, Guillaume Ginolhac, Chengfang Ren, Jean-Philippe Ovarlez

Figure 1 for Robust Geometric Metric Learning

Figure 2 for Robust Geometric Metric Learning

Abstract:This paper proposes new algorithms for the metric learning problem. We start by noticing that several classical metric learning formulations from the literature can be viewed as modified covariance matrix estimation problems. Leveraging this point of view, a general approach, called Robust Geometric Metric Learning (RGML), is then studied. This method aims at simultaneously estimating the covariance matrix of each class while shrinking them towards their (unknown) barycenter. We focus on two specific costs functions: one associated with the Gaussian likelihood (RGML Gaussian), and one with Tyler's M -estimator (RGML Tyler). In both, the barycenter is defined with the Riemannian distance, which enjoys nice properties of geodesic convexity and affine invariance. The optimization is performed using the Riemannian geometry of symmetric positive definite matrices and its submanifold of unit determinant. Finally, the performance of RGML is asserted on real datasets. Strong performance is exhibited while being robust to mislabeled data.

Via

Access Paper or Ask Questions

Multifrequency Array Calibration in Presence of Radio Frequency Interferences

Feb 15, 2022

Yassine Mhiri, Mohammed Nabil El Korso, Arnaud Breloy, Pascal Larzabal

Figure 1 for Multifrequency Array Calibration in Presence of Radio Frequency Interferences

Figure 2 for Multifrequency Array Calibration in Presence of Radio Frequency Interferences

Figure 3 for Multifrequency Array Calibration in Presence of Radio Frequency Interferences

Figure 4 for Multifrequency Array Calibration in Presence of Radio Frequency Interferences

Abstract:Radio interferometers are phased arrays producing high-resolution images from the covariance matrix of measurements. Calibration of such instruments is necessary and is a critical task. This is how the estimation of instrumental errors is usually done thanks to the knowledge of referenced celestial sources. However, the use of high sensitive antennas in modern radio interferometers (LOFAR, SKA) brings a new challenge in radio astronomy because there are more sensitive to Radio Frequency Interferences (RFI). The presence of RFI during the calibration process generally induces biases in state-of-the-art solutions. The purpose of this paper is to propose an alternative to alleviate the effects of RFI. For that, we first propose a model to take into account the presence of RFI in the data across multiple frequency channels thanks to a low-rank structured noise. We then achieve maximum likelihood estimation of the calibration parameters with a Space Alternating Generalized Expectation-Maximization (SAGE) algorithm for which we derive originally two sets of complete data allowing close form expressions for the updates. Numerical simulations show a significant gain in performance for RFI corrupted data in comparison with some more classical methods.

Via

Access Paper or Ask Questions

Robust low-rank covariance matrix estimation with a general pattern of missing values

Jul 22, 2021

Alexandre Hippert-Ferrer, Mohammed Nabil El Korso, Arnaud Breloy, Guillaume Ginolhac

Figure 1 for Robust low-rank covariance matrix estimation with a general pattern of missing values

Figure 2 for Robust low-rank covariance matrix estimation with a general pattern of missing values

Figure 3 for Robust low-rank covariance matrix estimation with a general pattern of missing values

Figure 4 for Robust low-rank covariance matrix estimation with a general pattern of missing values

Abstract:This paper tackles the problem of robust covariance matrix estimation when the data is incomplete. Classical statistical estimation methodologies are usually built upon the Gaussian assumption, whereas existing robust estimation ones assume unstructured signal models. The former can be inaccurate in real-world data sets in which heterogeneity causes heavy-tail distributions, while the latter does not profit from the usual low-rank structure of the signal. Taking advantage of both worlds, a covariance matrix estimation procedure is designed on a robust (compound Gaussian) low-rank model by leveraging the observed-data likelihood function within an expectation-maximization algorithm. It is also designed to handle general pattern of missing values. The proposed procedure is first validated on simulated data sets. Then, its interest for classification and clustering applications is assessed on two real data sets with missing values, which include multispectral and hyperspectral time series.

* 32 pages, 10 figures, submitted to Elsevier Signal Processing

Via

Access Paper or Ask Questions