Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Marc Vuffray

Finite Sample Bounds for Learning with Score Matching

May 13, 2026

Devin Smedira, Abhijith Jayakumar, Sidhant Misra, Marc Vuffray, Andrey Y. Lokhov

Abstract:Learning of continuous exponential family distributions with unbounded support remains an important area of research for both theory and applications in high-dimensional statistics. In recent years, score matching has become a widely used method for learning exponential families with continuous variables due to its computational ease when compared against maximum likelihood estimation. However, theoretical understanding of the statistical properties of score matching is still lacking. In this work, we provide a non-asymptotic sample complexity analysis for learning the structure of exponential families of polynomials with score matching. The derived sample bounds show a polynomial dependence on the model dimension. These bounds are the first of its kind, as all prior work has shown only asymptotic bounds on the sample complexity.

* 22 pages

Via

Access Paper or Ask Questions

Selecting Optimal Variable Order in Autoregressive Ising Models

Feb 23, 2026

Shiba Biswal, Marc Vuffray, Andrey Y. Lokhov

Abstract:Autoregressive models enable tractable sampling from learned probability distributions, but their performance critically depends on the variable ordering used in the factorization via complexities of the resulting conditional distributions. We propose to learn the Markov random field describing the underlying data, and use the inferred graphical model structure to construct optimized variable orderings. We illustrate our approach on two-dimensional image-like models where a structure-aware ordering leads to restricted conditioning sets, thereby reducing model complexity. Numerical experiments on Ising models with discrete data demonstrate that graph-informed orderings yield higher-fidelity generated samples compared to naive variable orderings.

Via

Access Paper or Ask Questions

Computationally sufficient statistics for Ising models

Feb 12, 2026

Abhijith Jayakumar, Shreya Shukla, Marc Vuffray, Andrey Y. Lokhov, Sidhant Misra

Abstract:Learning Gibbs distributions using only sufficient statistics has long been recognized as a computationally hard problem. On the other hand, computationally efficient algorithms for learning Gibbs distributions rely on access to full sample configurations generated from the model. For many systems of interest that arise in physical contexts, expecting a full sample to be observed is not practical, and hence it is important to look for computationally efficient methods that solve the learning problem with access to only a limited set of statistics. We examine the trade-offs between the power of computation and observation within this scenario, employing the Ising model as a paradigmatic example. We demonstrate that it is feasible to reconstruct the model parameters for a model with $\ell_1$ width $γ$ by observing statistics up to an order of $O(γ)$. This approach allows us to infer the model's structure and also learn its couplings and magnetic fields. We also discuss a setting where prior information about structure of the model is available and show that the learning problem can be solved efficiently with even more limited observational power.

Via

Access Paper or Ask Questions

Discrete distributions are learnable from metastable samples

Oct 17, 2024

Abhijith Jayakumar, Andrey Y. Lokhov, Sidhant Misra, Marc Vuffray

Abstract:Markov chain samplers designed to sample from multi-variable distributions often undesirably get stuck in specific regions of their state space. This causes such samplers to approximately sample from a metastable distribution which is usually quite different from the desired, stationary distribution of the chain. We show that single-variable conditionals of metastable distributions of reversible Markov chain samplers that satisfy a strong metastability condition are on average very close to those of the true distribution. This holds even when the metastable distribution is far away from the true model in terms of global metrics like Kullback-Leibler divergence or total variation distance. This property allows us to learn the true model using a conditional likelihood based estimator, even when the samples come from a metastable distribution concentrated in a small region of the state space. Explicit examples of such metastable states can be constructed from regions that effectively bottleneck the probability flow and cause poor mixing of the Markov chain. For specific cases of binary pairwise undirected graphical models, we extend our results to further rigorously show that data coming from metastable states can be used to learn the parameters of the energy function and recover the structure of the model.

* Preliminary version, 26 pages

Via

Access Paper or Ask Questions

Learning Energy-Based Representations of Quantum Many-Body States

Apr 08, 2023

Abhijith Jayakumar, Marc Vuffray, Andrey Y. Lokhov

Figure 1 for Learning Energy-Based Representations of Quantum Many-Body States

Figure 2 for Learning Energy-Based Representations of Quantum Many-Body States

Figure 3 for Learning Energy-Based Representations of Quantum Many-Body States

Figure 4 for Learning Energy-Based Representations of Quantum Many-Body States

Abstract:Efficient representation of quantum many-body states on classical computers is a problem of enormous practical interest. An ideal representation of a quantum state combines a succinct characterization informed by the system's structure and symmetries, along with the ability to predict the physical observables of interest. A number of machine learning approaches have been recently used to construct such classical representations [1-6] which enable predictions of observables [7] and account for physical symmetries [8]. However, the structure of a quantum state gets typically lost unless a specialized ansatz is employed based on prior knowledge of the system [9-12]. Moreover, most such approaches give no information about what states are easier to learn in comparison to others. Here, we propose a new generative energy-based representation of quantum many-body states derived from Gibbs distributions used for modeling the thermal states of classical spin systems. Based on the prior information on a family of quantum states, the energy function can be specified by a small number of parameters using an explicit low-degree polynomial or a generic parametric family such as neural nets, and can naturally include the known symmetries of the system. Our results show that such a representation can be efficiently learned from data using exact algorithms in a form that enables the prediction of expectation values of physical observables. Importantly, the structure of the learned energy function provides a natural explanation for the hardness of learning for a given class of quantum states.

Via

Access Paper or Ask Questions

High-quality Thermal Gibbs Sampling with Quantum Annealing Hardware

Sep 03, 2021

Jon Nelson, Marc Vuffray, Andrey Y. Lokhov, Tameem Albash, Carleton Coffrin

Figure 1 for High-quality Thermal Gibbs Sampling with Quantum Annealing Hardware

Figure 2 for High-quality Thermal Gibbs Sampling with Quantum Annealing Hardware

Figure 3 for High-quality Thermal Gibbs Sampling with Quantum Annealing Hardware

Figure 4 for High-quality Thermal Gibbs Sampling with Quantum Annealing Hardware

Abstract:Quantum Annealing (QA) was originally intended for accelerating the solution of combinatorial optimization tasks that have natural encodings as Ising models. However, recent experiments on QA hardware platforms have demonstrated that, in the operating regime corresponding to weak interactions, the QA hardware behaves like a noisy Gibbs sampler at a hardware-specific effective temperature. This work builds on those insights and identifies a class of small hardware-native Ising models that are robust to noise effects and proposes a novel procedure for executing these models on QA hardware to maximize Gibbs sampling performance. Experimental results indicate that the proposed protocol results in high-quality Gibbs samples from a hardware-specific effective temperature and that the QA annealing time can be used to adjust the effective temperature of the output distribution. The procedure proposed in this work provides a new approach to using QA hardware for Ising model sampling presenting potential new opportunities for applications in machine learning and physics simulation.

Via

Access Paper or Ask Questions

Single-Qubit Fidelity Assessment of Quantum Annealing Hardware

Apr 07, 2021

Jon Nelson, Marc Vuffray, Andrey Y. Lokhov, Carleton Coffrin

Figure 1 for Single-Qubit Fidelity Assessment of Quantum Annealing Hardware

Figure 2 for Single-Qubit Fidelity Assessment of Quantum Annealing Hardware

Figure 3 for Single-Qubit Fidelity Assessment of Quantum Annealing Hardware

Figure 4 for Single-Qubit Fidelity Assessment of Quantum Annealing Hardware

Abstract:As a wide variety of quantum computing platforms become available, methods for assessing and comparing the performance of these devices are of increasing interest and importance. Inspired by the success of single-qubit error rate computations for tracking the progress of gate-based quantum computers, this work proposes a Quantum Annealing Single-qubit Assessment (QASA) protocol for quantifying the performance of individual qubits in quantum annealing computers. The proposed protocol scales to large quantum annealers with thousands of qubits and provides unique insights into the distribution of qubit properties within a particular hardware device. The efficacy of the QASA protocol is demonstrated by analyzing the properties of a D-Wave 2000Q system, revealing unanticipated correlations in the qubit performance of that device. A study repeating the QASA protocol at different annealing times highlights how the method can be utilized to understand the impact of annealing parameters on qubit performance. Overall, the proposed QASA protocol provides a useful tool for assessing the performance of current and emerging quantum annealing devices.

Via

Access Paper or Ask Questions

Exponential Reduction in Sample Complexity with Learning of Ising Model Dynamics

Apr 02, 2021

Arkopal Dutt, Andrey Y. Lokhov, Marc Vuffray, Sidhant Misra

Figure 1 for Exponential Reduction in Sample Complexity with Learning of Ising Model Dynamics

Figure 2 for Exponential Reduction in Sample Complexity with Learning of Ising Model Dynamics

Figure 3 for Exponential Reduction in Sample Complexity with Learning of Ising Model Dynamics

Figure 4 for Exponential Reduction in Sample Complexity with Learning of Ising Model Dynamics

Abstract:The usual setting for learning the structure and parameters of a graphical model assumes the availability of independent samples produced from the corresponding multivariate probability distribution. However, for many models the mixing time of the respective Markov chain can be very large and i.i.d. samples may not be obtained. We study the problem of reconstructing binary graphical models from correlated samples produced by a dynamical process, which is natural in many applications. We analyze the sample complexity of two estimators that are based on the interaction screening objective and the conditional likelihood loss. We observe that for samples coming from a dynamical process far from equilibrium, the sample complexity reduces exponentially compared to a dynamical process that mixes quickly.

Via

Access Paper or Ask Questions

Learning Continuous Exponential Families Beyond Gaussian

Feb 18, 2021

Christopher X. Ren, Sidhant Misra, Marc Vuffray, Andrey Y. Lokhov

Figure 1 for Learning Continuous Exponential Families Beyond Gaussian

Figure 2 for Learning Continuous Exponential Families Beyond Gaussian

Figure 3 for Learning Continuous Exponential Families Beyond Gaussian

Figure 4 for Learning Continuous Exponential Families Beyond Gaussian

Abstract:We address the problem of learning of continuous exponential family distributions with unbounded support. While a lot of progress has been made on learning of Gaussian graphical models, we are still lacking scalable algorithms for reconstructing general continuous exponential families modeling higher-order moments of the data beyond the mean and the covariance. Here, we introduce a computationally efficient method for learning continuous graphical models based on the Interaction Screening approach. Through a series of numerical experiments, we show that our estimator maintains similar requirements in terms of accuracy and sample complexity compared to alternative approaches such as maximization of conditional likelihood, while considerably improving upon the algorithm's run-time.

* 11 pages, 11 figures

Via

Access Paper or Ask Questions

Programmable Quantum Annealers as Noisy Gibbs Samplers

Dec 16, 2020

Marc Vuffray, Carleton Coffrin, Yaroslav A. Kharkov, Andrey Y. Lokhov

Figure 1 for Programmable Quantum Annealers as Noisy Gibbs Samplers

Figure 2 for Programmable Quantum Annealers as Noisy Gibbs Samplers

Figure 3 for Programmable Quantum Annealers as Noisy Gibbs Samplers

Figure 4 for Programmable Quantum Annealers as Noisy Gibbs Samplers

Abstract:Drawing independent samples from high-dimensional probability distributions represents the major computational bottleneck for modern algorithms, including powerful machine learning frameworks such as deep learning. The quest for discovering larger families of distributions for which sampling can be efficiently realized has inspired an exploration beyond established computing methods and turning to novel physical devices that leverage the principles of quantum computation. Quantum annealing embodies a promising computational paradigm that is intimately related to the complexity of energy landscapes in Gibbs distributions, which relate the probabilities of system states to the energies of these states. Here, we study the sampling properties of physical realizations of quantum annealers which are implemented through programmable lattices of superconducting flux qubits. Comprehensive statistical analysis of the data produced by these quantum machines shows that quantum annealers behave as samplers that generate independent configurations from low-temperature noisy Gibbs distributions. We show that the structure of the output distribution probes the intrinsic physical properties of the quantum device such as effective temperature of individual qubits and magnitude of local qubit noise, which result in a non-linear response function and spurious interactions that are absent in the hardware implementation. We anticipate that our methodology will find widespread use in characterization of future generations of quantum annealers and other emerging analog computing devices.

* 6 pages, 4 figures, with 36 pages of Supplementary Information

Via

Access Paper or Ask Questions