Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Kyle Cranmer

Hamiltonian Graph Networks with ODE Integrators

Sep 27, 2019

Alvaro Sanchez-Gonzalez, Victor Bapst, Kyle Cranmer, Peter Battaglia

Figure 1 for Hamiltonian Graph Networks with ODE Integrators

Figure 2 for Hamiltonian Graph Networks with ODE Integrators

Figure 3 for Hamiltonian Graph Networks with ODE Integrators

Figure 4 for Hamiltonian Graph Networks with ODE Integrators

Abstract:We introduce an approach for imposing physically informed inductive biases in learned simulation models. We combine graph networks with a differentiable ordinary differential equation integrator as a mechanism for predicting future states, and a Hamiltonian as an internal representation. We find that our approach outperforms baselines without these biases in terms of predictive accuracy, energy accuracy, and zero-shot generalization to time-step sizes and integrator orders not experienced during training. This advances the state-of-the-art of learned simulation, and in principle is applicable beyond physical domains.

Via

Access Paper or Ask Questions

MadMiner: Machine learning-based inference for particle physics

Jul 24, 2019

Johann Brehmer, Felix Kling, Irina Espejo, Kyle Cranmer

Figure 1 for MadMiner: Machine learning-based inference for particle physics

Figure 2 for MadMiner: Machine learning-based inference for particle physics

Figure 3 for MadMiner: Machine learning-based inference for particle physics

Figure 4 for MadMiner: Machine learning-based inference for particle physics

Abstract:The legacy measurements of the LHC will require analyzing high-dimensional event data for subtle kinematic signatures, which is challenging for established analysis methods. Recently, a powerful family of multivariate inference techniques that leverage both matrix element information and machine learning has been developed. This approach neither requires the reduction of high-dimensional data to summary statistics nor any simplifications to the underlying physics or detector response. In this paper we introduce MadMiner, a Python module that streamlines the steps involved in this procedure. Wrapping around MadGraph5_aMC and Pythia 8, it supports almost any physics process and model. To aid phenomenological studies, the tool also wraps around Delphes 3, though it is extendable to a full Geant4-based detector simulation. We demonstrate the use of MadMiner in an example analysis of dimension-six operators in ttH production, finding that the new techniques substantially increase the sensitivity to new physics.

* MadMiner is available at https://github.com/diana-hep/madminer

Via

Access Paper or Ask Questions

Etalumis: Bringing Probabilistic Programming to Scientific Simulators at Scale

Jul 08, 2019

Atılım Güneş Baydin, Lei Shao, Wahid Bhimji, Lukas Heinrich, Lawrence Meadows, Jialin Liu, Andreas Munk, Saeid Naderiparizi, Bradley Gram-Hansen, Gilles Louppe(+7 more)

Figure 1 for Etalumis: Bringing Probabilistic Programming to Scientific Simulators at Scale

Figure 2 for Etalumis: Bringing Probabilistic Programming to Scientific Simulators at Scale

Figure 3 for Etalumis: Bringing Probabilistic Programming to Scientific Simulators at Scale

Figure 4 for Etalumis: Bringing Probabilistic Programming to Scientific Simulators at Scale

Abstract:Probabilistic programming languages (PPLs) are receiving widespread attention for performing Bayesian inference in complex generative models. However, applications to science remain limited because of the impracticability of rewriting complex scientific simulators in a PPL, the computational cost of inference, and the lack of scalable implementations. To address these, we present a novel PPL framework that couples directly to existing scientific simulators through a cross-platform probabilistic execution protocol and provides Markov chain Monte Carlo (MCMC) and deep-learning-based inference compilation (IC) engines for tractable inference. To guide IC inference, we perform distributed training of a dynamic 3DCNN--LSTM architecture with a PyTorch-MPI-based framework on 1,024 32-core CPU nodes of the Cori supercomputer with a global minibatch size of 128k: achieving a performance of 450 Tflop/s through enhancements to PyTorch. We demonstrate a Large Hadron Collider (LHC) use-case with the C++ Sherpa simulator and achieve the largest-scale posterior inference in a Turing-complete PPL.

* 14 pages, 8 figures

Via

Access Paper or Ask Questions

Effective LHC measurements with matrix elements and machine learning

Jun 04, 2019

Johann Brehmer, Kyle Cranmer, Irina Espejo, Felix Kling, Gilles Louppe, Juan Pavez

Figure 1 for Effective LHC measurements with matrix elements and machine learning

Abstract:One major challenge for the legacy measurements at the LHC is that the likelihood function is not tractable when the collected data is high-dimensional and the detector response has to be modeled. We review how different analysis strategies solve this issue, including the traditional histogram approach used in most particle physics analyses, the Matrix Element Method, Optimal Observables, and modern techniques based on neural density estimation. We then discuss powerful new inference methods that use a combination of matrix element information and machine learning to accurately estimate the likelihood function. The MadMiner package automates all necessary data-processing steps. In first studies we find that these new techniques have the potential to substantially improve the sensitivity of the LHC legacy measurements.

* Keynote at the 19th International Workshop on Advanced Computing and Analysis Techniques in Physics Research (ACAT 2019)

Via

Access Paper or Ask Questions

Inferring the quantum density matrix with machine learning

Apr 11, 2019

Kyle Cranmer, Siavash Golkar, Duccio Pappadopulo

Figure 1 for Inferring the quantum density matrix with machine learning

Figure 2 for Inferring the quantum density matrix with machine learning

Figure 3 for Inferring the quantum density matrix with machine learning

Figure 4 for Inferring the quantum density matrix with machine learning

Abstract:We introduce two methods for estimating the density matrix for a quantum system: Quantum Maximum Likelihood and Quantum Variational Inference. In these methods, we construct a variational family to model the density matrix of a mixed quantum state. We also introduce quantum flows, the quantum analog of normalizing flows, which can be used to increase the expressivity of this variational family. The eigenstates and eigenvalues of interest are then derived by optimizing an appropriate loss function. The approach is qualitatively different than traditional lattice techniques that rely on the time dependence of correlation functions that summarize the lattice configurations. The resulting estimate of the density matrix can then be used to evaluate the expectation of an arbitrary operator, which opens the door to new possibilities.

* 12 pages, 3 figures

Via

Access Paper or Ask Questions

Mining gold from implicit models to improve likelihood-free inference

Oct 09, 2018

Johann Brehmer, Gilles Louppe, Juan Pavez, Kyle Cranmer

Figure 1 for Mining gold from implicit models to improve likelihood-free inference

Figure 2 for Mining gold from implicit models to improve likelihood-free inference

Figure 3 for Mining gold from implicit models to improve likelihood-free inference

Figure 4 for Mining gold from implicit models to improve likelihood-free inference

Abstract:Simulators often provide the best description of real-world phenomena. However, they also lead to challenging inverse problems because the density they implicitly define is often intractable. We present a new suite of simulation-based inference techniques that go beyond the traditional Approximate Bayesian Computation approach, which struggles in a high-dimensional setting, and extend methods that use surrogate models based on neural networks. We show that additional information, such as the joint likelihood ratio and the joint score, can often be extracted from simulators and used to augment the training data for these surrogate models. Finally, we demonstrate that these new techniques are more sample efficient and provide higher-fidelity inference than traditional methods.

* Code available at https://github.com/johannbrehmer/simulator-mining-example . v2: Fixed typos. v3: Expanded discussion, added Lotka-Volterra example

Via

Access Paper or Ask Questions

Adversarial Variational Optimization of Non-Differentiable Simulators

Oct 05, 2018

Gilles Louppe, Joeri Hermans, Kyle Cranmer

Figure 1 for Adversarial Variational Optimization of Non-Differentiable Simulators

Figure 2 for Adversarial Variational Optimization of Non-Differentiable Simulators

Figure 3 for Adversarial Variational Optimization of Non-Differentiable Simulators

Figure 4 for Adversarial Variational Optimization of Non-Differentiable Simulators

Abstract:Complex computer simulators are increasingly used across fields of science as generative models tying parameters of an underlying theory to experimental observations. Inference in this setup is often difficult, as simulators rarely admit a tractable density or likelihood function. We introduce Adversarial Variational Optimization (AVO), a likelihood-free inference algorithm for fitting a non-differentiable generative model incorporating ideas from generative adversarial networks, variational optimization and empirical Bayes. We adapt the training procedure of generative adversarial networks by replacing the differentiable generative network with a domain-specific simulator. We solve the resulting non-differentiable minimax problem by minimizing variational upper bounds of the two adversarial objectives. Effectively, the procedure results in learning a proposal distribution over simulator parameters, such that the JS divergence between the marginal distribution of the synthetic data and the empirical distribution of observed data is minimized. We evaluate and compare the method with simulators producing both discrete and continuous data.

Via

Access Paper or Ask Questions

Efficient Probabilistic Inference in the Quest for Physics Beyond the Standard Model

Sep 01, 2018

Atilim Gunes Baydin, Lukas Heinrich, Wahid Bhimji, Bradley Gram-Hansen, Gilles Louppe, Lei Shao, Prabhat, Kyle Cranmer, Frank Wood

Figure 1 for Efficient Probabilistic Inference in the Quest for Physics Beyond the Standard Model

Figure 2 for Efficient Probabilistic Inference in the Quest for Physics Beyond the Standard Model

Figure 3 for Efficient Probabilistic Inference in the Quest for Physics Beyond the Standard Model

Figure 4 for Efficient Probabilistic Inference in the Quest for Physics Beyond the Standard Model

Abstract:We present a novel framework that enables efficient probabilistic inference in large-scale scientific models by allowing the execution of existing domain-specific simulators as probabilistic programs, resulting in highly interpretable posterior inference. Our framework is general purpose and scalable, and is based on a cross-platform probabilistic execution protocol through which an inference engine can control simulators in a language-agnostic way. We demonstrate the technique in particle physics, on a scientifically accurate simulation of the tau lepton decay, which is a key ingredient in establishing the properties of the Higgs boson. High-energy physics has a rich set of simulators based on quantum field theory and the interaction of particles in matter. We show how to use probabilistic programming to perform Bayesian inference in these existing simulator codebases directly, in particular conditioning on observable outputs from a simulated particle detector to directly produce an interpretable posterior distribution over decay pathways. Inference efficiency is achieved via inference compilation where a deep recurrent neural network is trained to parameterize proposal distributions and control the stochastic simulator in a sequential importance sampling scheme, at a fraction of the computational cost of Markov chain Monte Carlo sampling.

* 18 pages, 5 figures

Via

Access Paper or Ask Questions

Likelihood-free inference with an improved cross-entropy estimator

Aug 02, 2018

Markus Stoye, Johann Brehmer, Gilles Louppe, Juan Pavez, Kyle Cranmer

Figure 1 for Likelihood-free inference with an improved cross-entropy estimator

Figure 2 for Likelihood-free inference with an improved cross-entropy estimator

Figure 3 for Likelihood-free inference with an improved cross-entropy estimator

Figure 4 for Likelihood-free inference with an improved cross-entropy estimator

Abstract:We extend recent work (Brehmer, et. al., 2018) that use neural networks as surrogate models for likelihood-free inference. As in the previous work, we exploit the fact that the joint likelihood ratio and joint score, conditioned on both observed and latent variables, can often be extracted from an implicit generative model or simulator to augment the training data for these surrogate models. We show how this augmented training data can be used to provide a new cross-entropy estimator, which provides improved sample efficiency compared to previous loss functions exploiting this augmented training data.

* 8 pages, 3 figures

Via

Access Paper or Ask Questions

A Guide to Constraining Effective Field Theories with Machine Learning

Jul 26, 2018

Johann Brehmer, Kyle Cranmer, Gilles Louppe, Juan Pavez

Figure 1 for A Guide to Constraining Effective Field Theories with Machine Learning

Figure 2 for A Guide to Constraining Effective Field Theories with Machine Learning

Figure 3 for A Guide to Constraining Effective Field Theories with Machine Learning

Figure 4 for A Guide to Constraining Effective Field Theories with Machine Learning

Abstract:We develop, discuss, and compare several inference techniques to constrain theory parameters in collider experiments. By harnessing the latent-space structure of particle physics processes, we extract extra information from the simulator. This augmented data can be used to train neural networks that precisely estimate the likelihood ratio. The new methods scale well to many observables and high-dimensional parameter spaces, do not require any approximations of the parton shower and detector response, and can be evaluated in microseconds. Using weak-boson-fusion Higgs production as an example process, we compare the performance of several techniques. The best results are found for likelihood ratio estimators trained with extra information about the score, the gradient of the log likelihood function with respect to the theory parameters. The score also provides sufficient statistics that contain all the information needed for inference in the neighborhood of the Standard Model. These methods enable us to put significantly stronger bounds on effective dimension-six operators than the traditional approach based on histograms. They also outperform generic machine learning methods that do not make use of the particle physics structure, demonstrating their potential to substantially improve the new physics reach of the LHC legacy results.

* Phys. Rev. D 98, 052004 (2018)
* See also the companion publication "Constraining Effective Field Theories with Machine Learning" at arXiv:1805.00013, a brief introduction presenting the key ideas. The code for these studies is available at https://github.com/johannbrehmer/higgs_inference . v2: Added references. v3: Improved description of algorithms, added references. v4: Clarified text, added references

Via

Access Paper or Ask Questions