Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Tobias Golling

Flows for Flows: Morphing one Dataset into another with Maximum Likelihood Estimation

Sep 12, 2023

Tobias Golling, Samuel Klein, Radha Mastandrea, Benjamin Nachman, John Andrew Raine

Figure 1 for Flows for Flows: Morphing one Dataset into another with Maximum Likelihood Estimation

Figure 2 for Flows for Flows: Morphing one Dataset into another with Maximum Likelihood Estimation

Figure 3 for Flows for Flows: Morphing one Dataset into another with Maximum Likelihood Estimation

Figure 4 for Flows for Flows: Morphing one Dataset into another with Maximum Likelihood Estimation

Abstract:Many components of data analysis in high energy physics and beyond require morphing one dataset into another. This is commonly solved via reweighting, but there are many advantages of preserving weights and shifting the data points instead. Normalizing flows are machine learning models with impressive precision on a variety of particle physics tasks. Naively, normalizing flows cannot be used for morphing because they require knowledge of the probability density of the starting dataset. In most cases in particle physics, we can generate more examples, but we do not know densities explicitly. We propose a protocol called flows for flows for training normalizing flows to morph one dataset into another even if the underlying probability density of neither dataset is known explicitly. This enables a morphing strategy trained with maximum likelihood estimation, a setup that has been shown to be highly effective in related tasks. We study variations on this protocol to explore how far the data points are moved to statistically match the two datasets. Furthermore, we show how to condition the learned flows on particular features in order to create a morphing function for every value of the conditioning feature. For illustration, we demonstrate flows for flows for toy examples as well as a collider physics example involving dijet events

* 15 pages, 17 figures. This work is a merger of arXiv:2211.02487 and arXiv:2212.06155

Via

Access Paper or Ask Questions

$ν^2$-Flows: Fast and improved neutrino reconstruction in multi-neutrino final states with conditional normalizing flows

Jul 20, 2023

John Andrew Raine, Matthew Leigh, Knut Zoch, Tobias Golling

Figure 1 for $ν^2$-Flows: Fast and improved neutrino reconstruction in multi-neutrino final states with conditional normalizing flows

Figure 2 for $ν^2$-Flows: Fast and improved neutrino reconstruction in multi-neutrino final states with conditional normalizing flows

Figure 3 for $ν^2$-Flows: Fast and improved neutrino reconstruction in multi-neutrino final states with conditional normalizing flows

Figure 4 for $ν^2$-Flows: Fast and improved neutrino reconstruction in multi-neutrino final states with conditional normalizing flows

Abstract:In this work we introduce $\nu^2$-Flows, an extension of the $\nu$-Flows method to final states containing multiple neutrinos. The architecture can natively scale for all combinations of object types and multiplicities in the final state for any desired neutrino multiplicities. In $t\bar{t}$ dilepton events, the momenta of both neutrinos and correlations between them are reconstructed more accurately than when using the most popular standard analytical techniques, and solutions are found for all events. Inference time is significantly faster than competing methods, and can be reduced further by evaluating in parallel on graphics processing units. We apply $\nu^2$-Flows to $t\bar{t}$ dilepton events and show that the per-bin uncertainties in unfolded distributions is much closer to the limit of performance set by perfect neutrino reconstruction than standard techniques. For the chosen double differential observables $\nu^2$-Flows results in improved statistical precision for each bin by a factor of 1.5 to 2 in comparison to the Neutrino Weighting method and up to a factor of four in comparison to the Ellipse approach.

* 20 pages, 16 figures, 5 tables

Via

Access Paper or Ask Questions

PC-Droid: Faster diffusion and improved quality for particle cloud generation

Jul 14, 2023

Matthew Leigh, Debajyoti Sengupta, John Andrew Raine, Guillaume Quétant, Tobias Golling

Abstract:Building on the success of PC-JeDi we introduce PC-Droid, a substantially improved diffusion model for the generation of jet particle clouds. By leveraging a new diffusion formulation, studying more recent integration solvers, and training on all jet types simultaneously, we are able to achieve state-of-the-art performance for all types of jets across all evaluation metrics. We study the trade-off between generation speed and quality by comparing two attention based architectures, as well as the potential of consistency distillation to reduce the number of diffusion steps. Both the faster architecture and consistency models demonstrate performance surpassing many competing models, with generation time up to two orders of magnitude faster than PC-JeDi.

* 20 pages, 6 tables, 13 figures

Via

Access Paper or Ask Questions

Decorrelation using Optimal Transport

Jul 14, 2023

Malte Algren, John Andrew Raine, Tobias Golling

Abstract:Being able to decorrelate a feature space from protected attributes is an area of active research and study in ethics, fairness, and also natural sciences. We introduce a novel decorrelation method using Convex Neural Optimal Transport Solvers (Cnots) that is able to decorrelate a continuous feature space against protected attributes with optimal transport. We demonstrate how well it performs in the context of jet classification in high energy physics, where classifier scores are desired to be decorrelated from the mass of a jet. The decorrelation achieved in binary classification approaches the levels achieved by the state-of-the-art using conditional normalising flows. When moving to multiclass outputs the optimal transport approach performs significantly better than the state-of-the-art, suggesting substantial gains at decorrelating multidimensional feature spaces.

Via

Access Paper or Ask Questions

CURTAINs Flows For Flows: Constructing Unobserved Regions with Maximum Likelihood Estimation

May 08, 2023

Debajyoti Sengupta, Samuel Klein, John Andrew Raine, Tobias Golling

Figure 1 for CURTAINs Flows For Flows: Constructing Unobserved Regions with Maximum Likelihood Estimation

Figure 2 for CURTAINs Flows For Flows: Constructing Unobserved Regions with Maximum Likelihood Estimation

Figure 3 for CURTAINs Flows For Flows: Constructing Unobserved Regions with Maximum Likelihood Estimation

Figure 4 for CURTAINs Flows For Flows: Constructing Unobserved Regions with Maximum Likelihood Estimation

Abstract:Model independent techniques for constructing background data templates using generative models have shown great promise for use in searches for new physics processes at the LHC. We introduce a major improvement to the CURTAINs method by training the conditional normalizing flow between two side-band regions using maximum likelihood estimation instead of an optimal transport loss. The new training objective improves the robustness and fidelity of the transformed data and is much faster and easier to train. We compare the performance against the previous approach and the current state of the art using the LHC Olympics anomaly detection dataset, where we see a significant improvement in sensitivity over the original CURTAINs method. Furthermore, CURTAINsF4F requires substantially less computational resources to cover a large number of signal regions than other fully data driven approaches. When using an efficient configuration, an order of magnitude more models can be trained in the same time required for ten signal regions, without a significant drop in performance.

* 19 pages, 10 figures, 4 tables

Via

Access Paper or Ask Questions

Flow Away your Differences: Conditional Normalizing Flows as an Improvement to Reweighting

Apr 28, 2023

Malte Algren, Tobias Golling, Manuel Guth, Chris Pollard, John Andrew Raine

Figure 1 for Flow Away your Differences: Conditional Normalizing Flows as an Improvement to Reweighting

Figure 2 for Flow Away your Differences: Conditional Normalizing Flows as an Improvement to Reweighting

Figure 3 for Flow Away your Differences: Conditional Normalizing Flows as an Improvement to Reweighting

Figure 4 for Flow Away your Differences: Conditional Normalizing Flows as an Improvement to Reweighting

Abstract:We present an alternative to reweighting techniques for modifying distributions to account for a desired change in an underlying conditional distribution, as is often needed to correct for mis-modelling in a simulated sample. We employ conditional normalizing flows to learn the full conditional probability distribution from which we sample new events for conditional values drawn from the target distribution to produce the desired, altered distribution. In contrast to common reweighting techniques, this procedure is independent of binning choice and does not rely on an estimate of the density ratio between two distributions. In several toy examples we show that normalizing flows outperform reweighting approaches to match the distribution of the target.We demonstrate that the corrected distribution closes well with the ground truth, and a statistical uncertainty on the training dataset can be ascertained with bootstrapping. In our examples, this leads to a statistical precision up to three times greater than using reweighting techniques with identical sample sizes for the source and target distributions. We also explore an application in the context of high energy particle physics.

* 21 pages, 9 figures

Via

Access Paper or Ask Questions

Topological Reconstruction of Particle Physics Processes using Graph Neural Networks

Mar 27, 2023

Lukas Ehrke, John Andrew Raine, Knut Zoch, Manuel Guth, Tobias Golling

Figure 1 for Topological Reconstruction of Particle Physics Processes using Graph Neural Networks

Figure 2 for Topological Reconstruction of Particle Physics Processes using Graph Neural Networks

Figure 3 for Topological Reconstruction of Particle Physics Processes using Graph Neural Networks

Figure 4 for Topological Reconstruction of Particle Physics Processes using Graph Neural Networks

Abstract:We present a new approach, the Topograph, which reconstructs underlying physics processes, including the intermediary particles, by leveraging underlying priors from the nature of particle physics decays and the flexibility of message passing graph neural networks. The Topograph not only solves the combinatoric assignment of observed final state objects, associating them to their original mother particles, but directly predicts the properties of intermediate particles in hard scatter processes and their subsequent decays. In comparison to standard combinatoric approaches or modern approaches using graph neural networks, which scale exponentially or quadratically, the complexity of Topographs scales linearly with the number of reconstructed objects. We apply Topographs to top quark pair production in the all hadronic decay channel, where we outperform the standard approach and match the performance of the state-of-the-art machine learning technique.

* 24 pages, 24 figures, 7 tables

Via

Access Paper or Ask Questions

PC-JeDi: Diffusion for Particle Cloud Generation in High Energy Physics

Mar 09, 2023

Matthew Leigh, Debajyoti Sengupta, Guillaume Quétant, John Andrew Raine, Knut Zoch, Tobias Golling

Figure 1 for PC-JeDi: Diffusion for Particle Cloud Generation in High Energy Physics

Figure 2 for PC-JeDi: Diffusion for Particle Cloud Generation in High Energy Physics

Figure 3 for PC-JeDi: Diffusion for Particle Cloud Generation in High Energy Physics

Figure 4 for PC-JeDi: Diffusion for Particle Cloud Generation in High Energy Physics

Abstract:In this paper, we present a new method to efficiently generate jets in High Energy Physics called PC-JeDi. This method utilises score-based diffusion models in conjunction with transformers which are well suited to the task of generating jets as particle clouds due to their permutation equivariance. PC-JeDi achieves competitive performance with current state-of-the-art methods across several metrics that evaluate the quality of the generated jets. Although slower than other models, due to the large number of forward passes required by diffusion models, it is still substantially faster than traditional detailed simulation. Furthermore, PC-JeDi uses conditional generation to produce jets with a desired mass and transverse momentum for two different particles, top quarks and gluons.

* 29 pages, 25 figures, 5 tables

Via

Access Paper or Ask Questions

Decorrelation with conditional normalizing flows

Nov 10, 2022

Samuel Klein, Tobias Golling

Abstract:The sensitivity of many physics analyses can be enhanced by constructing discriminants that preferentially select signal events. Such discriminants become much more useful if they are uncorrelated with a set of protected attributes. In this paper we show that a normalizing flow conditioned on the protected attributes can be used to find a decorrelated representation for any discriminant. As a normalizing flow is invertible the separation power of the resulting discriminant will be unchanged at any fixed value of the protected attributes. We demonstrate the efficacy of our approach by building supervised jet taggers that produce almost no sculpting in the mass distribution of the background.

Via

Access Paper or Ask Questions

Flows for Flows: Training Normalizing Flows Between Arbitrary Distributions with Maximum Likelihood Estimation

Nov 04, 2022

Samuel Klein, John Andrew Raine, Tobias Golling

Abstract:Normalizing flows are constructed from a base distribution with a known density and a diffeomorphism with a tractable Jacobian. The base density of a normalizing flow can be parameterised by a different normalizing flow, thus allowing maps to be found between arbitrary distributions. We demonstrate and explore the utility of this approach and show it is particularly interesting in the case of conditional normalizing flows and for introducing optimal transport constraints on maps that are constructed using normalizing flows.

Via

Access Paper or Ask Questions