Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Maurizio Pierini

Accelerating Recurrent Neural Networks for Gravitational Wave Experiments

Jun 26, 2021

Zhiqiang Que, Erwei Wang, Umar Marikar, Eric Moreno, Jennifer Ngadiuba, Hamza Javed, Bartłomiej Borzyszkowski, Thea Aarrestad, Vladimir Loncar, Sioni Summers(+3 more)

Figure 1 for Accelerating Recurrent Neural Networks for Gravitational Wave Experiments

Figure 2 for Accelerating Recurrent Neural Networks for Gravitational Wave Experiments

Figure 3 for Accelerating Recurrent Neural Networks for Gravitational Wave Experiments

Figure 4 for Accelerating Recurrent Neural Networks for Gravitational Wave Experiments

Abstract:This paper presents novel reconfigurable architectures for reducing the latency of recurrent neural networks (RNNs) that are used for detecting gravitational waves. Gravitational interferometers such as the LIGO detectors capture cosmic events such as black hole mergers which happen at unknown times and of varying durations, producing time-series data. We have developed a new architecture capable of accelerating RNN inference for analyzing time-series data from LIGO detectors. This architecture is based on optimizing the initiation intervals (II) in a multi-layer LSTM (Long Short-Term Memory) network, by identifying appropriate reuse factors for each layer. A customizable template for this architecture has been designed, which enables the generation of low-latency FPGA designs with efficient resource utilization using high-level synthesis tools. The proposed approach has been evaluated based on two LSTM models, targeting a ZYNQ 7045 FPGA and a U250 FPGA. Experimental results show that with balanced II, the number of DSPs can be reduced up to 42% while achieving the same IIs. When compared to other FPGA-based LSTM designs, our design can achieve about 4.92 to 12.4 times lower latency.

* Accepted at the 2021 32nd IEEE International Conference on Application-specific Systems, Architectures and Processors (ASAP)

Via

Access Paper or Ask Questions

Particle Cloud Generation with Message Passing Generative Adversarial Networks

Jun 22, 2021

Raghav Kansal, Javier Duarte, Hao Su, Breno Orzari, Thiago Tomei, Maurizio Pierini, Mary Touranakou, Jean-Roch Vlimant, Dimitrios Gunopulos

Figure 1 for Particle Cloud Generation with Message Passing Generative Adversarial Networks

Figure 2 for Particle Cloud Generation with Message Passing Generative Adversarial Networks

Figure 3 for Particle Cloud Generation with Message Passing Generative Adversarial Networks

Figure 4 for Particle Cloud Generation with Message Passing Generative Adversarial Networks

Abstract:In high energy physics (HEP), jets are collections of correlated particles produced ubiquitously in particle collisions such as those at the CERN Large Hadron Collider (LHC). Machine-learning-based generative models, such as generative adversarial networks (GANs), have the potential to significantly accelerate LHC jet simulations. However, despite jets having a natural representation as a set of particles in momentum-space, a.k.a. a particle cloud, to our knowledge there exist no generative models applied to such a dataset. We introduce a new particle cloud dataset (JetNet), and, due to similarities between particle and point clouds, apply to it existing point cloud GANs. Results are evaluated using (1) the 1-Wasserstein distance between high- and low-level feature distributions, (2) a newly developed Fr\'{e}chet ParticleNet Distance, and (3) the coverage and (4) minimum matching distance metrics. Existing GANs are found to be inadequate for physics applications, hence we develop a new message passing GAN (MPGAN), which outperforms existing point cloud GANs on virtually every metric and shows promise for use in HEP. We propose JetNet as a novel point-cloud-style dataset for the machine learning community to experiment with, and set MPGAN as a benchmark to improve upon for future generative models.

* 13 pages, 4 figures, 2 tables, and a 3 page appendix

Via

Access Paper or Ask Questions

A reconfigurable neural network ASIC for detector front-end data compression at the HL-LHC

May 04, 2021

Giuseppe Di Guglielmo, Farah Fahim, Christian Herwig, Manuel Blanco Valentin, Javier Duarte, Cristian Gingu, Philip Harris, James Hirschauer, Martin Kwok, Vladimir Loncar(+8 more)

Figure 1 for A reconfigurable neural network ASIC for detector front-end data compression at the HL-LHC

Figure 2 for A reconfigurable neural network ASIC for detector front-end data compression at the HL-LHC

Figure 3 for A reconfigurable neural network ASIC for detector front-end data compression at the HL-LHC

Figure 4 for A reconfigurable neural network ASIC for detector front-end data compression at the HL-LHC

Abstract:Despite advances in the programmable logic capabilities of modern trigger systems, a significant bottleneck remains in the amount of data to be transported from the detector to off-detector logic where trigger decisions are made. We demonstrate that a neural network autoencoder model can be implemented in a radiation tolerant ASIC to perform lossy data compression alleviating the data transmission problem while preserving critical information of the detector energy profile. For our application, we consider the high-granularity calorimeter from the CMS experiment at the CERN Large Hadron Collider. The advantage of the machine learning approach is in the flexibility and configurability of the algorithm. By changing the neural network weights, a unique data compression algorithm can be deployed for each sensor in different detector regions, and changing detector or collider conditions. To meet area, performance, and power constraints, we perform a quantization-aware training to create an optimized neural network hardware implementation. The design is achieved through the use of high-level synthesis tools and the hls4ml framework, and was processed through synthesis and physical layout flows based on a LP CMOS 65 nm technology node. The flow anticipates 200 Mrad of ionizing radiation to select gates, and reports a total area of 3.6 mm^2 and consumes 95 mW of power. The simulated energy consumption per inference is 2.4 nJ. This is the first radiation tolerant on-detector ASIC implementation of a neural network that has been designed for particle physics applications.

* 9 pages, 8 figures, 3 tables

Via

Access Paper or Ask Questions

hls4ml: An Open-Source Codesign Workflow to Empower Scientific Low-Power Machine Learning Devices

Mar 23, 2021

Farah Fahim, Benjamin Hawks, Christian Herwig, James Hirschauer, Sergo Jindariani, Nhan Tran, Luca P. Carloni, Giuseppe Di Guglielmo, Philip Harris, Jeffrey Krupa(+20 more)

Figure 1 for hls4ml: An Open-Source Codesign Workflow to Empower Scientific Low-Power Machine Learning Devices

Figure 2 for hls4ml: An Open-Source Codesign Workflow to Empower Scientific Low-Power Machine Learning Devices

Figure 3 for hls4ml: An Open-Source Codesign Workflow to Empower Scientific Low-Power Machine Learning Devices

Figure 4 for hls4ml: An Open-Source Codesign Workflow to Empower Scientific Low-Power Machine Learning Devices

Abstract:Accessible machine learning algorithms, software, and diagnostic tools for energy-efficient devices and systems are extremely valuable across a broad range of application domains. In scientific domains, real-time near-sensor processing can drastically improve experimental design and accelerate scientific discoveries. To support domain scientists, we have developed hls4ml, an open-source software-hardware codesign workflow to interpret and translate machine learning algorithms for implementation with both FPGA and ASIC technologies. We expand on previous hls4ml work by extending capabilities and techniques towards low-power implementations and increased usability: new Python APIs, quantization-aware pruning, end-to-end FPGA workflows, long pipeline kernels for low power, and new device backends include an ASIC workflow. Taken together, these and continued efforts in hls4ml will arm a new generation of domain scientists with accessible, efficient, and powerful tools for machine-learning-accelerated discovery.

* 10 pages, 8 figures, TinyML Research Symposium 2021

Via

Access Paper or Ask Questions

MLPF: Efficient machine-learned particle-flow reconstruction using graph neural networks

Jan 21, 2021

Joosep Pata, Javier Duarte, Jean-Roch Vlimant, Maurizio Pierini, Maria Spiropulu

Figure 1 for MLPF: Efficient machine-learned particle-flow reconstruction using graph neural networks

Figure 2 for MLPF: Efficient machine-learned particle-flow reconstruction using graph neural networks

Figure 3 for MLPF: Efficient machine-learned particle-flow reconstruction using graph neural networks

Figure 4 for MLPF: Efficient machine-learned particle-flow reconstruction using graph neural networks

Abstract:In general-purpose particle detectors, the particle flow algorithm may be used to reconstruct a coherent particle-level view of the event by combining information from the calorimeters and the trackers, significantly improving the detector resolution for jets and the missing transverse momentum. In view of the planned high-luminosity upgrade of the CERN Large Hadron Collider, it is necessary to revisit existing reconstruction algorithms and ensure that both the physics and computational performance are sufficient in a high-pileup environment. Recent developments in machine learning may offer a prospect for efficient event reconstruction based on parametric models. We introduce MLPF, an end-to-end trainable machine-learned particle flow algorithm for reconstructing particle flow candidates based on parallelizable, computationally efficient, scalable graph neural networks and a multi-task objective. We report the physics and computational performance of the MLPF algorithm on on a synthetic dataset of ttbar events in HL-LHC running conditions, including the simulation of multiple interaction effects, and discuss potential next steps and considerations towards ML-based reconstruction in a general purpose particle detector.

* 12 pages, 10 figures

Via

Access Paper or Ask Questions

Fast convolutional neural networks on FPGAs with hls4ml

Jan 13, 2021

Thea Aarrestad, Vladimir Loncar, Maurizio Pierini, Sioni Summers, Jennifer Ngadiuba, Christoffer Petersson, Hampus Linander, Yutaro Iiyama, Giuseppe Di Guglielmo, Javier Duarte(+9 more)

Figure 1 for Fast convolutional neural networks on FPGAs with hls4ml

Figure 2 for Fast convolutional neural networks on FPGAs with hls4ml

Figure 3 for Fast convolutional neural networks on FPGAs with hls4ml

Figure 4 for Fast convolutional neural networks on FPGAs with hls4ml

Abstract:We introduce an automated tool for deploying ultra low-latency, low-power deep neural networks with large convolutional layers on FPGAs. By extending the hls4ml library, we demonstrate how to achieve inference latency of $5\,\mu$s using convolutional architectures, while preserving state-of-the-art model performance. Considering benchmark models trained on the Street View House Numbers Dataset, we demonstrate various methods for model compression in order to fit the computational constraints of a typical FPGA device. In particular, we discuss pruning and quantization-aware training, and demonstrate how resource utilization can be reduced by over 90% while maintaining the original model accuracy.

* 18 pages, 16 figures, 3 tables

Via

Access Paper or Ask Questions

Graph Generative Adversarial Networks for Sparse Data Generation in High Energy Physics

Dec 08, 2020

Raghav Kansal, Javier Duarte, Breno Orzari, Thiago Tomei, Maurizio Pierini, Mary Touranakou, Jean-Roch Vlimant, Dimitrios Gunopoulos

Figure 1 for Graph Generative Adversarial Networks for Sparse Data Generation in High Energy Physics

Figure 2 for Graph Generative Adversarial Networks for Sparse Data Generation in High Energy Physics

Figure 3 for Graph Generative Adversarial Networks for Sparse Data Generation in High Energy Physics

Figure 4 for Graph Generative Adversarial Networks for Sparse Data Generation in High Energy Physics

Abstract:We develop a graph generative adversarial network to generate sparse data sets like those produced at the CERN Large Hadron Collider (LHC). We demonstrate this approach by training on and generating sparse representations of MNIST handwritten digit images and jets of particles in proton-proton collisions like those at the LHC. We find the model successfully generates sparse MNIST digits and particle jet data. We quantify agreement between real and generated data with a graph-based Fr\'echet Inception distance, and the particle and jet feature-level 1-Wasserstein distance for the MNIST and jet datasets respectively.

* 9 pages, 4 figures, 4 tables, To appear in Third Workshop on Machine Learning and the Physical Sciences (NeurIPS 2020)

Via

Access Paper or Ask Questions

Accelerated Charged Particle Tracking with Graph Neural Networks on FPGAs

Nov 30, 2020

Aneesh Heintz, Vesal Razavimaleki, Javier Duarte, Gage DeZoort, Isobel Ojalvo, Savannah Thais, Markus Atkinson, Mark Neubauer, Lindsey Gray, Sergo Jindariani(+11 more)

Figure 1 for Accelerated Charged Particle Tracking with Graph Neural Networks on FPGAs

Figure 2 for Accelerated Charged Particle Tracking with Graph Neural Networks on FPGAs

Figure 3 for Accelerated Charged Particle Tracking with Graph Neural Networks on FPGAs

Figure 4 for Accelerated Charged Particle Tracking with Graph Neural Networks on FPGAs

Abstract:We develop and study FPGA implementations of algorithms for charged particle tracking based on graph neural networks. The two complementary FPGA designs are based on OpenCL, a framework for writing programs that execute across heterogeneous platforms, and hls4ml, a high-level-synthesis-based compiler for neural network to firmware conversion. We evaluate and compare the resource usage, latency, and tracking performance of our implementations based on a benchmark dataset. We find a considerable speedup over CPU-based execution is possible, potentially enabling such algorithms to be used effectively in future computing workflows and the FPGA-based Level-1 trigger at the CERN Large Hadron Collider.

* 8 pages, 4 figures, To appear in Third Workshop on Machine Learning and the Physical Sciences (NeurIPS 2020)

Via

Access Paper or Ask Questions

Anomaly Detection With Conditional Variational Autoencoders

Oct 12, 2020

Adrian Alan Pol, Victor Berger, Gianluca Cerminara, Cecile Germain, Maurizio Pierini

Figure 1 for Anomaly Detection With Conditional Variational Autoencoders

Figure 2 for Anomaly Detection With Conditional Variational Autoencoders

Figure 3 for Anomaly Detection With Conditional Variational Autoencoders

Figure 4 for Anomaly Detection With Conditional Variational Autoencoders

Abstract:Exploiting the rapid advances in probabilistic inference, in particular variational Bayes and variational autoencoders (VAEs), for anomaly detection (AD) tasks remains an open research question. Previous works argued that training VAE models only with inliers is insufficient and the framework should be significantly modified in order to discriminate the anomalous instances. In this work, we exploit the deep conditional variational autoencoder (CVAE) and we define an original loss function together with a metric that targets hierarchically structured data AD. Our motivating application is a real world problem: monitoring the trigger system which is a basic component of many particle physics experiments at the CERN Large Hadron Collider (LHC). In the experiments we show the superior performance of this method for classical machine learning (ML) benchmarks and for our application.

* Presented at ICMLA 2019

Via

Access Paper or Ask Questions

Data Augmentation at the LHC through Analysis-specific Fast Simulation with Deep Learning

Oct 05, 2020

Cheng Chen, Olmo Cerri, Thong Q. Nguyen, Jean-Roch Vlimant, Maurizio Pierini

Figure 1 for Data Augmentation at the LHC through Analysis-specific Fast Simulation with Deep Learning

Figure 2 for Data Augmentation at the LHC through Analysis-specific Fast Simulation with Deep Learning

Figure 3 for Data Augmentation at the LHC through Analysis-specific Fast Simulation with Deep Learning

Figure 4 for Data Augmentation at the LHC through Analysis-specific Fast Simulation with Deep Learning

Abstract:We present a fast simulation application based on a Deep Neural Network, designed to create large analysis-specific datasets. Taking as an example the generation of W+jet events produced in sqrt(s)= 13 TeV proton-proton collisions, we train a neural network to model detector resolution effects as a transfer function acting on an analysis-specific set of relevant features, computed at generation level, i.e., in absence of detector effects. Based on this model, we propose a novel fast-simulation workflow that starts from a large amount of generator-level events to deliver large analysis-specific samples. The adoption of this approach would result in about an order-of-magnitude reduction in computing and storage requirements for the collision simulation workflow. This strategy could help the high energy physics community to face the computing challenges of the future High-Luminosity LHC.

* 15 pages, 12 figures

Via

Access Paper or Ask Questions