Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Aashwin Mishra

A Start To End Machine Learning Approach To Maximize Scientific Throughput From The LCLS-II-HE

May 29, 2025

Aashwin Mishra, Matt Seaberg, Ryan Roussel, Fred Poitevin, Jana Thayer, Daniel Ratner, Auralee Edelen, Apurva Mehta

Abstract:With the increasing brightness of Light sources, including the Diffraction-Limited brightness upgrade of APS and the high-repetition-rate upgrade of LCLS, the proposed experiments therein are becoming increasingly complex. For instance, experiments at LCLS-II-HE will require the X-ray beam to be within a fraction of a micron in diameter, with pointing stability of a few nanoradians, at the end of a kilometer-long electron accelerator, a hundred-meter-long undulator section, and tens of meters long X-ray optics. This enhancement of brightness will increase the data production rate to rival the largest data generators in the world. Without real-time active feedback control and an optimized pipeline to transform measurements to scientific information and insights, researchers will drown in a deluge of mostly useless data, and fail to extract the highly sophisticated insights that the recent brightness upgrades promise. In this article, we outline the strategy we are developing at SLAC to implement Machine Learning driven optimization, automation and real-time knowledge extraction from the electron-injector at the start of the electron accelerator, to the multidimensional X-ray optical systems, and till the experimental endstations and the high readout rate, multi-megapixel detectors at LCLS to deliver the design performance to the users. This is illustrated via examples from Accelerator, Optics and End User applications.

Via

Access Paper or Ask Questions

Generalizing Stochastic Smoothing for Differentiation and Gradient Estimation

Oct 10, 2024

Felix Petersen, Christian Borgelt, Aashwin Mishra, Stefano Ermon

Figure 1 for Generalizing Stochastic Smoothing for Differentiation and Gradient Estimation

Figure 2 for Generalizing Stochastic Smoothing for Differentiation and Gradient Estimation

Figure 3 for Generalizing Stochastic Smoothing for Differentiation and Gradient Estimation

Figure 4 for Generalizing Stochastic Smoothing for Differentiation and Gradient Estimation

Abstract:We deal with the problem of gradient estimation for stochastic differentiable relaxations of algorithms, operators, simulators, and other non-differentiable functions. Stochastic smoothing conventionally perturbs the input of a non-differentiable function with a differentiable density distribution with full support, smoothing it and enabling gradient estimation. Our theory starts at first principles to derive stochastic smoothing with reduced assumptions, without requiring a differentiable density nor full support, and we present a general framework for relaxation and gradient estimation of non-differentiable black-box functions $f:\mathbb{R}^n\to\mathbb{R}^m$. We develop variance reduction for gradient estimation from 3 orthogonal perspectives. Empirically, we benchmark 6 distributions and up to 24 variance reduction strategies for differentiable sorting and ranking, differentiable shortest-paths on graphs, differentiable rendering for pose estimation, as well as differentiable cryo-ET simulations.

Via

Access Paper or Ask Questions

Uncertainty Quantification via Stable Distribution Propagation

Feb 13, 2024

Felix Petersen, Aashwin Mishra, Hilde Kuehne, Christian Borgelt, Oliver Deussen, Mikhail Yurochkin

Figure 1 for Uncertainty Quantification via Stable Distribution Propagation

Figure 2 for Uncertainty Quantification via Stable Distribution Propagation

Figure 3 for Uncertainty Quantification via Stable Distribution Propagation

Figure 4 for Uncertainty Quantification via Stable Distribution Propagation

Abstract:We propose a new approach for propagating stable probability distributions through neural networks. Our method is based on local linearization, which we show to be an optimal approximation in terms of total variation distance for the ReLU non-linearity. This allows propagating Gaussian and Cauchy input uncertainties through neural networks to quantify their output uncertainties. To demonstrate the utility of propagating distributions, we apply the proposed method to predicting calibrated confidence intervals and selective prediction on out-of-distribution data. The results demonstrate a broad applicability of propagating distributions and show the advantages of our method over other approaches such as moment matching.

* Published at ICLR 2024, Code @ https://github.com/Felix-Petersen/distprop

Via

Access Paper or Ask Questions