Abstract: Terrain-following coordinates in atmospheric models often imprint their grid structure onto the solution, particularly over steep topography, where distorted coordinate layers can generate spurious horizontal and vertical motion. Standard formulations, such as hybrid or SLEVE coordinates, mitigate these errors by using analytic decay functions controlled by heuristic scale parameters that are typically tuned by hand and fixed a priori. In this work, we propose a framework to define a parametric vertical coordinate system as a learnable component within a differentiable dynamical core. We develop an end-to-end differentiable numerical solver for the two-dimensional non-hydrostatic Euler equations on an Arakawa C-grid, and introduce a NEUral Vertical Enhancement (NEUVE) terrain-following coordinate based on an integral-transformed neural network that guarantees monotonicity. A key feature of our approach is the use of automatic differentiation to compute exact geometric metric terms, thereby eliminating truncation errors associated with finite-difference coordinate derivatives. By coupling simulation errors through the time integration to the parameterization, our formulation finds a grid structure optimized for both the underlying physics and numerics. Using several standard tests, we demonstrate that these learned coordinates reduce the mean squared error by a factor of 1.4 to 2 in non-linear statistical benchmarks, and eliminate spurious vertical velocity striations over steep topography.
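To make the construction concrete, the following minimal PyTorch sketch shows one way to build a monotone, learnable vertical coordinate by integrating a positive network output, and to obtain the metric term dz/dη with automatic differentiation rather than finite differences. The quadrature resolution, network size, and the simple terrain blending z = h + (H − h)·M(η) are illustrative assumptions, not the exact NEUVE formulation.
```python
import torch
import torch.nn as nn

class MonotoneMap(nn.Module):
    """Map eta in [0, 1] to [0, 1] monotonically by integrating a positive NN output."""
    def __init__(self, hidden=32, n_quad=64):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(1, hidden), nn.Tanh(), nn.Linear(hidden, 1))
        self.n_quad = n_quad

    def forward(self, eta):                                    # eta: (N,) in [0, 1]
        s = torch.linspace(0.0, 1.0, self.n_quad)
        f = torch.nn.functional.softplus(self.net(s.unsqueeze(-1))).squeeze(-1)  # strictly positive
        cdf = torch.cumsum(f, dim=0)
        cdf = (cdf - cdf[0]) / (cdf[-1] - cdf[0])              # M(0) = 0, M(1) = 1, monotone
        idx = torch.clamp((eta.detach() * (self.n_quad - 1)).long(), 0, self.n_quad - 2)
        w = eta * (self.n_quad - 1) - idx.to(eta.dtype)        # linear-interpolation weight
        return (1 - w) * cdf[idx] + w * cdf[idx + 1]

def physical_height(eta, terrain_height, model_top, mono):
    """z(eta) = h + (H - h) * M(eta): terrain at eta = 0, flat model top at eta = 1."""
    return terrain_height + (model_top - terrain_height) * mono(eta)

mono = MonotoneMap()
eta = torch.linspace(0.0, 1.0, 41, requires_grad=True)
z = physical_height(eta, terrain_height=500.0, model_top=10_000.0, mono=mono)
# exact metric term dz/deta via automatic differentiation, with no finite-difference truncation error
dz_deta, = torch.autograd.grad(z.sum(), eta, create_graph=True)
```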
Abstract: Predicting the future citation rates of academic papers is an important step toward the automation of research evaluation and the acceleration of scientific progress. We present $\textbf{ForeCite}$, a simple but powerful framework that augments pre-trained causal language models with a linear head for predicting average monthly citation rates. Adapting transformers to this regression task, ForeCite achieves a test correlation of $\rho = 0.826$ on a curated dataset of 900K+ biomedical papers published between 2000 and 2024, a 27-point improvement over the previous state of the art. A comprehensive scaling-law analysis reveals consistent gains across model sizes and data volumes, while temporal holdout experiments confirm practical robustness. Gradient-based saliency heatmaps suggest a potentially undue reliance on title and abstract text. These results establish a new state of the art in forecasting the long-term influence of academic research and lay the groundwork for automated, high-fidelity evaluation of scientific contributions.
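A minimal sketch of the core idea, attaching a linear regression head to a pre-trained causal language model, is given below. The GPT-2 backbone, last-token pooling, and MSE objective are assumptions made for illustration, not necessarily ForeCite's exact configuration.
```python
import torch
import torch.nn as nn
from transformers import AutoModel, AutoTokenizer

class CitationRegressor(nn.Module):
    def __init__(self, backbone_name="gpt2"):
        super().__init__()
        self.backbone = AutoModel.from_pretrained(backbone_name)
        self.head = nn.Linear(self.backbone.config.hidden_size, 1)   # scalar citation rate

    def forward(self, input_ids, attention_mask):
        hidden = self.backbone(input_ids=input_ids,
                               attention_mask=attention_mask).last_hidden_state
        # pool the hidden state of the last non-padding token of each sequence
        last_idx = attention_mask.sum(dim=1) - 1
        pooled = hidden[torch.arange(hidden.size(0)), last_idx]
        return self.head(pooled).squeeze(-1)

tok = AutoTokenizer.from_pretrained("gpt2")
tok.pad_token = tok.eos_token                       # GPT-2 has no pad token by default
model = CitationRegressor()
batch = tok(["Title. Abstract text ..."], return_tensors="pt", padding=True, truncation=True)
pred = model(batch["input_ids"], batch["attention_mask"])   # predicted avg monthly citations
loss = nn.functional.mse_loss(pred, torch.tensor([1.7]))    # toy regression target
```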
Abstract: We demonstrate that generative deep learning can translate galaxy observations across ultraviolet, visible, and infrared photometric bands. Leveraging mock observations from the Illustris simulations, we develop and validate a supervised image-to-image model capable of performing both band interpolation and extrapolation. The resulting trained models exhibit high fidelity in generating outputs, as verified by both general image comparison metrics (MAE, SSIM, PSNR) and specialized astronomical metrics (Gini coefficient, M20). Moreover, we show that our model can be used to predict real-world observations, using data from the DECaLS survey as a case study. These findings highlight the potential of generative learning to augment astronomical datasets, enabling efficient exploration of multi-band information in regions where observations are incomplete. This work opens new pathways for optimizing mission planning, guiding high-resolution follow-ups, and enhancing our understanding of galaxy morphology and evolution.
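As a rough illustration of supervised band-to-band translation, the toy encoder-decoder below maps an image in one photometric band to a prediction in another; the architecture and the L1 (MAE) objective are placeholders for the actual model used in the paper.
```python
import torch
import torch.nn as nn

class BandTranslator(nn.Module):
    def __init__(self, in_bands=1, out_bands=1):
        super().__init__()
        self.encode = nn.Sequential(
            nn.Conv2d(in_bands, 32, 4, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(32, 64, 4, stride=2, padding=1), nn.ReLU(),
        )
        self.decode = nn.Sequential(
            nn.ConvTranspose2d(64, 32, 4, stride=2, padding=1), nn.ReLU(),
            nn.ConvTranspose2d(32, out_bands, 4, stride=2, padding=1),
        )

    def forward(self, x):                  # x: (B, in_bands, H, W), e.g. a visible-band image
        return self.decode(self.encode(x))

model = BandTranslator()
g_band = torch.randn(4, 1, 64, 64)          # mock input band
ir_pred = model(g_band)                     # predicted image in a target (e.g. infrared) band
loss = nn.functional.l1_loss(ir_pred, torch.randn_like(ir_pred))   # MAE against a mock target
```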
Abstract: In recent years, the study of deep learning for solving differential equations has grown substantially. Physics-informed neural networks (PINNs) and deep operator networks (DeepONets) have emerged as two of the most useful approaches for approximating differential equation solutions using machine learning. Here, we propose PinnDE, an open-source Python library for solving differential equations with both PINNs and DeepONets. We give a brief review of both PINNs and DeepONets, introduce PinnDE along with the structure and usage of the package, and present worked examples to show PinnDE's effectiveness in approximating solutions with both approaches.
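For readers unfamiliar with PINNs, the plain-PyTorch sketch below shows the basic idea the library builds on, a collocation residual plus an initial-condition loss for the simple ODE u'(t) = -u(t), u(0) = 1. It deliberately does not use PinnDE's own API.
```python
import torch
import torch.nn as nn

net = nn.Sequential(nn.Linear(1, 32), nn.Tanh(), nn.Linear(32, 32), nn.Tanh(), nn.Linear(32, 1))
opt = torch.optim.Adam(net.parameters(), lr=1e-3)

for step in range(2000):
    t = torch.rand(128, 1, requires_grad=True)              # collocation points in [0, 1]
    u = net(t)
    du_dt, = torch.autograd.grad(u.sum(), t, create_graph=True)
    residual = du_dt + u                                     # enforce u' + u = 0
    ic = net(torch.zeros(1, 1)) - 1.0                        # enforce u(0) = 1
    loss = (residual ** 2).mean() + (ic ** 2).mean()
    opt.zero_grad(); loss.backward(); opt.step()

print(float(net(torch.tensor([[1.0]]))))                     # should approach exp(-1) ≈ 0.368
```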
Abstract: We introduce a method for training exactly conservative physics-informed neural networks and physics-informed deep operator networks for dynamical systems. For any dynamical system possessing at least one first integral, the method employs a projection-based technique that maps the candidate solution learned by the neural-network solver onto the corresponding invariant manifold. We illustrate that exactly conservative physics-informed neural network solvers and physics-informed deep operator networks vastly outperform their non-conservative counterparts on several real-world problems from the mathematical sciences.
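The projection idea can be sketched for the harmonic oscillator, whose energy is a first integral. The single Newton-type step shown here is one common way to move a prediction onto the level set of the invariant; it is an illustrative choice, not necessarily the paper's exact construction.
```python
import torch

def first_integral(x):                    # x = (q, p); H = (q^2 + p^2) / 2
    return 0.5 * (x ** 2).sum(dim=-1)

def project_onto_level_set(x_pred, target_value):
    """One Newton-type step toward {I(x) = target_value} along the gradient of I."""
    x = x_pred.detach().clone().requires_grad_(True)
    defect = first_integral(x) - target_value
    grad, = torch.autograd.grad(defect.sum(), x)
    step = defect.unsqueeze(-1) * grad / (grad ** 2).sum(dim=-1, keepdim=True)
    return x_pred - step                  # gradients w.r.t. x_pred are preserved

x0 = torch.tensor([1.0, 0.0])                                          # initial condition fixes I
x_candidate = torch.tensor([[0.70, 0.73], [0.10, 1.02]], requires_grad=True)  # e.g. PINN output
x_conserved = project_onto_level_set(x_candidate, first_integral(x0))
print(first_integral(x_conserved))                                     # ≈ 0.5 for every sample
```
Iterating this step to convergence (or using a closed-form projection when the integral is quadratic, as here) drives the conservation defect to machine precision.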
Abstract: Current physics-informed neural networks (standard or operator variants) still rely on accurately learning the initial conditions of the system they are solving. In contrast, standard numerical methods evolve such initial conditions without needing to learn them. In this study, we propose to improve current physics-informed deep learning strategies so that initial conditions do not need to be learned and are instead represented exactly in the predicted solution. Moreover, this method guarantees that when a DeepONet is applied repeatedly to time-step a solution, the resulting function is continuous.
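One standard way to build the initial condition into the prediction exactly is an ansatz of the form u(t, x) = u0(x) + t·N(t, x), so that u(0, x) = u0(x) holds by construction. The sketch below uses this multiplicative-in-t form, which is an illustrative choice rather than the paper's exact construction.
```python
import torch
import torch.nn as nn

class HardICSolution(nn.Module):
    def __init__(self, u0):
        super().__init__()
        self.u0 = u0                                    # callable initial condition u0(x)
        self.net = nn.Sequential(nn.Linear(2, 64), nn.Tanh(), nn.Linear(64, 1))

    def forward(self, t, x):
        correction = self.net(torch.cat([t, x], dim=-1))
        return self.u0(x) + t * correction              # exactly u0 at t = 0, no IC loss term needed

u0 = lambda x: torch.sin(torch.pi * x)
model = HardICSolution(u0)
t0 = torch.zeros(5, 1)
x = torch.linspace(-1, 1, 5).unsqueeze(-1)
assert torch.allclose(model(t0, x), u0(x))              # initial condition reproduced exactly
```
Applied window by window, with each new window's u0 taken from the previous window's endpoint, the same ansatz makes a repeatedly applied operator network produce a time-stepped solution that is continuous by construction.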
Abstract: Precipitation forecasts are less accurate than forecasts of other meteorological fields because several key processes affecting precipitation distribution and intensity occur below the resolved scale of global weather prediction models, which makes higher-resolution simulations necessary. To quantify the uncertainty associated with a forecast, ensembles of simulations are run simultaneously, but their computational cost is a limiting factor; there is therefore a growing trend toward emulating ensemble systems with neural networks instead of generating them from simulations. Unfortunately, training data from high-resolution ensemble runs are not available. We propose a new approach to generating ensemble weather predictions for high-resolution precipitation that does not require high-resolution training data. The method uses generative adversarial networks to learn the complex patterns of precipitation and to produce diverse and realistic precipitation fields, making it possible to generate realistic precipitation ensemble members using only the available control forecast. We demonstrate the feasibility of generating realistic precipitation ensemble members at unseen, higher resolutions, and we use evaluation metrics such as RMSE, CRPS, rank histograms and ROC curves to show that our generated ensemble is almost identical to the ECMWF IFS ensemble.
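Conceptually, each ensemble member is a different noise draw passed through a generator conditioned on the control forecast, as in the toy sketch below; the placeholder architecture is not the network used in the paper.
```python
import torch
import torch.nn as nn

class PrecipGenerator(nn.Module):
    def __init__(self, noise_ch=8):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(1 + noise_ch, 32, 3, padding=1), nn.ReLU(),
            nn.Conv2d(32, 1, 3, padding=1), nn.Softplus(),   # precipitation is non-negative
        )
        self.noise_ch = noise_ch

    def forward(self, control, noise):
        return self.net(torch.cat([control, noise], dim=1))

gen = PrecipGenerator()
control = torch.rand(1, 1, 128, 128)                         # single control precipitation field
members = [gen(control, torch.randn(1, gen.noise_ch, 128, 128)) for _ in range(10)]
ensemble = torch.cat(members, dim=0)                         # ten plausible ensemble members
```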
Abstract: In 1950, the first successful numerical weather forecast was obtained by solving the barotropic vorticity equation using the Electronic Numerical Integrator and Computer (ENIAC), which marked the beginning of the age of numerical weather prediction. Here, we ask how these numerical forecasts would have turned out if machine-learning-based solvers had been used instead of standard numerical discretizations. Specifically, we recreate these numerical forecasts using physics-informed neural networks. We show that physics-informed neural networks provide an easier and more accurate methodology for solving meteorological equations on the sphere than the original ENIAC solver.
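For illustration, a PINN residual for the barotropic vorticity equation, ∂ζ/∂t + J(ψ, ζ + f) = 0 with ζ = ∇²ψ, can be assembled as below. For brevity the sketch uses a Cartesian beta-plane form, whereas the experiments described above solve the equation on the sphere.
```python
import torch
import torch.nn as nn

psi_net = nn.Sequential(nn.Linear(3, 64), nn.Tanh(), nn.Linear(64, 64), nn.Tanh(), nn.Linear(64, 1))

def grad(out, inp):
    return torch.autograd.grad(out.sum(), inp, create_graph=True)[0]

def bve_residual(t, x, y, beta=1.6e-11):
    psi = psi_net(torch.cat([t, x, y], dim=-1))              # streamfunction
    psi_x, psi_y = grad(psi, x), grad(psi, y)
    zeta = grad(psi_x, x) + grad(psi_y, y)                   # relative vorticity
    zeta_t, zeta_x, zeta_y = grad(zeta, t), grad(zeta, x), grad(zeta, y)
    return zeta_t + psi_x * (zeta_y + beta) - psi_y * zeta_x  # advection of absolute vorticity

t, x, y = (torch.rand(256, 1, requires_grad=True) for _ in range(3))
loss = (bve_residual(t, x, y) ** 2).mean()                   # drives the PDE residual toward zero
```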
Abstract: We show that the error achievable using physics-informed neural networks for solving systems of differential equations can be substantially reduced when these networks are trained with meta-learned optimization methods rather than with the fixed, hand-crafted optimizers traditionally used. We choose a learnable optimization method based on a shallow multi-layer perceptron that is meta-trained for specific classes of differential equations. We illustrate meta-trained optimizers for several equations of practical relevance in mathematical physics, including the linear advection equation, Poisson's equation, the Korteweg--de Vries equation and Burgers' equation. We also illustrate that meta-learned optimizers exhibit transfer-learning abilities, in that an optimizer meta-trained on one differential equation can also be successfully deployed on another differential equation.
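The following sketch shows the general shape of such a learned optimizer: a small MLP maps per-parameter gradient features to updates and is itself trained by backpropagating through an unrolled inner optimization. The two-feature input, the fixed scale, and the one-step unroll are illustrative assumptions, not the paper's exact design.
```python
import torch
import torch.nn as nn

class MLPOptimizer(nn.Module):
    """Per-parameter update rule: p <- p - scale * mlp([g, |g|])."""
    def __init__(self, hidden=16, scale=1e-2):
        super().__init__()
        self.mlp = nn.Sequential(nn.Linear(2, hidden), nn.ReLU(), nn.Linear(hidden, 1))
        self.scale = scale

    def step(self, params, grads):
        new_params = []
        for p, g in zip(params, grads):
            feats = torch.stack([g, g.abs()], dim=-1)         # simple per-element features
            new_params.append(p - self.scale * self.mlp(feats).squeeze(-1))
        return new_params

opt_net = MLPOptimizer()
params = [torch.randn(10, requires_grad=True)]
loss = (params[0] ** 2).sum()                                 # stand-in for a PINN loss
grads = torch.autograd.grad(loss, params, create_graph=True)  # keep the graph for meta-training
params = opt_net.step(params, grads)                          # one unrolled inner step
meta_loss = (params[0] ** 2).sum()                            # loss after the unrolled step
meta_loss.backward()                                          # gradients flow into opt_net.mlp
```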
Abstract: We present a machine-learning-based method for learning first integrals of systems of ordinary differential equations from given trajectory data. The method is model-free in that it does not require explicit knowledge of the underlying system of differential equations that generated the trajectories. As a by-product, once the first integrals have been learned, the underlying system of differential equations is also known. We illustrate our method on several classical problems from the mathematical sciences.
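A minimal sketch of the idea, under the assumption that conservation is enforced by penalizing the change of a learned scalar I_theta along finite-difference velocities estimated from the trajectory data; the gradient-norm penalty used to exclude the trivial constant solution is an illustrative choice, not necessarily the paper's.
```python
import torch
import torch.nn as nn

I_net = nn.Sequential(nn.Linear(2, 64), nn.Tanh(), nn.Linear(64, 1))
opt = torch.optim.Adam(I_net.parameters(), lr=1e-3)

# toy trajectory data: harmonic-oscillator samples (q, p) on the unit circle
t = torch.linspace(0, 2 * torch.pi, 200).unsqueeze(-1)
traj = torch.cat([torch.cos(t), torch.sin(t)], dim=-1)
xdot = (traj[1:] - traj[:-1]) / (t[1:] - t[:-1])              # finite-difference velocities

for step in range(1000):
    x = traj[:-1].clone().requires_grad_(True)
    I = I_net(x)
    gradI, = torch.autograd.grad(I.sum(), x, create_graph=True)
    drift = (gradI * xdot).sum(dim=-1)                        # dI/dt along the data
    norm_penalty = ((gradI.norm(dim=-1) - 1.0) ** 2).mean()   # exclude I = const
    loss = (drift ** 2).mean() + norm_penalty
    opt.zero_grad(); loss.backward(); opt.step()
```
For this planar toy example, knowing I also pins down the direction of the vector field, since the trajectories must run along the level sets of I; this is the sense in which the system of differential equations becomes known as a by-product.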