Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Pavlos Protopapas

Physics-Informed Neural Networks for Quantum Eigenvalue Problems

Feb 24, 2022

Henry Jin, Marios Mattheakis, Pavlos Protopapas

Figure 1 for Physics-Informed Neural Networks for Quantum Eigenvalue Problems

Figure 2 for Physics-Informed Neural Networks for Quantum Eigenvalue Problems

Figure 3 for Physics-Informed Neural Networks for Quantum Eigenvalue Problems

Figure 4 for Physics-Informed Neural Networks for Quantum Eigenvalue Problems

Abstract:Eigenvalue problems are critical to several fields of science and engineering. We expand on the method of using unsupervised neural networks for discovering eigenfunctions and eigenvalues for differential eigenvalue problems. The obtained solutions are given in an analytical and differentiable form that identically satisfies the desired boundary conditions. The network optimization is data-free and depends solely on the predictions of the neural network. We introduce two physics-informed loss functions. The first, called ortho-loss, motivates the network to discover pair-wise orthogonal eigenfunctions. The second loss term, called norm-loss, requests the discovery of normalized eigenfunctions and is used to avoid trivial solutions. We find that embedding even or odd symmetries to the neural network architecture further improves the convergence for relevant problems. Lastly, a patience condition can be used to automatically recognize eigenfunction solutions. This proposed unsupervised learning method is used to solve the finite well, multiple finite wells, and hydrogen atom eigenvalue quantum problems.

Via

Access Paper or Ask Questions

Building astroBERT, a language model for Astronomy & Astrophysics

Dec 01, 2021

Felix Grezes, Sergi Blanco-Cuaresma, Alberto Accomazzi, Michael J. Kurtz, Golnaz Shapurian, Edwin Henneken, Carolyn S. Grant, Donna M. Thompson, Roman Chyla, Stephen McDonald(+7 more)

Figure 1 for Building astroBERT, a language model for Astronomy & Astrophysics

Figure 2 for Building astroBERT, a language model for Astronomy & Astrophysics

Abstract:The existing search tools for exploring the NASA Astrophysics Data System (ADS) can be quite rich and empowering (e.g., similar and trending operators), but researchers are not yet allowed to fully leverage semantic search. For example, a query for "results from the Planck mission" should be able to distinguish between all the various meanings of Planck (person, mission, constant, institutions and more) without further clarification from the user. At ADS, we are applying modern machine learning and natural language processing techniques to our dataset of recent astronomy publications to train astroBERT, a deeply contextual language model based on research at Google. Using astroBERT, we aim to enrich the ADS dataset and improve its discoverability, and in particular we are developing our own named entity recognition tool. We present here our preliminary results and lessons learned.

Via

Access Paper or Ask Questions

Adversarial Sampling for Solving Differential Equations with Neural Networks

Nov 20, 2021

Kshitij Parwani, Pavlos Protopapas

Figure 1 for Adversarial Sampling for Solving Differential Equations with Neural Networks

Figure 2 for Adversarial Sampling for Solving Differential Equations with Neural Networks

Figure 3 for Adversarial Sampling for Solving Differential Equations with Neural Networks

Figure 4 for Adversarial Sampling for Solving Differential Equations with Neural Networks

Abstract:Neural network-based methods for solving differential equations have been gaining traction. They work by improving the differential equation residuals of a neural network on a sample of points in each iteration. However, most of them employ standard sampling schemes like uniform or perturbing equally spaced points. We present a novel sampling scheme which samples points adversarially to maximize the loss of the current solution estimate. A sampler architecture is described along with the loss terms used for training. Finally, we demonstrate that this scheme outperforms pre-existing schemes by comparing both on a number of problems.

Via

Access Paper or Ask Questions

Uncertainty Quantification in Neural Differential Equations

Nov 08, 2021

Olga Graf, Pablo Flores, Pavlos Protopapas, Karim Pichara

Figure 1 for Uncertainty Quantification in Neural Differential Equations

Figure 2 for Uncertainty Quantification in Neural Differential Equations

Abstract:Uncertainty quantification (UQ) helps to make trustworthy predictions based on collected observations and uncertain domain knowledge. With increased usage of deep learning in various applications, the need for efficient UQ methods that can make deep models more reliable has increased as well. Among applications that can benefit from effective handling of uncertainty are the deep learning based differential equation (DE) solvers. We adapt several state-of-the-art UQ methods to get the predictive uncertainty for DE solutions and show the results on four different DE types.

Via

Access Paper or Ask Questions

Multi-Task Learning based Convolutional Models with Curriculum Learning for the Anisotropic Reynolds Stress Tensor in Turbulent Duct Flow

Oct 30, 2021

Haitz Sáez de Ocáriz Borde, David Sondak, Pavlos Protopapas

Figure 1 for Multi-Task Learning based Convolutional Models with Curriculum Learning for the Anisotropic Reynolds Stress Tensor in Turbulent Duct Flow

Figure 2 for Multi-Task Learning based Convolutional Models with Curriculum Learning for the Anisotropic Reynolds Stress Tensor in Turbulent Duct Flow

Figure 3 for Multi-Task Learning based Convolutional Models with Curriculum Learning for the Anisotropic Reynolds Stress Tensor in Turbulent Duct Flow

Figure 4 for Multi-Task Learning based Convolutional Models with Curriculum Learning for the Anisotropic Reynolds Stress Tensor in Turbulent Duct Flow

Abstract:The Reynolds-averaged Navier-Stokes (RANS) equations require accurate modeling of the anisotropic Reynolds stress tensor, for which traditional closure models only give good results in certain flow configurations. Researchers have started using machine learning approaches to address this problem. In this work we build upon recent convolutional neural network architectures used for turbulence modeling and propose a multi-task learning based fully convolutional neural network that is able to accurately predict the normalized anisotropic Reynolds stress tensor for turbulent duct flow. Furthermore, we also explore the application of curriculum learning to data-driven turbulence modeling.

Via

Access Paper or Ask Questions

One-Shot Transfer Learning of Physics-Informed Neural Networks

Oct 21, 2021

Shaan Desai, Marios Mattheakis, Hayden Joy, Pavlos Protopapas, Stephen Roberts

Figure 1 for One-Shot Transfer Learning of Physics-Informed Neural Networks

Figure 2 for One-Shot Transfer Learning of Physics-Informed Neural Networks

Figure 3 for One-Shot Transfer Learning of Physics-Informed Neural Networks

Figure 4 for One-Shot Transfer Learning of Physics-Informed Neural Networks

Abstract:Solving differential equations efficiently and accurately sits at the heart of progress in many areas of scientific research, from classical dynamical systems to quantum mechanics. There is a surge of interest in using Physics-Informed Neural Networks (PINNs) to tackle such problems as they provide numerous benefits over traditional numerical approaches. Despite their potential benefits for solving differential equations, transfer learning has been under explored. In this study, we present a general framework for transfer learning PINNs that results in one-shot inference for linear systems of both ordinary and partial differential equations. This means that highly accurate solutions to many unknown differential equations can be obtained instantaneously without retraining an entire network. We demonstrate the efficacy of the proposed deep learning approach by solving several real-world problems, such as first- and second-order linear ordinary equations, the Poisson equation, and the time-dependent Schrodinger complex-value partial differential equation.

* [under review]

Via

Access Paper or Ask Questions

Unsupervised Reservoir Computing for Solving Ordinary Differential Equations

Aug 25, 2021

Marios Mattheakis, Hayden Joy, Pavlos Protopapas

Figure 1 for Unsupervised Reservoir Computing for Solving Ordinary Differential Equations

Figure 2 for Unsupervised Reservoir Computing for Solving Ordinary Differential Equations

Figure 3 for Unsupervised Reservoir Computing for Solving Ordinary Differential Equations

Figure 4 for Unsupervised Reservoir Computing for Solving Ordinary Differential Equations

Abstract:There is a wave of interest in using unsupervised neural networks for solving differential equations. The existing methods are based on feed-forward networks, {while} recurrent neural network differential equation solvers have not yet been reported. We introduce an unsupervised reservoir computing (RC), an echo-state recurrent neural network capable of discovering approximate solutions that satisfy ordinary differential equations (ODEs). We suggest an approach to calculate time derivatives of recurrent neural network outputs without using backpropagation. The internal weights of an RC are fixed, while only a linear output layer is trained, yielding efficient training. However, RC performance strongly depends on finding the optimal hyper-parameters, which is a computationally expensive process. We use Bayesian optimization to efficiently discover optimal sets in a high-dimensional hyper-parameter space and numerically show that one set is robust and can be used to solve an ODE for different initial conditions and time ranges. A closed-form formula for the optimal output weights is derived to solve first order linear equations in a backpropagation-free learning process. We extend the RC approach by solving nonlinear system of ODEs using a hybrid optimization method consisting of gradient descent and Bayesian optimization. Evaluation of linear and nonlinear systems of equations demonstrates the efficiency of the RC ODE solver.

Via

Access Paper or Ask Questions

Port-Hamiltonian Neural Networks for Learning Explicit Time-Dependent Dynamical Systems

Jul 16, 2021

Shaan Desai, Marios Mattheakis, David Sondak, Pavlos Protopapas, Stephen Roberts

Abstract:Accurately learning the temporal behavior of dynamical systems requires models with well-chosen learning biases. Recent innovations embed the Hamiltonian and Lagrangian formalisms into neural networks and demonstrate a significant improvement over other approaches in predicting trajectories of physical systems. These methods generally tackle autonomous systems that depend implicitly on time or systems for which a control signal is known apriori. Despite this success, many real world dynamical systems are non-autonomous, driven by time-dependent forces and experience energy dissipation. In this study, we address the challenge of learning from such non-autonomous systems by embedding the port-Hamiltonian formalism into neural networks, a versatile framework that can capture energy dissipation and time-dependent control forces. We show that the proposed \emph{port-Hamiltonian neural network} can efficiently learn the dynamics of nonlinear physical systems of practical interest and accurately recover the underlying stationary Hamiltonian, time-dependent force, and dissipative coefficient. A promising outcome of our network is its ability to learn and predict chaotic systems such as the Duffing equation, for which the trajectories are typically hard to learn.

* [under review]

Via

Access Paper or Ask Questions

Encoding Involutory Invariance in Neural Networks

Jun 07, 2021

Anwesh Bhattacharya, Marios Mattheakis, Pavlos Protopapas

Figure 1 for Encoding Involutory Invariance in Neural Networks

Figure 2 for Encoding Involutory Invariance in Neural Networks

Figure 3 for Encoding Involutory Invariance in Neural Networks

Figure 4 for Encoding Involutory Invariance in Neural Networks

Abstract:In certain situations, Neural Networks (NN) are trained upon data that obey underlying physical symmetries. However, it is not guaranteed that NNs will obey the underlying symmetry unless embedded in the network structure. In this work, we explore a special kind of symmetry where functions are invariant with respect to involutory linear/affine transformations up to parity $p=\pm 1$. We develop mathematical theorems and propose NN architectures that ensure invariance and universal approximation properties. Numerical experiments indicate that the proposed models outperform baseline networks while respecting the imposed symmetry. An adaption of our technique to convolutional NN classification tasks for datasets with inherent horizontal/vertical reflection symmetry has also been proposed.

* 19 pages, 12 figures

Via

Access Paper or Ask Questions

A New Artificial Neuron Proposal with Trainable Simultaneous Local and Global Activation Function

Jan 15, 2021

Tiago A. E. Ferreira, Marios Mattheakis, Pavlos Protopapas

Figure 1 for A New Artificial Neuron Proposal with Trainable Simultaneous Local and Global Activation Function

Figure 2 for A New Artificial Neuron Proposal with Trainable Simultaneous Local and Global Activation Function

Figure 3 for A New Artificial Neuron Proposal with Trainable Simultaneous Local and Global Activation Function

Figure 4 for A New Artificial Neuron Proposal with Trainable Simultaneous Local and Global Activation Function

Abstract:The activation function plays a fundamental role in the artificial neural network learning process. However, there is no obvious choice or procedure to determine the best activation function, which depends on the problem. This study proposes a new artificial neuron, named global-local neuron, with a trainable activation function composed of two components, a global and a local. The global component term used here is relative to a mathematical function to describe a general feature present in all problem domain. The local component is a function that can represent a localized behavior, like a transient or a perturbation. This new neuron can define the importance of each activation function component in the learning phase. Depending on the problem, it results in a purely global, or purely local, or a mixed global and local activation function after the training phase. Here, the trigonometric sine function was employed for the global component and the hyperbolic tangent for the local component. The proposed neuron was tested for problems where the target was a purely global function, or purely local function, or a composition of two global and local functions. Two classes of test problems were investigated, regression problems and differential equations solving. The experimental tests demonstrated the Global-Local Neuron network's superior performance, compared with simple neural networks with sine or hyperbolic tangent activation function, and with a hybrid network that combines these two simple neural networks.

Via

Access Paper or Ask Questions