Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Johannes Brandstetter

Learning Lagrangian Fluid Mechanics with E($3$)-Equivariant Graph Neural Networks

May 24, 2023
Artur P. Toshev, Gianluca Galletti, Johannes Brandstetter, Stefan Adami, Nikolaus A. Adams

Figure 1 for Learning Lagrangian Fluid Mechanics with E($3$)-Equivariant Graph Neural Networks

Figure 2 for Learning Lagrangian Fluid Mechanics with E($3$)-Equivariant Graph Neural Networks

Figure 3 for Learning Lagrangian Fluid Mechanics with E($3$)-Equivariant Graph Neural Networks

Figure 4 for Learning Lagrangian Fluid Mechanics with E($3$)-Equivariant Graph Neural Networks

We contribute to the vastly growing field of machine learning for engineering systems by demonstrating that equivariant graph neural networks have the potential to learn more accurate dynamic-interaction models than their non-equivariant counterparts. We benchmark two well-studied fluid-flow systems, namely 3D decaying Taylor-Green vortex and 3D reverse Poiseuille flow, and evaluate the models based on different performance measures, such as kinetic energy or Sinkhorn distance. In addition, we investigate different embedding methods of physical-information histories for equivariant models. We find that while currently being rather slow to train and evaluate, equivariant models with our proposed history embeddings learn more accurate physical interactions.

* GSI'23 6th International Conference on Geometric Science of Information; 10 pages; oral. arXiv admin note: substantial text overlap with arXiv:2304.00150

Via

Access Paper or Ask Questions

Clifford Group Equivariant Neural Networks

May 18, 2023
David Ruhe, Johannes Brandstetter, Patrick Forré

Figure 1 for Clifford Group Equivariant Neural Networks

Figure 2 for Clifford Group Equivariant Neural Networks

Figure 3 for Clifford Group Equivariant Neural Networks

Figure 4 for Clifford Group Equivariant Neural Networks

We introduce Clifford Group Equivariant Neural Networks: a novel approach for constructing $\mathrm{E}(n)$-equivariant networks. We identify and study the $\textit{Clifford group}$, a subgroup inside the Clifford algebra, whose definition we slightly adjust to achieve several favorable properties. Primarily, the group's action forms an orthogonal automorphism that extends beyond the typical vector space to the entire Clifford algebra while respecting the multivector grading. This leads to several non-equivalent subrepresentations corresponding to the multivector decomposition. Furthermore, we prove that the action respects not just the vector space structure of the Clifford algebra but also its multiplicative structure, i.e., the geometric product. These findings imply that every polynomial in multivectors, including their grade projections, constitutes an equivariant map with respect to the Clifford group, allowing us to parameterize equivariant neural network layers. Notable advantages are that these layers operate directly on a vector basis and elegantly generalize to any dimension. We demonstrate, notably from a single core implementation, state-of-the-art performance on several distinct tasks, including a three-dimensional $n$-body experiment, a four-dimensional Lorentz-equivariant high-energy physics experiment, and a five-dimensional convex hull experiment.

Via

Access Paper or Ask Questions

E($3$) Equivariant Graph Neural Networks for Particle-Based Fluid Mechanics

Mar 31, 2023
Artur P. Toshev, Gianluca Galletti, Johannes Brandstetter, Stefan Adami, Nikolaus A. Adams

Figure 1 for E($3$) Equivariant Graph Neural Networks for Particle-Based Fluid Mechanics

Figure 2 for E($3$) Equivariant Graph Neural Networks for Particle-Based Fluid Mechanics

Figure 3 for E($3$) Equivariant Graph Neural Networks for Particle-Based Fluid Mechanics

We contribute to the vastly growing field of machine learning for engineering systems by demonstrating that equivariant graph neural networks have the potential to learn more accurate dynamic-interaction models than their non-equivariant counterparts. We benchmark two well-studied fluid flow systems, namely the 3D decaying Taylor-Green vortex and the 3D reverse Poiseuille flow, and compare equivariant graph neural networks to their non-equivariant counterparts on different performance measures, such as kinetic energy or Sinkhorn distance. Such measures are typically used in engineering to validate numerical solvers. Our main findings are that while being rather slow to train and evaluate, equivariant models learn more physically accurate interactions. This indicates opportunities for future work towards coarse-grained models for turbulent flows, and generalization across system dynamics and parameters.

* ICLR 2023 Workshop on Physics for Machine Learning

Via

Access Paper or Ask Questions

G-Signatures: Global Graph Propagation With Randomized Signatures

Feb 17, 2023
Bernhard Schäfl, Lukas Gruber, Johannes Brandstetter, Sepp Hochreiter

Figure 1 for G-Signatures: Global Graph Propagation With Randomized Signatures

Figure 2 for G-Signatures: Global Graph Propagation With Randomized Signatures

Figure 3 for G-Signatures: Global Graph Propagation With Randomized Signatures

Figure 4 for G-Signatures: Global Graph Propagation With Randomized Signatures

Graph neural networks (GNNs) have evolved into one of the most popular deep learning architectures. However, GNNs suffer from over-smoothing node information and, therefore, struggle to solve tasks where global graph properties are relevant. We introduce G-Signatures, a novel graph learning method that enables global graph propagation via randomized signatures. G-Signatures use a new graph lifting concept to embed graph structured information, which can be interpreted as path in latent space. We further introduce the idea of latent space path mapping, which allows us to repetitively traverse latent space paths, and, thus globally process information. G-Signatures excel at extracting and processing global graph properties, and effectively scale to large graph problems. Empirically, we confirm the advantages of our G-Signatures at several classification and regression tasks.

* 10 pages (+ appendix); 6 figures

Via

Access Paper or Ask Questions

Geometric Clifford Algebra Networks

Feb 13, 2023
David Ruhe, Jayesh K. Gupta, Steven de Keninck, Max Welling, Johannes Brandstetter

Figure 1 for Geometric Clifford Algebra Networks

Figure 2 for Geometric Clifford Algebra Networks

Figure 3 for Geometric Clifford Algebra Networks

Figure 4 for Geometric Clifford Algebra Networks

We propose Geometric Clifford Algebra Networks (GCANs) that are based on symmetry group transformations using geometric (Clifford) algebras. GCANs are particularly well-suited for representing and manipulating geometric transformations, often found in dynamical systems. We first review the quintessence of modern (plane-based) geometric algebra, which builds on isometries encoded as elements of the $\mathrm{Pin}(p,q,r)$ group. We then propose the concept of group action layers, which linearly combine object transformations using pre-specified group actions. Together with a new activation and normalization scheme, these layers serve as adjustable geometric templates that can be refined via gradient descent. Theoretical advantages are strongly reflected in the modeling of three-dimensional rigid body transformations as well as large-scale fluid dynamics simulations, showing significantly improved performance over traditional methods.

Via

Access Paper or Ask Questions

ClimaX: A foundation model for weather and climate

Jan 24, 2023
Tung Nguyen, Johannes Brandstetter, Ashish Kapoor, Jayesh K. Gupta, Aditya Grover

Figure 1 for ClimaX: A foundation model for weather and climate

Figure 2 for ClimaX: A foundation model for weather and climate

Figure 3 for ClimaX: A foundation model for weather and climate

Figure 4 for ClimaX: A foundation model for weather and climate

Most state-of-the-art approaches for weather and climate modeling are based on physics-informed numerical models of the atmosphere. These approaches aim to model the non-linear dynamics and complex interactions between multiple variables, which are challenging to approximate. Additionally, many such numerical models are computationally intensive, especially when modeling the atmospheric phenomenon at a fine-grained spatial and temporal resolution. Recent data-driven approaches based on machine learning instead aim to directly solve a downstream forecasting or projection task by learning a data-driven functional mapping using deep neural networks. However, these networks are trained using curated and homogeneous climate datasets for specific spatiotemporal tasks, and thus lack the generality of numerical models. We develop and demonstrate ClimaX, a flexible and generalizable deep learning model for weather and climate science that can be trained using heterogeneous datasets spanning different variables, spatio-temporal coverage, and physical groundings. ClimaX extends the Transformer architecture with novel encoding and aggregation blocks that allow effective use of available compute while maintaining general utility. ClimaX is pre-trained with a self-supervised learning objective on climate datasets derived from CMIP6. The pre-trained ClimaX can then be fine-tuned to address a breadth of climate and weather tasks, including those that involve atmospheric variables and spatio-temporal scales unseen during pretraining. Compared to existing data-driven baselines, we show that this generality in ClimaX results in superior performance on benchmarks for weather forecasting and climate projections, even when pretrained at lower resolutions and compute budgets.

Via

Access Paper or Ask Questions

Towards Multi-spatiotemporal-scale Generalized PDE Modeling

Sep 30, 2022
Jayesh K. Gupta, Johannes Brandstetter

Figure 1 for Towards Multi-spatiotemporal-scale Generalized PDE Modeling

Figure 2 for Towards Multi-spatiotemporal-scale Generalized PDE Modeling

Figure 3 for Towards Multi-spatiotemporal-scale Generalized PDE Modeling

Figure 4 for Towards Multi-spatiotemporal-scale Generalized PDE Modeling

Partial differential equations (PDEs) are central to describing complex physical system simulations. Their expensive solution techniques have led to an increased interest in deep neural network based surrogates. However, the practical utility of training such surrogates is contingent on their ability to model complex multi-scale spatio-temporal phenomena. Various neural network architectures have been proposed to target such phenomena, most notably Fourier Neural Operators (FNOs) which give a natural handle over local \& global spatial information via parameterization of different Fourier modes, and U-Nets which treat local and global information via downsampling and upsampling paths. However, generalizing across different equation parameters or different time-scales still remains a challenge. In this work, we make a comprehensive comparison between various FNO and U-Net like approaches on fluid mechanics problems in both vorticity-stream and velocity function form. For U-Nets, we transfer recent architectural improvements from computer vision, most notably from object segmentation and generative modeling. We further analyze the design considerations for using FNO layers to improve performance of U-Net architectures without major degradation of computational performance. Finally, we show promising results on generalization to different PDE parameters and time-scales with a single surrogate model.

Via

Access Paper or Ask Questions

Clifford Neural Layers for PDE Modeling

Sep 08, 2022
Johannes Brandstetter, Rianne van den Berg, Max Welling, Jayesh K. Gupta

Figure 1 for Clifford Neural Layers for PDE Modeling

Figure 2 for Clifford Neural Layers for PDE Modeling

Figure 3 for Clifford Neural Layers for PDE Modeling

Figure 4 for Clifford Neural Layers for PDE Modeling

Partial differential equations (PDEs) see widespread use in sciences and engineering to describe simulation of physical processes as scalar and vector fields interacting and coevolving over time. Due to the computationally expensive nature of their standard solution methods, neural PDE surrogates have become an active research topic to accelerate these simulations. However, current methods do not explicitly take into account the relationship between different fields and their internal components, which are often correlated. Viewing the time evolution of such correlated fields through the lens of multivector fields allows us to overcome these limitations. Multivector fields consist of scalar, vector, as well as higher-order components, such as bivectors and trivectors. Their algebraic properties, such as multiplication, addition and other arithmetic operations can be described by Clifford algebras. To our knowledge, this paper presents the first usage of such multivector representations together with Clifford convolutions and Clifford Fourier transforms in the context of deep learning. The resulting Clifford neural layers are universally applicable and will find direct use in the areas of fluid dynamics, weather forecasting, and the modeling of physical systems in general. We empirically evaluate the benefit of Clifford neural layers by replacing convolution and Fourier operations in common neural PDE surrogates by their Clifford counterparts on two-dimensional Navier-Stokes and weather modeling tasks, as well as three-dimensional Maxwell equations. Clifford neural layers consistently improve generalization capabilities of the tested neural PDE surrogates.

Via

Access Paper or Ask Questions

Few-Shot Learning by Dimensionality Reduction in Gradient Space

Jun 07, 2022
Martin Gauch, Maximilian Beck, Thomas Adler, Dmytro Kotsur, Stefan Fiel, Hamid Eghbal-zadeh, Johannes Brandstetter, Johannes Kofler, Markus Holzleitner, Werner Zellinger, Daniel Klotz, Sepp Hochreiter, Sebastian Lehner

Figure 1 for Few-Shot Learning by Dimensionality Reduction in Gradient Space

Figure 2 for Few-Shot Learning by Dimensionality Reduction in Gradient Space

Figure 3 for Few-Shot Learning by Dimensionality Reduction in Gradient Space

Figure 4 for Few-Shot Learning by Dimensionality Reduction in Gradient Space

We introduce SubGD, a novel few-shot learning method which is based on the recent finding that stochastic gradient descent updates tend to live in a low-dimensional parameter subspace. In experimental and theoretical analyses, we show that models confined to a suitable predefined subspace generalize well for few-shot learning. A suitable subspace fulfills three criteria across the given tasks: it (a) allows to reduce the training error by gradient flow, (b) leads to models that generalize well, and (c) can be identified by stochastic gradient descent. SubGD identifies these subspaces from an eigendecomposition of the auto-correlation matrix of update directions across different tasks. Demonstrably, we can identify low-dimensional suitable subspaces for few-shot learning of dynamical systems, which have varying properties described by one or few parameters of the analytical system description. Such systems are ubiquitous among real-world applications in science and engineering. We experimentally corroborate the advantages of SubGD on three distinct dynamical systems problem settings, significantly outperforming popular few-shot learning methods both in terms of sample efficiency and performance.

* Accepted at Conference on Lifelong Learning Agents (CoLLAs) 2022. Code: https://github.com/ml-jku/subgd Blog post: https://ml-jku.github.io/subgd

Via

Access Paper or Ask Questions

Lie Point Symmetry Data Augmentation for Neural PDE Solvers

Feb 15, 2022
Johannes Brandstetter, Max Welling, Daniel E. Worrall

Figure 1 for Lie Point Symmetry Data Augmentation for Neural PDE Solvers

Figure 2 for Lie Point Symmetry Data Augmentation for Neural PDE Solvers

Figure 3 for Lie Point Symmetry Data Augmentation for Neural PDE Solvers

Figure 4 for Lie Point Symmetry Data Augmentation for Neural PDE Solvers

Neural networks are increasingly being used to solve partial differential equations (PDEs), replacing slower numerical solvers. However, a critical issue is that neural PDE solvers require high-quality ground truth data, which usually must come from the very solvers they are designed to replace. Thus, we are presented with a proverbial chicken-and-egg problem. In this paper, we present a method, which can partially alleviate this problem, by improving neural PDE solver sample complexity -- Lie point symmetry data augmentation (LPSDA). In the context of PDEs, it turns out that we are able to quantitatively derive an exhaustive list of data transformations, based on the Lie point symmetry group of the PDEs in question, something not possible in other application areas. We present this framework and demonstrate how it can easily be deployed to improve neural PDE solver sample complexity by an order of magnitude.

Via

Access Paper or Ask Questions