Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Peter Y. Lu

Hierarchical Implicit Neural Emulators

Jun 05, 2025

Ruoxi Jiang, Xiao Zhang, Karan Jakhar, Peter Y. Lu, Pedram Hassanzadeh, Michael Maire, Rebecca Willett

Abstract:Neural PDE solvers offer a powerful tool for modeling complex dynamical systems, but often struggle with error accumulation over long time horizons and maintaining stability and physical consistency. We introduce a multiscale implicit neural emulator that enhances long-term prediction accuracy by conditioning on a hierarchy of lower-dimensional future state representations. Drawing inspiration from the stability properties of numerical implicit time-stepping methods, our approach leverages predictions several steps ahead in time at increasing compression rates for next-timestep refinements. By actively adjusting the temporal downsampling ratios, our design enables the model to capture dynamics across multiple granularities and enforce long-range temporal coherence. Experiments on turbulent fluid dynamics show that our method achieves high short-term accuracy and produces long-term stable forecasts, significantly outperforming autoregressive baselines while adding minimal computational overhead.

Via

Access Paper or Ask Questions

Embed and Emulate: Contrastive representations for simulation-based inference

Sep 27, 2024

Ruoxi Jiang, Peter Y. Lu, Rebecca Willett

Figure 1 for Embed and Emulate: Contrastive representations for simulation-based inference

Figure 2 for Embed and Emulate: Contrastive representations for simulation-based inference

Figure 3 for Embed and Emulate: Contrastive representations for simulation-based inference

Figure 4 for Embed and Emulate: Contrastive representations for simulation-based inference

Abstract:Scientific modeling and engineering applications rely heavily on parameter estimation methods to fit physical models and calibrate numerical simulations using real-world measurements. In the absence of analytic statistical models with tractable likelihoods, modern simulation-based inference (SBI) methods first use a numerical simulator to generate a dataset of parameters and simulated outputs. This dataset is then used to approximate the likelihood and estimate the system parameters given observation data. Several SBI methods employ machine learning emulators to accelerate data generation and parameter estimation. However, applying these approaches to high-dimensional physical systems remains challenging due to the cost and complexity of training high-dimensional emulators. This paper introduces Embed and Emulate (E&E): a new SBI method based on contrastive learning that efficiently handles high-dimensional data and complex, multimodal parameter posteriors. E&E learns a low-dimensional latent embedding of the data (i.e., a summary statistic) and a corresponding fast emulator in the latent space, eliminating the need to run expensive simulations or a high dimensional emulator during inference. We illustrate the theoretical properties of the learned latent space through a synthetic experiment and demonstrate superior performance over existing methods in a realistic, non-identifiable parameter estimation task using the high-dimensional, chaotic Lorenz 96 system.

Via

Access Paper or Ask Questions

Multimodal Learning for Crystalline Materials

Nov 30, 2023

Viggo Moro, Charlotte Loh, Rumen Dangovski, Ali Ghorashi, Andrew Ma, Zhuo Chen, Peter Y. Lu, Thomas Christensen, Marin Soljačić

Figure 1 for Multimodal Learning for Crystalline Materials

Figure 2 for Multimodal Learning for Crystalline Materials

Figure 3 for Multimodal Learning for Crystalline Materials

Figure 4 for Multimodal Learning for Crystalline Materials

Abstract:Artificial intelligence (AI) has revolutionized the field of materials science by improving the prediction of properties and accelerating the discovery of novel materials. In recent years, publicly available material data repositories containing data for various material properties have grown rapidly. In this work, we introduce Multimodal Learning for Crystalline Materials (MLCM), a new method for training a foundation model for crystalline materials via multimodal alignment, where high-dimensional material properties (i.e. modalities) are connected in a shared latent space to produce highly useful material representations. We show the utility of MLCM on multiple axes: (i) MLCM achieves state-of-the-art performance for material property prediction on the challenging Materials Project database; (ii) MLCM enables a novel, highly accurate method for inverse design, allowing one to screen for stable material with desired properties; and (iii) MLCM allows the extraction of interpretable emergent features that may provide insight to material scientists. Further, we explore several novel methods for aligning an arbitrary number of modalities, improving upon prior art in multimodal learning that focuses on bimodal alignment. Our work brings innovations from the ongoing AI revolution into the domain of materials science and identifies materials as a testbed for the next generation of AI.

* 11 pages, 3 figures

Via

Access Paper or Ask Questions

Training neural operators to preserve invariant measures of chaotic attractors

Jun 01, 2023

Ruoxi Jiang, Peter Y. Lu, Elena Orlova, Rebecca Willett

Abstract:Chaotic systems make long-horizon forecasts difficult because small perturbations in initial conditions cause trajectories to diverge at an exponential rate. In this setting, neural operators trained to minimize squared error losses, while capable of accurate short-term forecasts, often fail to reproduce statistical or structural properties of the dynamics over longer time horizons and can yield degenerate results. In this paper, we propose an alternative framework designed to preserve invariant measures of chaotic attractors that characterize the time-invariant statistical properties of the dynamics. Specifically, in the multi-environment setting (where each sample trajectory is governed by slightly different dynamics), we consider two novel approaches to training with noisy data. First, we propose a loss based on the optimal transport distance between the observed dynamics and the neural operator outputs. This approach requires expert knowledge of the underlying physics to determine what statistical features should be included in the optimal transport loss. Second, we show that a contrastive learning framework, which does not require any specialized prior knowledge, can preserve statistical properties of the dynamics nearly as well as the optimal transport approach. On a variety of chaotic systems, our method is shown empirically to preserve invariant measures of chaotic attractors.

Via

Access Paper or Ask Questions

Deep Stochastic Mechanics

May 31, 2023

Elena Orlova, Aleksei Ustimenko, Ruoxi Jiang, Peter Y. Lu, Rebecca Willett

Abstract:This paper introduces a novel deep-learning-based approach for numerical simulation of a time-evolving Schr\"odinger equation inspired by stochastic mechanics and generative diffusion models. Unlike existing approaches, which exhibit computational complexity that scales exponentially in the problem dimension, our method allows us to adapt to the latent low-dimensional structure of the wave function by sampling from the Markovian diffusion. Depending on the latent dimension, our method may have far lower computational complexity in higher dimensions. Moreover, we propose novel equations for stochastic quantum mechanics, resulting in linear computational complexity with respect to the number of dimensions. Numerical simulations verify our theoretical findings and show a significant advantage of our method compared to other deep-learning-based approaches used for quantum mechanics.

Via

Access Paper or Ask Questions

Model Stitching: Looking For Functional Similarity Between Representations

Mar 20, 2023

Adriano Hernandez, Rumen Dangovski, Peter Y. Lu, Marin Soljacic

Figure 1 for Model Stitching: Looking For Functional Similarity Between Representations

Figure 2 for Model Stitching: Looking For Functional Similarity Between Representations

Figure 3 for Model Stitching: Looking For Functional Similarity Between Representations

Figure 4 for Model Stitching: Looking For Functional Similarity Between Representations

Abstract:Model stitching (Lenc & Vedaldi 2015) is a compelling methodology to compare different neural network representations, because it allows us to measure to what degree they may be interchanged. We expand on a previous work from Bansal, Nakkiran & Barak which used model stitching to compare representations of the same shapes learned by differently seeded and/or trained neural networks of the same architecture. Our contribution enables us to compare the representations learned by layers with different shapes from neural networks with different architectures. We subsequently reveal unexpected behavior of model stitching. Namely, we find that stitching, based on convolutions, for small ResNets, can reach high accuracy if those layers come later in the first (sender) network than in the second (receiver), even if those layers are far apart.

* 5 pages, 2 figures

Via

Access Paper or Ask Questions

Q-Flow: Generative Modeling for Differential Equations of Open Quantum Dynamics with Normalizing Flows

Feb 23, 2023

Owen Dugan, Peter Y. Lu, Rumen Dangovski, Di Luo, Marin Soljačić

Figure 1 for Q-Flow: Generative Modeling for Differential Equations of Open Quantum Dynamics with Normalizing Flows

Figure 2 for Q-Flow: Generative Modeling for Differential Equations of Open Quantum Dynamics with Normalizing Flows

Figure 3 for Q-Flow: Generative Modeling for Differential Equations of Open Quantum Dynamics with Normalizing Flows

Figure 4 for Q-Flow: Generative Modeling for Differential Equations of Open Quantum Dynamics with Normalizing Flows

Abstract:Studying the dynamics of open quantum systems holds the potential to enable breakthroughs both in fundamental physics and applications to quantum engineering and quantum computation. Due to the high-dimensional nature of the problem, customized deep generative neural networks have been instrumental in modeling the high-dimensional density matrix $\rho$, which is the key description for the dynamics of such systems. However, the complex-valued nature and normalization constraints of $\rho$, as well as its complicated dynamics, prohibit a seamless connection between open quantum systems and the recent advances in deep generative modeling. Here we lift that limitation by utilizing a reformulation of open quantum system dynamics to a partial differential equation (PDE) for a corresponding probability distribution $Q$, the Husimi Q function. Thus, we model the Q function seamlessly with off-the-shelf deep generative models such as normalizing flows. Additionally, we develop novel methods for learning normalizing flow evolution governed by high-dimensional PDEs, based on the Euler method and the application of the time-dependent variational principle. We name the resulting approach Q-Flow and demonstrate the scalability and efficiency of Q-Flow on open quantum system simulations, including the dissipative harmonic oscillator and the dissipative bosonic model. Q-Flow is superior to conventional PDE solvers and state-of-the-art physics-informed neural network solvers, especially in high-dimensional systems.

Via

Access Paper or Ask Questions

Discovering Conservation Laws using Optimal Transport and Manifold Learning

Aug 31, 2022

Peter Y. Lu, Rumen Dangovski, Marin Soljačić

Figure 1 for Discovering Conservation Laws using Optimal Transport and Manifold Learning

Figure 2 for Discovering Conservation Laws using Optimal Transport and Manifold Learning

Figure 3 for Discovering Conservation Laws using Optimal Transport and Manifold Learning

Figure 4 for Discovering Conservation Laws using Optimal Transport and Manifold Learning

Abstract:Conservation laws are key theoretical and practical tools for understanding, characterizing, and modeling nonlinear dynamical systems. However, for many complex dynamical systems, the corresponding conserved quantities are difficult to identify, making it hard to analyze their dynamics and build efficient, stable predictive models. Current approaches for discovering conservation laws often depend on detailed dynamical information, such as the equation of motion or fine-grained time measurements, with many recent proposals also relying on black box parametric deep learning methods. We instead reformulate this task as a manifold learning problem and propose a non-parametric approach, combining the Wasserstein metric from optimal transport with diffusion maps, to discover conserved quantities that vary across trajectories sampled from a dynamical system. We test this new approach on a variety of physical systems$\unicode{x2014}$including conservative Hamiltonian systems, dissipative systems, and spatiotemporal systems$\unicode{x2014}$and demonstrate that our manifold learning method is able to both identify the number of conserved quantities and extract their values. Using tools from optimal transport theory and manifold learning, our proposed method provides a direct geometric approach to identifying conservation laws that is both robust and interpretable without requiring an explicit model of the system nor accurate time information.

* 16 pages, 11 figures (7 main text, 4 supplemental), 1 table (supplemental)

Via

Access Paper or Ask Questions

Deep Learning and Symbolic Regression for Discovering Parametric Equations

Jul 01, 2022

Michael Zhang, Samuel Kim, Peter Y. Lu, Marin Soljačić

Figure 1 for Deep Learning and Symbolic Regression for Discovering Parametric Equations

Figure 2 for Deep Learning and Symbolic Regression for Discovering Parametric Equations

Figure 3 for Deep Learning and Symbolic Regression for Discovering Parametric Equations

Figure 4 for Deep Learning and Symbolic Regression for Discovering Parametric Equations

Abstract:Symbolic regression is a machine learning technique that can learn the governing formulas of data and thus has the potential to transform scientific discovery. However, symbolic regression is still limited in the complexity and dimensionality of the systems that it can analyze. Deep learning on the other hand has transformed machine learning in its ability to analyze extremely complex and high-dimensional datasets. We propose a neural network architecture to extend symbolic regression to parametric systems where some coefficient may vary but the structure of the underlying governing equation remains constant. We demonstrate our method on various analytic expressions, ODEs, and PDEs with varying coefficients and show that it extrapolates well outside of the training domain. The neural network-based architecture can also integrate with other deep learning architectures so that it can analyze high-dimensional data while being trained end-to-end. To this end we integrate our architecture with convolutional neural networks to analyze 1D images of varying spring systems.

* Michael Zhang and Samuel Kim contributed equally to this work. 10 pages, 14 figures

Via

Access Paper or Ask Questions

Discovering Sparse Interpretable Dynamics from Partial Observations

Jul 22, 2021

Peter Y. Lu, Joan Ariño, Marin Soljačić

Figure 1 for Discovering Sparse Interpretable Dynamics from Partial Observations

Figure 2 for Discovering Sparse Interpretable Dynamics from Partial Observations

Figure 3 for Discovering Sparse Interpretable Dynamics from Partial Observations

Figure 4 for Discovering Sparse Interpretable Dynamics from Partial Observations

Abstract:Identifying the governing equations of a nonlinear dynamical system is key to both understanding the physical features of the system and constructing an accurate model of the dynamics that generalizes well beyond the available data. We propose a machine learning framework for discovering these governing equations using only partial observations, combining an encoder for state reconstruction with a sparse symbolic model. Our tests show that this method can successfully reconstruct the full system state and identify the underlying dynamics for a variety of ODE and PDE systems.

* 8 pages, 4 figures

Via

Access Paper or Ask Questions