Youngkyu Lee

Spectrally Safe Neural Operator Warm-Starts for Large-Scale Newton Solvers

Jun 20, 2026

Jaemin Oh, Youngkyu Lee, Jerome Darbon, George Em Karniadakis

Abstract: Neural operators are increasingly used to warm-start Newton solvers for nonlinear PDEs, on the premise that a low test error places the initial guess inside the basin of attraction. We show that this premise is unreliable. An operator trained to a relative \(L^2\) error of \(O(10^{-3})\) can still produce an initial state at which the discrete Jacobian is indefinite, because mean-squared training controls the error on average while leaving localized pointwise violations of the underlying physics. For a nearly incompressible hyperelasticity problem, we trace this to the predicted volume change: the operator disperses \(\det F\) well away from one, and the resulting Jacobian acquires negative eigenvalues even when the predicted field is visually indistinguishable from the reference. At a small scale, this is a nuisance; at a multi-million degree-of-freedom scale, it is disqualifying, since conjugate gradient and the other Krylov solvers needed for memory-feasible Newton steps assume a definite spectrum. We then show that a short, label-free fine-tuning phase (penalizing the operator against the discrete energy, with no additional solution data) shifts the Jacobian spectrum back to positive definite. Combined with an inexact outer loop, this gives a warm-started Newton method that converges across the full loading range where the unregularized operator fails, reaching up to 5.4\(\times\) wall-clock speedup over incremental continuation on a 3D problem with 6.4 million degrees of freedom.

* 23 pages, 8 figures, 7 tables
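
The gap between average error and pointwise physics described in this abstract can be illustrated with a 1D toy analogue. This is only a sketch under stated assumptions: the identity reference map, the perturbation amplitude `eps`, and the wavenumber `k` are hypothetical choices, not the paper's hyperelasticity setup. In 1D the deformation gradient is the scalar derivative, so `det_F` below is simply the derivative of the predicted map, which analytically equals `1 + 2*cos(k*x)`.

```python
import numpy as np

# 1D toy analogue (an assumption for illustration, not the paper's 3D problem):
# the reference deformation is the identity map, so its "det F" is exactly 1.
x = np.linspace(0.0, 1.0, 200_001)
phi_ref = x

# Hypothetical prediction: a tiny-amplitude, high-frequency error.
eps, k = 1e-3, 2000.0
phi_pred = phi_ref + eps * np.sin(k * x)

# Relative L2 error of the predicted field (uniform grid, so the mesh size cancels).
rel_l2 = np.linalg.norm(phi_pred - phi_ref) / np.linalg.norm(phi_ref)

# In 1D the deformation gradient is the derivative, so det F = d(phi)/dx.
det_F = np.gradient(phi_pred, x)

print(f"relative L2 error: {rel_l2:.2e}")
print(f"det F range: [{det_F.min():.2f}, {det_F.max():.2f}]")
```

The field error stays at the \(O(10^{-3})\) level while the derivative changes sign, which is the kind of localized violation a mean-squared loss does not penalize strongly.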

Hybrid Iterative Solvers with Geometry-Aware Neural Preconditioners for Parametric PDEs

Dec 16, 2025

Youngkyu Lee, Francesc Levrero Florencio, Jay Pathak, George Em Karniadakis

Abstract: The convergence behavior of classical iterative solvers for parametric partial differential equations (PDEs) is often highly sensitive to the domain and specific discretization of PDEs. Previously, we introduced hybrid solvers by combining the classical solvers with neural operators for a specific geometry [1], but they tend to under-perform in geometries not encountered during training. To address this challenge, we introduce Geo-DeepONet, a geometry-aware deep operator network that incorporates domain information extracted from finite element discretizations. Geo-DeepONet enables accurate operator learning across arbitrary unstructured meshes without requiring retraining. Building on this, we develop a class of geometry-aware hybrid preconditioned iterative solvers by coupling Geo-DeepONet with traditional methods such as relaxation schemes and Krylov subspace algorithms. Through numerical experiments on parametric PDEs posed over diverse unstructured domains, we demonstrate the enhanced robustness and efficiency of the proposed hybrid solvers for multiple real-world applications.

* 19 pages, 10 figures, 3 tables

A Neural-Operator Preconditioned Newton Method for Accelerated Nonlinear Solvers

Nov 11, 2025

Youngkyu Lee, Shanqing Liu, Jerome Darbon, George Em Karniadakis

Abstract: We propose a novel neural preconditioned Newton (NP-Newton) method for solving parametric nonlinear systems of equations. To overcome the stagnation or instability of Newton iterations caused by unbalanced nonlinearities, we introduce a fixed-point neural operator (FPNO) that learns the direct mapping from the current iterate to the solution by emulating fixed-point iterations. Unlike traditional line-search or trust-region algorithms, the proposed FPNO adaptively employs negative step sizes to effectively mitigate the effects of unbalanced nonlinearities. Through numerical experiments, we demonstrate the computational efficiency and robustness of the proposed NP-Newton method across multiple real-world applications, especially for very strong nonlinearities.

* 14 pages, 5 figures, 7 tables

Leveraging Operator Learning to Accelerate Convergence of the Preconditioned Conjugate Gradient Method

Jul 31, 2025

Alena Kopaničáková, Youngkyu Lee, George Em Karniadakis

Abstract: We propose a new deflation strategy to accelerate the convergence of the preconditioned conjugate gradient (PCG) method for solving parametric large-scale linear systems of equations. Unlike traditional deflation techniques that rely on eigenvector approximations or recycled Krylov subspaces, we generate the deflation subspaces using operator learning, specifically the Deep Operator Network (DeepONet). To this end, we introduce two complementary approaches for assembling the deflation operators. The first approach approximates near-null space vectors of the discrete PDE operator using the basis functions learned by the DeepONet. The second approach directly leverages solutions predicted by the DeepONet. To further enhance convergence, we also propose several strategies for prescribing the sparsity pattern of the deflation operator. A comprehensive set of numerical experiments encompassing steady-state, time-dependent, scalar, and vector-valued problems posed on both structured and unstructured geometries is presented and demonstrates the effectiveness of the proposed DeepONet-based deflated PCG method, as well as its generalization across a wide range of model parameters and problem resolutions.

* 31 pages
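
The effect of a deflation subspace on the spectrum seen by conjugate gradient can be sketched with the standard projector \(P = I - AZ(Z^{T}AZ)^{-1}Z^{T}\). The example below is a minimal illustration, not the paper's construction: the test matrix, the size `n`, and the number of deflation vectors `n_defl` are assumptions, and exact eigenvectors stand in for the DeepONet-derived columns of `Z`.

```python
import numpy as np

n, n_defl = 50, 5

# SPD test matrix: 1D finite-difference Laplacian with Dirichlet boundaries.
A = 2.0 * np.eye(n) - np.eye(n, k=1) - np.eye(n, k=-1)

# Stand-in deflation basis Z: eigenvectors of the n_defl smallest eigenvalues.
# In the abstract these columns would come from DeepONet basis functions or
# predictions; exact eigenvectors are used here only as an assumption.
eigvals, eigvecs = np.linalg.eigh(A)
Z = eigvecs[:, :n_defl]

# Deflation projector P = I - A Z (Z^T A Z)^{-1} Z^T.
E = Z.T @ A @ Z
P = np.eye(n) - A @ Z @ np.linalg.solve(E, Z.T)

# P A is symmetric; the deflated eigenvalues are zero up to round-off.
PA = P @ A
PA = 0.5 * (PA + PA.T)  # remove round-off asymmetry before eigvalsh
spec = np.linalg.eigvalsh(PA)
nonzero = spec[spec > 1e-8 * spec.max()]

cond_plain = eigvals[-1] / eigvals[0]
cond_deflated = nonzero.max() / nonzero.min()
print(f"cond(A) = {cond_plain:.1f}, effective cond(PA) = {cond_deflated:.1f}")
```

Because conjugate gradient convergence is governed by the nonzero part of the spectrum of \(PA\), removing the smallest eigenvalues lowers the effective condition number.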

A Nonoverlapping Domain Decomposition Method for Extreme Learning Machines: Elliptic Problems

Jun 22, 2024

Chang-Ock Lee, Youngkyu Lee, Byungeun Ryoo

Abstract: The extreme learning machine (ELM) is a methodology for solving partial differential equations (PDEs) using a single-hidden-layer feed-forward neural network. It presets the weight/bias coefficients in the hidden layer with random values, which remain fixed throughout the computation, and uses a linear least squares method for training the parameters of the output layer of the neural network. It is known to be much faster than physics-informed neural networks. However, the classical ELM is still computationally expensive when a high level of representation is desired in the solution, as this requires solving a large least squares system. In this paper, we propose a nonoverlapping domain decomposition method (DDM) for ELMs that not only reduces the training time of ELMs, but is also suitable for parallel computation. In numerical analysis, DDMs have been widely studied to reduce the time to obtain finite element solutions for elliptic PDEs through parallel computation. Among these approaches, nonoverlapping DDMs are attracting the most attention. Motivated by these methods, we introduce local neural networks, which are valid only at corresponding subdomains, and an auxiliary variable at the interface. We construct a system on the variable and the parameters of local neural networks. A Schur complement system on the interface can be derived by eliminating the parameters of the output layer. The auxiliary variable is then directly obtained by solving the reduced system, after which the parameters for each local neural network are solved in parallel. A method for initializing the hidden layer parameters suitable for high approximation quality in large systems is also proposed. Numerical results that verify the acceleration performance of the proposed method with respect to the number of subdomains are presented.

* 18 pages, 4 figures, 7 tables
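
The classical single-domain ELM described at the start of this abstract (random fixed hidden layer, output layer found by linear least squares) can be sketched on a 1D Poisson problem. Everything specific here is an assumption for illustration: the model problem, the `tanh` activation, the sampling ranges for `w` and `b`, and the collocation points. The domain decomposition and Schur complement steps of the paper are not shown.

```python
import numpy as np

rng = np.random.default_rng(0)
m = 100                                  # number of hidden units
w = rng.uniform(-5.0, 5.0, size=m)       # fixed random hidden weights
b = rng.uniform(-5.0, 5.0, size=m)       # fixed random hidden biases

def features(x):
    """Hidden-layer outputs tanh(w x + b) and their second x-derivatives."""
    t = np.tanh(np.outer(x, w) + b)
    d2 = (w ** 2) * (-2.0 * t * (1.0 - t ** 2))
    return t, d2

# Model problem (an assumption): -u'' = pi^2 sin(pi x) on (0, 1),
# u(0) = u(1) = 0, with exact solution u(x) = sin(pi x).
x_in = np.linspace(0.0, 1.0, 202)[1:-1]  # interior collocation points
x_bc = np.array([0.0, 1.0])              # boundary points

_, d2_in = features(x_in)
t_bc, _ = features(x_bc)

# Stack PDE rows and boundary rows into one linear least-squares system
# for the output-layer coefficients beta.
H = np.vstack([-d2_in, t_bc])
rhs = np.concatenate([np.pi ** 2 * np.sin(np.pi * x_in), np.zeros(2)])
beta, *_ = np.linalg.lstsq(H, rhs, rcond=None)

x_test = np.linspace(0.0, 1.0, 501)
u_elm = features(x_test)[0] @ beta
max_err = np.max(np.abs(u_elm - np.sin(np.pi * x_test)))
print(f"max error: {max_err:.2e}")
```

The only trained quantity is `beta`, so the cost is dominated by the least-squares solve, which is the system that grows expensive as more hidden units are used.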

Two-level overlapping additive Schwarz preconditioner for training scientific machine learning applications

Jun 16, 2024

Youngkyu Lee, Alena Kopaničáková, George Em Karniadakis

Abstract: We introduce a novel two-level overlapping additive Schwarz preconditioner for accelerating the training of scientific machine learning applications. The design of the proposed preconditioner is motivated by the nonlinear two-level overlapping additive Schwarz preconditioner. The neural network parameters are decomposed into groups (subdomains) with overlapping regions. In addition, the network's feed-forward structure is indirectly imposed through a novel subdomain-wise synchronization strategy and a coarse-level training step. Through a series of numerical experiments, which consider physics-informed neural networks and operator learning approaches, we demonstrate that the proposed two-level preconditioner significantly speeds up the convergence of the standard (LBFGS) optimizer while also yielding more accurate machine learning models. Moreover, the devised preconditioner is designed to take advantage of model-parallel computations, which can further reduce the training time.

* 24 pages, 9 figures
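
The idea of decomposing unknowns into overlapping groups and summing local corrections can be sketched in its classical linear form. The example below is a one-level overlapping additive Schwarz preconditioner for a 1D Laplacian, offered only as an analogue under stated assumptions: the paper applies a nonlinear, two-level variant to network parameters, and the matrix, block `size`, and `overlap` here are hypothetical choices.

```python
import numpy as np

n = 60
A = 2.0 * np.eye(n) - np.eye(n, k=1) - np.eye(n, k=-1)   # 1D Laplacian (SPD)

# Four overlapping index blocks ("subdomains"); size and overlap are arbitrary.
size, overlap = 15, 3
blocks = [np.arange(max(0, s - overlap), min(n, s + size + overlap))
          for s in range(0, n, size)]

# One-level additive Schwarz: M^{-1} = sum_i R_i^T (R_i A R_i^T)^{-1} R_i,
# where R_i restricts a vector to the indices of block i.
M_inv = np.zeros((n, n))
for idx in blocks:
    A_loc = A[np.ix_(idx, idx)]
    M_inv[np.ix_(idx, idx)] += np.linalg.inv(A_loc)

def cond(eigs):
    """Ratio of largest to smallest eigenvalue."""
    return eigs.max() / eigs.min()

cond_A = cond(np.linalg.eigvalsh(A))
# M^{-1} A is similar to a symmetric positive definite matrix,
# so its eigenvalues are real and positive.
eigs_prec = np.linalg.eigvals(M_inv @ A).real
cond_prec = cond(eigs_prec)
print(f"cond(A) = {cond_A:.0f}, cond(M^-1 A) = {cond_prec:.1f}")
```

Each local solve is independent of the others, which is what makes this family of preconditioners attractive for parallel computation.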

Balanced Group Convolution: An Improved Group Convolution Based on Approximability Estimates

Oct 19, 2023

Youngkyu Lee, Jongho Park, Chang-Ock Lee

Abstract: The performance of neural networks has been significantly improved by increasing the number of channels in convolutional layers. However, this increase in performance comes with a higher computational cost, resulting in numerous studies focused on reducing it. One promising approach to address this issue is group convolution, which effectively reduces the computational cost by grouping channels. However, to the best of our knowledge, there has been no theoretical analysis of how well the group convolution approximates the standard convolution. In this paper, we mathematically analyze the approximation of the group convolution to the standard convolution with respect to the number of groups. Furthermore, we propose a novel variant of the group convolution called balanced group convolution, which achieves a better approximation with a small additional computational cost. We provide experimental results that validate our theoretical findings and demonstrate the superior performance of the balanced group convolution over other variants of group convolution.

* 26 pages, 2 figures

Two-level Group Convolution

Oct 11, 2021

Youngkyu Lee, Jongho Park, Chang-Ock Lee

Abstract: Group convolution has been widely used in order to reduce the computation time of convolution, which takes most of the training time of convolutional neural networks. However, it is well known that a large number of groups significantly reduces the performance of group convolution. In this paper, we propose a new convolution methodology called "two-level" group convolution that is robust with respect to the increase of the number of groups and suitable for multi-GPU parallel computation. We first observe that the group convolution can be interpreted as a one-level block Jacobi approximation of the standard convolution, which is a popular notion in the field of numerical analysis. In numerical analysis, there have been numerous studies on the two-level method that introduces an intergroup structure that resolves the performance degradation issue without disturbing parallel computation. Motivated by these, we introduce a coarse-level structure which promotes intergroup communication without being a bottleneck in the group convolution. We show that all the additional work induced by the coarse-level structure can be efficiently processed in a distributed memory system. Numerical results that verify the robustness of the proposed method with respect to the number of groups are presented. Moreover, we compare the proposed method to various approaches for group convolution in order to highlight the superiority of the proposed method in terms of execution time, memory efficiency, and performance.
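
The block Jacobi reading of group convolution mentioned in this abstract can be made concrete for the simplest case, a 1x1 convolution, where the layer is just a channel-mixing matrix applied at every pixel. This is a sketch under assumptions: the channel counts, number of groups, and random weights are hypothetical, and the coarse-level structure proposed in the paper is not implemented.

```python
import numpy as np

rng = np.random.default_rng(0)
c_in, c_out, groups, pixels = 8, 8, 4, 10

W = rng.standard_normal((c_out, c_in))      # standard 1x1 convolution weights
x = rng.standard_normal((c_in, pixels))     # input: channels x flattened pixels

# Group convolution keeps only the diagonal blocks of W (the block Jacobi part).
gi, go = c_in // groups, c_out // groups
mask = np.kron(np.eye(groups), np.ones((go, gi)))
W_group = W * mask

# The same result computed group by group, with no intergroup communication.
y_blocks = [W[g * go:(g + 1) * go, g * gi:(g + 1) * gi] @ x[g * gi:(g + 1) * gi]
            for g in range(groups)]
y_group = np.vstack(y_blocks)

# Size of the discarded off-diagonal (intergroup) part relative to the full weights.
dropped = np.linalg.norm(W - W_group) / np.linalg.norm(W)
print(f"fraction of weight norm dropped: {dropped:.2f}")
```

Only `1/groups` of the weights survive the mask, so the dropped intergroup part grows as the number of groups increases.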

Parareal Neural Networks Emulating a Parallel-in-time Algorithm

Mar 16, 2021

Chang-Ock Lee, Youngkyu Lee, Jongho Park

Abstract: As deep neural networks (DNNs) become deeper, the training time increases. From this perspective, multi-GPU parallel computing has become a key tool in accelerating the training of DNNs. In this paper, we introduce a novel methodology to construct a parallel neural network that can utilize multiple GPUs simultaneously from a given DNN. We observe that the layers of a DNN can be interpreted as the time steps of a time-dependent problem and can be parallelized by emulating a parallel-in-time algorithm called parareal. The parareal algorithm consists of fine structures which can be implemented in parallel and a coarse structure which gives suitable approximations to the fine structures. By emulating it, the layers of the DNN are torn to form a parallel structure which is connected using a suitable coarse network. We report accelerated and accuracy-preserved results of the proposed methodology applied to VGG-16 and ResNet-1001 on several datasets.
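
The parareal algorithm that this abstract emulates can be sketched on a scalar test equation. The update used below is the standard parareal correction \(U_{n+1}^{k+1} = G(U_n^{k+1}) + F(U_n^{k}) - G(U_n^{k})\). The model problem, the propagators `coarse` and `fine`, and all parameter values are assumptions for illustration; no neural network is involved.

```python
import numpy as np

lam, T, N = -1.0, 2.0, 10          # y' = lam * y on [0, T], split into N slices
dt = T / N

def coarse(y):
    """One backward-Euler step over a slice (cheap, applied sequentially)."""
    return y / (1.0 - lam * dt)

def fine(y, substeps=100):
    """Many small forward-Euler steps over a slice (costly, parallel across slices)."""
    h = dt / substeps
    for _ in range(substeps):
        y = y + h * lam * y
    return y

# Reference: the fine propagator applied sequentially over all slices.
U_serial = np.empty(N + 1)
U_serial[0] = 1.0
for n in range(N):
    U_serial[n + 1] = fine(U_serial[n])

# Initial guess from the coarse propagator alone.
U = np.empty(N + 1)
U[0] = 1.0
for n in range(N):
    U[n + 1] = coarse(U[n])

errors = []
for k in range(N):
    F_old = np.array([fine(U[n]) for n in range(N)])    # independent across slices
    G_old = np.array([coarse(U[n]) for n in range(N)])
    U_new = np.empty(N + 1)
    U_new[0] = 1.0
    for n in range(N):                                  # cheap sequential correction
        U_new[n + 1] = coarse(U_new[n]) + F_old[n] - G_old[n]
    U = U_new
    errors.append(np.max(np.abs(U - U_serial)))

print(f"error after 1 iteration: {errors[0]:.2e}, after {N}: {errors[-1]:.2e}")
```

The fine evaluations inside each iteration do not depend on one another, which is the part that can be distributed across devices, while the coarse sweep supplies the sequential coupling.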
