Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Radu Balan

PICore: Physics-Informed Unsupervised Coreset Selection for Data Efficient Neural Operator Training

Jul 23, 2025

Anirudh Satheesh, Anant Khandelwal, Mucong Ding, Radu Balan

Abstract:Neural operators offer a powerful paradigm for solving partial differential equations (PDEs) that cannot be solved analytically by learning mappings between function spaces. However, there are two main bottlenecks in training neural operators: they require a significant amount of training data to learn these mappings, and this data needs to be labeled, which can only be accessed via expensive simulations with numerical solvers. To alleviate both of these issues simultaneously, we propose PICore, an unsupervised coreset selection framework that identifies the most informative training samples without requiring access to ground-truth PDE solutions. PICore leverages a physics-informed loss to select unlabeled inputs by their potential contribution to operator learning. After selecting a compact subset of inputs, only those samples are simulated using numerical solvers to generate labels, reducing annotation costs. We then train the neural operator on the reduced labeled dataset, significantly decreasing training time as well. Across four diverse PDE benchmarks and multiple coreset selection strategies, PICore achieves up to 78% average increase in training efficiency relative to supervised coreset selection methods with minimal changes in accuracy. We provide code at https://github.com/Asatheesh6561/PICore.

* Submitted to TMLR 2025

Via

Access Paper or Ask Questions

Stability of sorting based embeddings

Oct 07, 2024

Radu Balan, Efstratios Tsoukanis, Matthias Wellershoff

Abstract:Consider a group $G$ of order $M$ acting unitarily on a real inner product space $V$. We show that the sorting based embedding obtained by applying a general linear map $\alpha : \mathbb{R}^{M \times N} \to \mathbb{R}^D$ to the invariant map $\beta_\Phi : V \to \mathbb{R}^{M \times N}$ given by sorting the coorbits $(\langle v, g \phi_i \rangle_V)_{g \in G}$, where $(\phi_i)_{i=1}^N \in V$, satisfies a bi-Lipschitz condition if and only if it separates orbits. Additionally, we note that any invariant Lipschitz continuous map (into a Hilbert space) factors through the sorting based embedding, and that any invariant continuous map (into a locally convex space) factors through the sorting based embedding as well.

* 25 pages

Via

Access Paper or Ask Questions

Approximation of the Proximal Operator of the $\ell_\infty$ Norm Using a Neural Network

Aug 20, 2024

Kathryn Linehan, Radu Balan

Abstract:Computing the proximal operator of the $\ell_\infty$ norm, $\textbf{prox}_{\alpha ||\cdot||_\infty}(\mathbf{x})$, generally requires a sort of the input data, or at least a partial sort similar to quicksort. In order to avoid using a sort, we present an $O(m)$ approximation of $\textbf{prox}_{\alpha ||\cdot||_\infty}(\mathbf{x})$ using a neural network. A novel aspect of the network is that it is able to accept vectors of varying lengths due to a feature selection process that uses moments of the input data. We present results on the accuracy of the approximation, feature importance, and computational efficiency of the approach. We show that the network outperforms a "vanilla neural network" that does not use feature selection. We also present an algorithm with corresponding theory to calculate $\textbf{prox}_{\alpha ||\cdot||_\infty}(\mathbf{x})$ exactly, relate it to the Moreau decomposition, and compare its computational efficiency to that of the approximation.

* 30 pages, 5 figures, 6 tables

Via

Access Paper or Ask Questions

Coupled Multiwavelet Neural Operator Learning for Coupled Partial Differential Equations

Mar 04, 2023

Xiongye Xiao, Defu Cao, Ruochen Yang, Gaurav Gupta, Gengshuo Liu, Chenzhong Yin, Radu Balan, Paul Bogdan

Figure 1 for Coupled Multiwavelet Neural Operator Learning for Coupled Partial Differential Equations

Figure 2 for Coupled Multiwavelet Neural Operator Learning for Coupled Partial Differential Equations

Figure 3 for Coupled Multiwavelet Neural Operator Learning for Coupled Partial Differential Equations

Figure 4 for Coupled Multiwavelet Neural Operator Learning for Coupled Partial Differential Equations

Abstract:Coupled partial differential equations (PDEs) are key tasks in modeling the complex dynamics of many physical processes. Recently, neural operators have shown the ability to solve PDEs by learning the integral kernel directly in Fourier/Wavelet space, so the difficulty for solving the coupled PDEs depends on dealing with the coupled mappings between the functions. Towards this end, we propose a \textit{coupled multiwavelets neural operator} (CMWNO) learning scheme by decoupling the coupled integral kernels during the multiwavelet decomposition and reconstruction procedures in the Wavelet space. The proposed model achieves significantly higher accuracy compared to previous learning-based solvers in solving the coupled PDEs including Gray-Scott (GS) equations and the non-local mean field game (MFG) problem. According to our experimental results, the proposed model exhibits a $2\times \sim 4\times$ improvement relative $L$2 error compared to the best results from the state-of-the-art models.

* International Conference on Learning Representations (ICLR-2023)
* Accepted to ICLR 2023

Via

Access Paper or Ask Questions

VQ-Flows: Vector Quantized Local Normalizing Flows

Mar 22, 2022

Sahil Sidheekh, Chris B. Dock, Tushar Jain, Radu Balan, Maneesh K. Singh

Figure 1 for VQ-Flows: Vector Quantized Local Normalizing Flows

Figure 2 for VQ-Flows: Vector Quantized Local Normalizing Flows

Figure 3 for VQ-Flows: Vector Quantized Local Normalizing Flows

Figure 4 for VQ-Flows: Vector Quantized Local Normalizing Flows

Abstract:Normalizing flows provide an elegant approach to generative modeling that allows for efficient sampling and exact density evaluation of unknown data distributions. However, current techniques have significant limitations in their expressivity when the data distribution is supported on a low-dimensional manifold or has a non-trivial topology. We introduce a novel statistical framework for learning a mixture of local normalizing flows as "chart maps" over the data manifold. Our framework augments the expressivity of recent approaches while preserving the signature property of normalizing flows, that they admit exact density evaluation. We learn a suitable atlas of charts for the data manifold via a vector quantized auto-encoder (VQ-AE) and the distributions over them using a conditional flow. We validate experimentally that our probabilistic framework enables existing approaches to better model data distributions over complex manifolds.

Via

Access Paper or Ask Questions

Permutation Invariant Representations with Applications to Graph Deep Learning

Mar 14, 2022

Radu Balan, Naveed Haghani, Maneesh Singh

Figure 1 for Permutation Invariant Representations with Applications to Graph Deep Learning

Figure 2 for Permutation Invariant Representations with Applications to Graph Deep Learning

Figure 3 for Permutation Invariant Representations with Applications to Graph Deep Learning

Figure 4 for Permutation Invariant Representations with Applications to Graph Deep Learning

Abstract:This paper presents primarily two Euclidean embeddings of the quotient space generated by matrices that are identified modulo arbitrary row permutations. The original application is in deep learning on graphs where the learning task is invariant to node relabeling. Two embedding schemes are introduced, one based on sorting and the other based on algebras of multivariate polynomials. While both embeddings exhibit a computational complexity exponential in problem size, the sorting based embedding is globally bi-Lipschitz and admits a low dimensional target space. Additionally, an almost everywhere injective scheme can be implemented with minimal redundancy and low computational cost. In turn, this proves that almost any classifier can be implemented with an arbitrary small loss of performance. Numerical experiments are carried out on two data sets, a chemical compound data set (QM9) and a proteins data set (PROTEINS).

* 43 pages, 13 figures, 16 tables

Via

Access Paper or Ask Questions

Convergence Guarantees for Deep Epsilon Greedy Policy Learning

Dec 02, 2021

Michael Rawson, Radu Balan

Figure 1 for Convergence Guarantees for Deep Epsilon Greedy Policy Learning

Figure 2 for Convergence Guarantees for Deep Epsilon Greedy Policy Learning

Abstract:Policy learning is a quickly growing area. As robotics and computers control day-to-day life, their error rate needs to be minimized and controlled. There are many policy learning methods and provable error rates that accompany them. We show an error or regret bound and convergence of the Deep Epsilon Greedy method which chooses actions with a neural network's prediction. In experiments with the real-world dataset MNIST, we construct a nonlinear reinforcement learning problem. We witness how with either high or low noise, some methods do and some do not converge which agrees with our proof of convergence.

Via

Access Paper or Ask Questions

An Exact Hypergraph Matching Algorithm for Nuclear Identification in Embryonic Caenorhabditis elegans

Apr 20, 2021

Andrew Lauziere, Ryan Christensen, Hari Shroff, Radu Balan

Figure 1 for An Exact Hypergraph Matching Algorithm for Nuclear Identification in Embryonic Caenorhabditis elegans

Figure 2 for An Exact Hypergraph Matching Algorithm for Nuclear Identification in Embryonic Caenorhabditis elegans

Figure 3 for An Exact Hypergraph Matching Algorithm for Nuclear Identification in Embryonic Caenorhabditis elegans

Figure 4 for An Exact Hypergraph Matching Algorithm for Nuclear Identification in Embryonic Caenorhabditis elegans

Abstract:Finding an optimal correspondence between point sets is a common task in computer vision. Existing techniques assume relatively simple relationships among points and do not guarantee an optimal match. We introduce an algorithm capable of exactly solving point set matching by modeling the task as hypergraph matching. The algorithm extends the classical branch and bound paradigm to select and aggregate vertices under a proposed decomposition of the multilinear objective function. The methodology is motivated by Caenorhabditis elegans, a model organism used frequently in developmental biology and neurobiology. The embryonic C. elegans contains seam cells that can act as fiducial markers allowing the identification of other nuclei during embryo development. The proposed algorithm identifies seam cells more accurately than established point-set matching methods, while providing a framework to approach other similarly complex point set matching tasks.

* 20 pages, 11 figures

Via

Access Paper or Ask Questions

On Lipschitz Bounds of General Convolutional Neural Networks

Aug 04, 2018

Dongmian Zou, Radu Balan, Maneesh Singh

Figure 1 for On Lipschitz Bounds of General Convolutional Neural Networks

Figure 2 for On Lipschitz Bounds of General Convolutional Neural Networks

Figure 3 for On Lipschitz Bounds of General Convolutional Neural Networks

Figure 4 for On Lipschitz Bounds of General Convolutional Neural Networks

Abstract:Many convolutional neural networks (CNNs) have a feed-forward structure. In this paper, a linear program that estimates the Lipschitz bound of such CNNs is proposed. Several CNNs, including the scattering networks, the AlexNet and the GoogleNet, are studied numerically and compared to the theoretical bounds. Next, concentration inequalities of the output distribution to a stationary random input signal expressed in terms of the Lipschitz bound are established. The Lipschitz bound is further used to establish a nonlinear discriminant analysis designed to measure the separation between features of different classes.

* 26 pages, 20 figures

Via

Access Paper or Ask Questions

Learning flexible representations of stochastic processes on graphs

Mar 13, 2018

Addison Bohannon, Brian Sadler, Radu Balan

Figure 1 for Learning flexible representations of stochastic processes on graphs

Figure 2 for Learning flexible representations of stochastic processes on graphs

Abstract:Graph convolutional networks adapt the architecture of convolutional neural networks to learn rich representations of data supported on arbitrary graphs by replacing the convolution operations of convolutional neural networks with graph-dependent linear operations. However, these graph-dependent linear operations are developed for scalar functions supported on undirected graphs. We propose a class of linear operations for stochastic (time-varying) processes on directed (or undirected) graphs to be used in graph convolutional networks. We propose a parameterization of such linear operations using functional calculus to achieve arbitrarily low learning complexity. The proposed approach is shown to model richer behaviors and display greater flexibility in learning representations than product graph methods.

Via

Access Paper or Ask Questions