Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Alejandro Ribeiro

University of Pennsylvania

Non Commutative Convolutional Signal Models in Neural Networks: Stability to Small Deformations

Oct 05, 2023

Alejandro Parada-Mayorga, Landon Butler, Alejandro Ribeiro

Figure 1 for Non Commutative Convolutional Signal Models in Neural Networks: Stability to Small Deformations

Figure 2 for Non Commutative Convolutional Signal Models in Neural Networks: Stability to Small Deformations

Abstract:In this paper we discuss the results recently published in~[1] about algebraic signal models (ASMs) based on non commutative algebras and their use in convolutional neural networks. Relying on the general tools from algebraic signal processing (ASP), we study the filtering and stability properties of non commutative convolutional filters. We show how non commutative filters can be stable to small perturbations on the space of operators. We also show that although the spectral components of the Fourier representation in a non commutative signal model are associated to spaces of dimension larger than one, there is a trade-off between stability and selectivity similar to that observed for commutative models. Our results have direct implications for group neural networks, multigraph neural networks and quaternion neural networks, among other non commutative architectures. We conclude by corroborating these results through numerical experiments.

Via

Access Paper or Ask Questions

Network Alignment with Transferable Graph Autoencoders

Oct 05, 2023

Jiashu He, Charilaos I. Kanatsoulis, Alejandro Ribeiro

Abstract:Network alignment is the task of establishing one-to-one correspondences between the nodes of different graphs and finds a plethora of applications in high-impact domains. However, this task is known to be NP-hard in its general form, and existing algorithms do not scale up as the size of the graphs increases. To tackle both challenges we propose a novel generalized graph autoencoder architecture, designed to extract powerful and robust node embeddings, that are tailored to the alignment task. We prove that the generated embeddings are associated with the eigenvalues and eigenvectors of the graphs and can achieve more accurate alignment compared to classical spectral methods. Our proposed framework also leverages transfer learning and data augmentation to achieve efficient network alignment at a very large scale without retraining. Extensive experiments on both network and sub-network alignment with real-world graphs provide corroborating evidence supporting the effectiveness and scalability of the proposed approach.

Via

Access Paper or Ask Questions

Learning State-Augmented Policies for Information Routing in Communication Networks

Sep 30, 2023

Sourajit Das, Navid NaderiAlizadeh, Alejandro Ribeiro

Figure 1 for Learning State-Augmented Policies for Information Routing in Communication Networks

Figure 2 for Learning State-Augmented Policies for Information Routing in Communication Networks

Figure 3 for Learning State-Augmented Policies for Information Routing in Communication Networks

Figure 4 for Learning State-Augmented Policies for Information Routing in Communication Networks

Abstract:This paper examines the problem of information routing in a large-scale communication network, which can be formulated as a constrained statistical learning problem having access to only local information. We delineate a novel State Augmentation (SA) strategy to maximize the aggregate information at source nodes using graph neural network (GNN) architectures, by deploying graph convolutions over the topological links of the communication network. The proposed technique leverages only the local information available at each node and efficiently routes desired information to the destination nodes. We leverage an unsupervised learning procedure to convert the output of the GNN architecture to optimal information routing strategies. In the experiments, we perform the evaluation on real-time network topologies to validate our algorithms. Numerical simulations depict the improved performance of the proposed method in training a GNN parameterization as compared to baseline algorithms.

* 13 pages, 11 figures

Via

Access Paper or Ask Questions

Primal-Dual Continual Learning: Stability and Plasticity through Lagrange Multipliers

Sep 29, 2023

Juan Elenter, Navid NaderiAlizadeh, Tara Javidi, Alejandro Ribeiro

Figure 1 for Primal-Dual Continual Learning: Stability and Plasticity through Lagrange Multipliers

Figure 2 for Primal-Dual Continual Learning: Stability and Plasticity through Lagrange Multipliers

Figure 3 for Primal-Dual Continual Learning: Stability and Plasticity through Lagrange Multipliers

Figure 4 for Primal-Dual Continual Learning: Stability and Plasticity through Lagrange Multipliers

Abstract:Continual learning is inherently a constrained learning problem. The goal is to learn a predictor under a \emph{no-forgetting} requirement. Although several prior studies formulate it as such, they do not solve the constrained problem explicitly. In this work, we show that it is both possible and beneficial to undertake the constrained optimization problem directly. To do this, we leverage recent results in constrained learning through Lagrangian duality. We focus on memory-based methods, where a small subset of samples from previous tasks can be stored in a replay buffer. In this setting, we analyze two versions of the continual learning problem: a coarse approach with constraints at the task level and a fine approach with constraints at the sample level. We show that dual variables indicate the sensitivity of the optimal value with respect to constraint perturbations. We then leverage this result to partition the buffer in the coarse approach, allocating more resources to harder tasks, and to populate the buffer in the fine approach, including only impactful samples. We derive sub-optimality bounds, and empirically corroborate our theoretical results in various continual learning benchmarks. We also discuss the limitations of these methods with respect to the amount of memory available and the number of constraints involved in the optimization problem.

Via

Access Paper or Ask Questions

Navigation with shadow prices to optimize multi-commodity flow rates

Sep 25, 2023

Ignacio Boero, Igor Spasojevic, Mariana del Castillo, George Pappas, Vijay Kumar, Alejandro Ribeiro

Figure 1 for Navigation with shadow prices to optimize multi-commodity flow rates

Figure 2 for Navigation with shadow prices to optimize multi-commodity flow rates

Figure 3 for Navigation with shadow prices to optimize multi-commodity flow rates

Figure 4 for Navigation with shadow prices to optimize multi-commodity flow rates

Abstract:We propose a method for providing communication network infrastructure in autonomous multi-agent teams. In particular, we consider a set of communication agents that are placed alongside regular agents from the system in order to improve the rate of information transfer between the latter. In order to find the optimal positions to place such agents, we define a flexible performance function that adapts to network requirements for different systems. We provide an algorithm based on shadow prices of a related convex optimization problem in order to drive the configuration of the complete system towards a local maximum. We apply our method to three different performance functions associated with three practical scenarios in which we show both the performance of the algorithm and the flexibility it allows for optimizing different network requirements.

* (c) 2023 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

Via

Access Paper or Ask Questions

Asynchronous Perception-Action-Communication with Graph Neural Networks

Sep 18, 2023

Saurav Agarwal, Alejandro Ribeiro, Vijay Kumar

Figure 1 for Asynchronous Perception-Action-Communication with Graph Neural Networks

Figure 2 for Asynchronous Perception-Action-Communication with Graph Neural Networks

Figure 3 for Asynchronous Perception-Action-Communication with Graph Neural Networks

Figure 4 for Asynchronous Perception-Action-Communication with Graph Neural Networks

Abstract:Collaboration in large robot swarms to achieve a common global objective is a challenging problem in large environments due to limited sensing and communication capabilities. The robots must execute a Perception-Action-Communication (PAC) loop -- they perceive their local environment, communicate with other robots, and take actions in real time. A fundamental challenge in decentralized PAC systems is to decide what information to communicate with the neighboring robots and how to take actions while utilizing the information shared by the neighbors. Recently, this has been addressed using Graph Neural Networks (GNNs) for applications such as flocking and coverage control. Although conceptually, GNN policies are fully decentralized, the evaluation and deployment of such policies have primarily remained centralized or restrictively decentralized. Furthermore, existing frameworks assume sequential execution of perception and action inference, which is very restrictive in real-world applications. This paper proposes a framework for asynchronous PAC in robot swarms, where decentralized GNNs are used to compute navigation actions and generate messages for communication. In particular, we use aggregated GNNs, which enable the exchange of hidden layer information between robots for computational efficiency and decentralized inference of actions. Furthermore, the modules in the framework are asynchronous, allowing robots to perform sensing, extracting information, communication, action inference, and control execution at different frequencies. We demonstrate the effectiveness of GNNs executed in the proposed framework in navigating large robot swarms for collaborative coverage of large environments.

* Under review: IEEE International Conference on Robotics and Automation (ICRA) 2024

Via

Access Paper or Ask Questions

Robust Localization of Aerial Vehicles via Active Control of Identical Ground Vehicles

Aug 13, 2023

Igor Spasojevic, Xu Liu, Ankit Prabhu, Alejandro Ribeiro, George J. Pappas, Vijay Kumar

Figure 1 for Robust Localization of Aerial Vehicles via Active Control of Identical Ground Vehicles

Figure 2 for Robust Localization of Aerial Vehicles via Active Control of Identical Ground Vehicles

Figure 3 for Robust Localization of Aerial Vehicles via Active Control of Identical Ground Vehicles

Figure 4 for Robust Localization of Aerial Vehicles via Active Control of Identical Ground Vehicles

Abstract:This paper addresses the problem of active collaborative localization in heterogeneous robot teams with unknown data association. It involves positioning a small number of identical unmanned ground vehicles (UGVs) at desired positions so that an unmanned aerial vehicle (UAV) can, through unlabelled measurements of UGVs, uniquely determine its global pose. We model the problem as a sequential two player game, in which the first player positions the UGVs and the second identifies the two distinct hypothetical poses of the UAV at which the sets of measurements to the UGVs differ by as little as possible. We solve the underlying problem from the vantage point of the first player for a subclass of measurement models using a mixture of local optimization and exhaustive search procedures. Real-world experiments with a team of UAV and UGVs show that our method can achieve centimeter-level global localization accuracy. We also show that our method consistently outperforms random positioning of UGVs by a large margin, with as much as a 90% reduction in position and angular estimation error. Our method can tolerate a significant amount of random as well as non-stochastic measurement noise. This indicates its potential for reliable state estimation on board size, weight, and power (SWaP) constrained UAVs. This work enables robust localization in perceptually-challenged GPS-denied environments, thus paving the road for large-scale multi-robot navigation and mapping.

* To appear in IROS 2023

Via

Access Paper or Ask Questions

Transferability of Convolutional Neural Networks in Stationary Learning Tasks

Jul 21, 2023

Damian Owerko, Charilaos I. Kanatsoulis, Jennifer Bondarchuk, Donald J. Bucci Jr, Alejandro Ribeiro

Figure 1 for Transferability of Convolutional Neural Networks in Stationary Learning Tasks

Figure 2 for Transferability of Convolutional Neural Networks in Stationary Learning Tasks

Figure 3 for Transferability of Convolutional Neural Networks in Stationary Learning Tasks

Figure 4 for Transferability of Convolutional Neural Networks in Stationary Learning Tasks

Abstract:Recent advances in hardware and big data acquisition have accelerated the development of deep learning techniques. For an extended period of time, increasing the model complexity has led to performance improvements for various tasks. However, this trend is becoming unsustainable and there is a need for alternative, computationally lighter methods. In this paper, we introduce a novel framework for efficient training of convolutional neural networks (CNNs) for large-scale spatial problems. To accomplish this we investigate the properties of CNNs for tasks where the underlying signals are stationary. We show that a CNN trained on small windows of such signals achieves a nearly performance on much larger windows without retraining. This claim is supported by our theoretical analysis, which provides a bound on the performance degradation. Additionally, we conduct thorough experimental analysis on two tasks: multi-target tracking and mobile infrastructure on demand. Our results show that the CNN is able to tackle problems with many hundreds of agents after being trained with fewer than ten. Thus, CNN architectures provide solutions to these problems at previously computationally intractable scales.

* 14 pages, 7 figures, for associated code see https://github.com/damowerko/mtt

Via

Access Paper or Ask Questions

Intrinsically motivated graph exploration using network theories of human curiosity

Jul 13, 2023

Shubhankar P. Patankar, Mathieu Ouellet, Juan Cervino, Alejandro Ribeiro, Kieran A. Murphy, Dani S. Bassett

Figure 1 for Intrinsically motivated graph exploration using network theories of human curiosity

Figure 2 for Intrinsically motivated graph exploration using network theories of human curiosity

Figure 3 for Intrinsically motivated graph exploration using network theories of human curiosity

Figure 4 for Intrinsically motivated graph exploration using network theories of human curiosity

Abstract:Intrinsically motivated exploration has proven useful for reinforcement learning, even without additional extrinsic rewards. When the environment is naturally represented as a graph, how to guide exploration best remains an open question. In this work, we propose a novel approach for exploring graph-structured data motivated by two theories of human curiosity: the information gap theory and the compression progress theory. The theories view curiosity as an intrinsic motivation to optimize for topological features of subgraphs induced by the visited nodes in the environment. We use these proposed features as rewards for graph neural-network-based reinforcement learning. On multiple classes of synthetically generated graphs, we find that trained agents generalize to larger environments and to longer exploratory walks than are seen during training. Our method computes more efficiently than the greedy evaluation of the relevant topological properties. The proposed intrinsic motivations bear particular relevance for recommender systems. We demonstrate that curiosity-based recommendations are more predictive of human behavior than PageRank centrality for several real-world graph datasets, including MovieLens, Amazon Books, and Wikispeedia.

* 14 pages, 5 figures in main text, and 15 pages, 8 figures in supplement

Via

Access Paper or Ask Questions

Last-Iterate Convergent Policy Gradient Primal-Dual Methods for Constrained MDPs

Jun 20, 2023

Dongsheng Ding, Chen-Yu Wei, Kaiqing Zhang, Alejandro Ribeiro

Abstract:We study the problem of computing an optimal policy of an infinite-horizon discounted constrained Markov decision process (constrained MDP). Despite the popularity of Lagrangian-based policy search methods used in practice, the oscillation of policy iterates in these methods has not been fully understood, bringing out issues such as violation of constraints and sensitivity to hyper-parameters. To fill this gap, we employ the Lagrangian method to cast a constrained MDP into a constrained saddle-point problem in which max/min players correspond to primal/dual variables, respectively, and develop two single-time-scale policy-based primal-dual algorithms with non-asymptotic convergence of their policy iterates to an optimal constrained policy. Specifically, we first propose a regularized policy gradient primal-dual (RPG-PD) method that updates the policy using an entropy-regularized policy gradient, and the dual via a quadratic-regularized gradient ascent, simultaneously. We prove that the policy primal-dual iterates of RPG-PD converge to a regularized saddle point with a sublinear rate, while the policy iterates converge sublinearly to an optimal constrained policy. We further instantiate RPG-PD in large state or action spaces by including function approximation in policy parametrization, and establish similar sublinear last-iterate policy convergence. Second, we propose an optimistic policy gradient primal-dual (OPG-PD) method that employs the optimistic gradient method to update primal/dual variables, simultaneously. We prove that the policy primal-dual iterates of OPG-PD converge to a saddle point that contains an optimal constrained policy, with a linear rate. To the best of our knowledge, this work appears to be the first non-asymptotic policy last-iterate convergence result for single-time-scale algorithms in constrained MDPs.

* 78 pages, 17 figures, and 1 table

Via

Access Paper or Ask Questions