Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Rose Yu

DeepGLEAM: a hybrid mechanistic and deep learning model for COVID-19 forecasting

Feb 15, 2021

Dongxia Wu, Liyao Gao, Xinyue Xiong, Matteo Chinazzi, Alessandro Vespignani, Yian Ma, Rose Yu

Figure 1 for DeepGLEAM: a hybrid mechanistic and deep learning model for COVID-19 forecasting

Figure 2 for DeepGLEAM: a hybrid mechanistic and deep learning model for COVID-19 forecasting

Figure 3 for DeepGLEAM: a hybrid mechanistic and deep learning model for COVID-19 forecasting

Figure 4 for DeepGLEAM: a hybrid mechanistic and deep learning model for COVID-19 forecasting

Abstract:We introduce DeepGLEAM, a hybrid model for COVID-19 forecasting. DeepGLEAM combines a mechanistic stochastic simulation model GLEAM with deep learning. It uses deep learning to learn the correction terms from GLEAM, which leads to improved performance. We further integrate various uncertainty quantification methods to generate confidence intervals. We demonstrate DeepGLEAM on real-world COVID-19 mortality forecasting tasks.

Via

Access Paper or Ask Questions

Bridging Physics-based and Data-driven modeling for Learning Dynamical Systems

Nov 20, 2020

Rui Wang, Danielle Maddix, Christos Faloutsos, Yuyang Wang, Rose Yu

Figure 1 for Bridging Physics-based and Data-driven modeling for Learning Dynamical Systems

Figure 2 for Bridging Physics-based and Data-driven modeling for Learning Dynamical Systems

Figure 3 for Bridging Physics-based and Data-driven modeling for Learning Dynamical Systems

Figure 4 for Bridging Physics-based and Data-driven modeling for Learning Dynamical Systems

Abstract:How can we learn a dynamical system to make forecasts, when some variables are unobserved? For instance, in COVID-19, we want to forecast the number of infected and death cases but we do not know the count of susceptible and exposed people. While mechanics compartment models are widely-used in epidemic modeling, data-driven models are emerging for disease forecasting. As a case study, we compare these two types of models for COVID-19 forecasting and notice that physics-based models significantly outperform deep learning models. We present a hybrid approach, AutoODE-COVID, which combines a novel compartmental model with automatic differentiation. Our method obtains a 57.4% reduction in mean absolute errors for 7-day ahead COVID-19 forecasting compared with the best deep learning competitor. To understand the inferior performance of deep learning, we investigate the generalization problem in forecasting. Through systematic experiments, we found that deep learning models fail to forecast under shifted distributions either in the data domain or the parameter domain. This calls attention to rethink generalization especially for learning dynamical systems.

Via

Access Paper or Ask Questions

Trajectory Prediction using Equivariant Continuous Convolution

Oct 21, 2020

Robin Walters, Jinxi Li, Rose Yu

Figure 1 for Trajectory Prediction using Equivariant Continuous Convolution

Figure 2 for Trajectory Prediction using Equivariant Continuous Convolution

Figure 3 for Trajectory Prediction using Equivariant Continuous Convolution

Figure 4 for Trajectory Prediction using Equivariant Continuous Convolution

Abstract:Trajectory prediction is a critical part of many AI applications, for example, the safe operation of autonomous vehicles. However, current methods are prone to making inconsistent and physically unrealistic predictions. We leverage insights from fluid dynamics to overcome this limitation by considering internal symmetry in trajectories. We propose a novel model, Equivariant Continous COnvolution (ECCO) for improved trajectory prediction. ECCO uses rotationally-equivariant continuous convolutions to embed the symmetries of the system. On two real-world vehicle and pedestrian trajectory datasets, ECCO attains competitive accuracy with significantly fewer parameters. It is also more sample efficient, generalizing automatically from few data points in any orientation. Lastly, ECCO improves generalization with equivariance, resulting in more physically consistent predictions. Our method provides a fresh perspective towards increasing trust and transparency in deep learning models.

* 16 pages

Via

Access Paper or Ask Questions

Deep Imitation Learning for Bimanual Robotic Manipulation

Oct 11, 2020

Fan Xie, Alexander Chowdhury, M. Clara De Paolis Kaluza, Linfeng Zhao, Lawson L. S. Wong, Rose Yu

Figure 1 for Deep Imitation Learning for Bimanual Robotic Manipulation

Figure 2 for Deep Imitation Learning for Bimanual Robotic Manipulation

Figure 3 for Deep Imitation Learning for Bimanual Robotic Manipulation

Figure 4 for Deep Imitation Learning for Bimanual Robotic Manipulation

Abstract:We present a deep imitation learning framework for robotic bimanual manipulation in a continuous state-action space. Imitation learning has been effectively utilized in mimicking bimanual manipulation movements, but generalizing the movement to objects in different locations has not been explored. We hypothesize that to precisely generalize the learned behavior relative to an object's location requires modeling relational information in the environment. To achieve this, we designed a method that (i) uses a multi-model framework to decomposes complex dynamics into elemental movement primitives, and (ii) parameterizes each primitive using a recurrent graph neural network to capture interactions. Our model is a deep, hierarchical, modular architecture with a high-level planner that learns to compose primitives sequentially and a low-level controller which integrates primitive dynamics modules and inverse kinematics control. We demonstrate the effectiveness using several simulated bimanual robotic manipulation tasks. Compared to models based on previous imitation learning studies, our model generalizes better and achieves higher success rates in the simulated tasks.

Via

Access Paper or Ask Questions

Dynamic Relational Inference in Multi-Agent Trajectories

Jul 16, 2020

Ruichao Xiao, Rose Yu

Figure 1 for Dynamic Relational Inference in Multi-Agent Trajectories

Figure 2 for Dynamic Relational Inference in Multi-Agent Trajectories

Figure 3 for Dynamic Relational Inference in Multi-Agent Trajectories

Figure 4 for Dynamic Relational Inference in Multi-Agent Trajectories

Abstract:Inferring interactions from multi-agent trajectories has broad applications in physics, vision and robotics. Neural relational inference (NRI) is a deep generative model that can reason about relations in complex dynamics without supervision. In this paper, we take a careful look at this approach for relational inference in multi-agent trajectories. First, we discover that NRI can be fundamentally limited without sufficient long-term observations. Its ability to accurately infer interactions degrades drastically for short output sequences. Next, we consider a more general setting of relational inference when interactions are changing overtime. We propose an extension ofNRI, which we call the DYnamic multi-AgentRelational Inference (DYARI) model that can reason about dynamic relations. We conduct exhaustive experiments to study the effect of model architecture, under-lying dynamics and training scheme on the performance of dynamic relational inference using a simulated physics system. We also showcase the usage of our model on real-world multi-agent basketball trajectories.

Via

Access Paper or Ask Questions

Finding Patient Zero: Learning Contagion Source with Graph Neural Networks

Jun 27, 2020

Chintan Shah, Nima Dehmamy, Nicola Perra, Matteo Chinazzi, Albert-László Barabási, Alessandro Vespignani, Rose Yu

Figure 1 for Finding Patient Zero: Learning Contagion Source with Graph Neural Networks

Figure 2 for Finding Patient Zero: Learning Contagion Source with Graph Neural Networks

Figure 3 for Finding Patient Zero: Learning Contagion Source with Graph Neural Networks

Figure 4 for Finding Patient Zero: Learning Contagion Source with Graph Neural Networks

Abstract:Locating the source of an epidemic, or patient zero (P0), can provide critical insights into the infection's transmission course and allow efficient resource allocation. Existing methods use graph-theoretic centrality measures and expensive message-passing algorithms, requiring knowledge of the underlying dynamics and its parameters. In this paper, we revisit this problem using graph neural networks (GNNs) to learn P0. We establish a theoretical limit for the identification of P0 in a class of epidemic models. We evaluate our method against different epidemic models on both synthetic and a real-world contact network considering a disease with history and characteristics of COVID-19. % We observe that GNNs can identify P0 close to the theoretical bound on accuracy, without explicit input of dynamics or its parameters. In addition, GNN is over 100 times faster than classic methods for inference on arbitrary graph topologies. Our theoretical bound also shows that the epidemic is like a ticking clock, emphasizing the importance of early contact-tracing. We find a maximum time after which accurate recovery of the source becomes impossible, regardless of the algorithm used.

Via

Access Paper or Ask Questions

Learning Disentangled Representations of Video with Missing Data

Jun 23, 2020

Armand Comas Massague, Chi Zhang, Zlatan Feric, Octavia Camps, Rose Yu

Figure 1 for Learning Disentangled Representations of Video with Missing Data

Figure 2 for Learning Disentangled Representations of Video with Missing Data

Figure 3 for Learning Disentangled Representations of Video with Missing Data

Figure 4 for Learning Disentangled Representations of Video with Missing Data

Abstract:Missing data poses significant challenges while learning representations of video sequences. We present Disentangled Imputed Video autoEncoder (DIVE), a deep generative model that imputes and predicts future video frames in the presence of missing data. Specifically, DIVE introduces a missingness latent variable, disentangles the hidden video representations into static and dynamic appearance, pose, and missingness factors for each object, while it imputes each object trajectory where data is missing. On a moving MNIST dataset with various missing scenarios, DIVE outperforms the state of the art baselines by a substantial margin. We also present comparisons for real-world MOTSChallenge pedestrian dataset, which demonstrates the practical value of our method in a more realistic setting.

Via

Access Paper or Ask Questions

Aortic Pressure Forecasting with Deep Sequence Learning

May 12, 2020

Rui Wang, Eliza Huang, Uma Chandrasekaran, Rose Yu

Figure 1 for Aortic Pressure Forecasting with Deep Sequence Learning

Figure 2 for Aortic Pressure Forecasting with Deep Sequence Learning

Figure 3 for Aortic Pressure Forecasting with Deep Sequence Learning

Figure 4 for Aortic Pressure Forecasting with Deep Sequence Learning

Abstract:Mean aortic pressure is a major determinant of perfusion in all organ systems. The ability to forecast the mean aortic pressure would enhance the ability of physicians to estimate prognosis of the patient and assist in early detection of hemodynamic instability. However, forecasting aortic pressure is challenging because the blood pressure time series is noisy and can be highly non-stationary. In this study, we provided a benchmark study of different deep sequence learning models on pump performance data obtained in patients who underwent high-risk percutaneous intervention with transvalvular micro-axial heart pump support. The aim of this study was to forecast the mean aortic pressure five minutes in advance, using the time series data of previous five minutes as input. We performed comprehensive study on time series with increasing, decreasing, and stationary trends. The experiments show promising results with the Legendre Memory Unit architecture achieving the best performance with an overall RMSE of 1.837 mmHg.

Via

Access Paper or Ask Questions

Incorporating Symmetry into Deep Dynamics Models for Improved Generalization

Mar 08, 2020

Rui Wang, Robin Walters, Rose Yu

Figure 1 for Incorporating Symmetry into Deep Dynamics Models for Improved Generalization

Figure 2 for Incorporating Symmetry into Deep Dynamics Models for Improved Generalization

Figure 3 for Incorporating Symmetry into Deep Dynamics Models for Improved Generalization

Figure 4 for Incorporating Symmetry into Deep Dynamics Models for Improved Generalization

Abstract:Training machine learning models that can learn complex spatiotemporal dynamics and generalize under distributional shift is a fundamental challenge. The symmetries in a physical system play a unique role in characterizing unchanged features under transformation. We propose a systematic approach to improve generalization in spatiotemporal models by incorporating symmetries into deep neural networks. Our general framework to design equivariant convolutional models employs (1) convolution with equivariant kernels, (2) conjugation by averaging operators in order to force equivariance, (3) and a naturally equivariant generalization of convolution called group correlation. Our framework is both theoretically and experimentally robust to distributional shift by a symmetry group and enjoys favorable sample complexity. We demonstrate the advantage of our approach on a variety of physical dynamics including turbulence and diffusion systems. This is the first time that equivariant CNNs have been used to forecast physical dynamics.

Via

Access Paper or Ask Questions

Multiresolution Tensor Learning for Efficient and Interpretable Spatial Analysis

Feb 15, 2020

Jung Yeon Park, Kenneth Theo Carr, Stephan Zheng, Yisong Yue, Rose Yu

Figure 1 for Multiresolution Tensor Learning for Efficient and Interpretable Spatial Analysis

Figure 2 for Multiresolution Tensor Learning for Efficient and Interpretable Spatial Analysis

Figure 3 for Multiresolution Tensor Learning for Efficient and Interpretable Spatial Analysis

Figure 4 for Multiresolution Tensor Learning for Efficient and Interpretable Spatial Analysis

Abstract:Efficient and interpretable spatial analysis is crucial in many fields such as geology, sports, and climate science. Large-scale spatial data often contains complex higher-order correlations across features and locations. While tensor latent factor models can describe higher-order correlations, they are inherently computationally expensive to train. Furthermore, for spatial analysis, these models should not only be predictive but also be spatially coherent. However, latent factor models are sensitive to initialization and can yield inexplicable results. We develop a novel Multi-resolution Tensor Learning (MRTL) algorithm for efficiently learning interpretable spatial patterns. MRTL initializes the latent factors from an approximate full-rank tensor model for improved interpretability and progressively learns from a coarse resolution to the fine resolution for an enormous computation speedup. We also prove the theoretical convergence and computational complexity of MRTL. When applied to two real-world datasets, MRTL demonstrates 4 ~ 5 times speedup compared to a fixed resolution while yielding accurate and interpretable models.

Via

Access Paper or Ask Questions