Andreas Krause

Safe Contextual Bayesian Optimization for Sustainable Room Temperature PID Control Tuning

Jun 28, 2019
Marcello Fiducioso, Sebastian Curi, Benedikt Schumacher, Markus Gwerder, Andreas Krause

We tune one of the most common heating, ventilation, and air conditioning (HVAC) control loops, namely the temperature control of a room. For economical and environmental reasons, it is of prime importance to optimize the performance of this system. Buildings account for 20 to 40% of a country's energy consumption, and almost 50% of it comes from HVAC systems. Scenario projections predict a 30% decrease in heating consumption by 2050 due to efficiency increase. Advanced control techniques can improve performance; however, proportional-integral-derivative (PID) control is typically used due to its simplicity and overall performance. We use Safe Contextual Bayesian Optimization to optimize the PID parameters without human intervention. We reduce costs by 32% compared to the current PID controller setting while assuring safety and comfort to people in the room. The results of this work have an immediate impact on the performance of the room control loop and its related commissioning costs. Furthermore, this successful attempt paves the way for further use at different levels of HVAC systems, with promising savings in energy, operational, and commissioning costs, and it is a practical demonstration of the positive effects that Artificial Intelligence can have on environmental sustainability.

* 7 pages; Published in IJCAI 19

Learning-based Model Predictive Control for Safe Exploration and Reinforcement Learning

Jun 27, 2019
Torsten Koller, Felix Berkenkamp, Matteo Turchetta, Joschka Boedecker, Andreas Krause

Reinforcement learning has been successfully used to solve difficult tasks in complex unknown environments. However, these methods typically do not provide any safety guarantees during the learning process. This is particularly problematic, since reinforcement learning agents actively explore their environment. This prevents their use in safety-critical, real-world applications. In this paper, we present a learning-based model predictive control scheme that provides high-probability safety guarantees throughout the learning process. Based on a reliable statistical model, we construct provably accurate confidence intervals on predicted trajectories. Unlike previous approaches, we allow for input-dependent uncertainties. Based on these reliable predictions, we guarantee that trajectories satisfy safety constraints. Moreover, we use a terminal set constraint to recursively guarantee the existence of safe control actions at every iteration. We evaluate the resulting algorithm by safely exploring the dynamics of an inverted pendulum and by solving a reinforcement learning task on a cart-pole system with safety constraints.

* 14 pages, 7 figures. arXiv admin note: substantial text overlap with arXiv:1803.08287

Stochastic Bandits with Context Distributions

Jun 06, 2019
Johannes Kirschner, Andreas Krause

We introduce a novel stochastic contextual bandit model, where at each step the adversary chooses a distribution over a context set. The learner observes only the context distribution while the exact context realization remains hidden. This allows for a broader range of applications, for instance when the context itself is based on predictions. By adapting the UCB algorithm to this setting, we propose an algorithm that achieves a $\tilde{\mathcal{O}}(d\sqrt{T})$ high-probability regret bound for linearly parametrized reward functions. Our results strictly generalize previous work in the sense that both our model and the algorithm reduce to the standard setting when the environment chooses only Dirac delta distributions and therefore provides the exact context to the learner. We further obtain similar results for a variant where the learner observes the realized context after choosing the action, and we extend the results to the kernelized setting. Finally, we demonstrate the proposed method on synthetic and real-world datasets.
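As a rough illustration of the setting described in this abstract, the sketch below runs a linear UCB rule on features averaged under the observed context distribution. It is a minimal sketch under stated assumptions, not the authors' implementation: the finite context set, the feature tensor `phi`, the fixed confidence scale `beta`, and all function names are hypothetical choices made for the example.

```python
import numpy as np

def expected_features(phi, context_probs):
    """Average per-context features phi[c, a, :] under the observed distribution."""
    # phi: (n_contexts, n_actions, d); context_probs: (n_contexts,)
    return np.tensordot(context_probs, phi, axes=1)  # -> (n_actions, d)

def ucb_action(psi, V, b, beta):
    """Pick the action maximizing a linear UCB index on expected features."""
    V_inv = np.linalg.inv(V)
    theta_hat = V_inv @ b                      # ridge-regression estimate
    mean = psi @ theta_hat
    width = np.sqrt(np.einsum("ad,de,ae->a", psi, V_inv, psi))
    return int(np.argmax(mean + beta * width))

def run(T=500, n_contexts=4, n_actions=5, d=3, beta=1.0, noise=0.1, seed=0):
    """Simulate the interaction; all problem sizes here are illustrative."""
    rng = np.random.default_rng(seed)
    phi = rng.normal(size=(n_contexts, n_actions, d))
    theta = rng.normal(size=d)                 # unknown reward parameter
    V, b = np.eye(d), np.zeros(d)
    for _ in range(T):
        mu = rng.dirichlet(np.ones(n_contexts))  # distribution shown to the learner
        psi = expected_features(phi, mu)
        a = ucb_action(psi, V, b, beta)
        c = rng.choice(n_contexts, p=mu)         # realization stays hidden
        reward = phi[c, a] @ theta + noise * rng.normal()
        V += np.outer(psi[a], psi[a])            # update uses expected features only
        b += reward * psi[a]
    return np.linalg.solve(V, b), theta
```

When `context_probs` is a one-hot (Dirac) vector, `expected_features` returns the features of that single context, which mirrors the reduction to the standard setting mentioned in the abstract.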

Learning Generative Models across Incomparable Spaces

May 15, 2019
Charlotte Bunne, David Alvarez-Melis, Andreas Krause, Stefanie Jegelka

Generative Adversarial Networks have shown remarkable success in learning a distribution that faithfully recovers a reference distribution in its entirety. However, in some cases, we may want to only learn some aspects (e.g., cluster or manifold structure), while modifying others (e.g., style, orientation or dimension). In this work, we propose an approach to learn generative models across such incomparable spaces, and demonstrate how to steer the learned distribution towards target properties. A key component of our model is the Gromov-Wasserstein distance, a notion of discrepancy that compares distributions relationally rather than absolutely. While this framework subsumes current generative models in identically reproducing distributions, its inherent flexibility allows application to tasks in manifold learning, relational learning and cross-domain learning.

* Proceedings of Machine Learning Research (PMLR), 97 (2019)
* International Conference on Machine Learning (ICML)

Online Variance Reduction with Mixtures

Mar 29, 2019
Zalán Borsos, Sebastian Curi, Kfir Y. Levy, Andreas Krause

Adaptive importance sampling for stochastic optimization is a promising approach that offers improved convergence through variance reduction. In this work, we propose a new framework for variance reduction that enables the use of mixtures over predefined sampling distributions, which can naturally encode prior knowledge about the data. While these sampling distributions are fixed, the mixture weights are adapted during the optimization process. We propose VRM, a novel and efficient adaptive scheme that asymptotically recovers the best mixture weights in hindsight and can also accommodate sampling distributions over sets of points. We empirically demonstrate the versatility of VRM in a range of applications.
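To make the idea of sampling from a mixture of fixed distributions concrete, the sketch below shows the importance-weighted estimator for a given set of mixture weights and the second moment that such weights influence. It does not implement VRM's weight-update rule, which the abstract does not specify; the function names and the toy numbers are assumptions made for illustration.

```python
import numpy as np

def mixture_probs(weights, dists):
    """Combine K fixed sampling distributions (rows of dists) with mixture weights."""
    return weights @ dists  # shape (n,)

def is_estimate(values, probs, idx):
    """Importance-weighted estimate of mean(values) from sampled indices idx."""
    n = len(values)
    return np.mean(values[idx] / (n * probs[idx]))

def second_moment(values, probs):
    """Second moment of the single-sample estimate; lower means lower variance."""
    n = len(values)
    return np.sum(values ** 2 / (n ** 2 * probs))

# Toy data: one point dominates, so sampling it more often reduces variance.
values = np.array([1.0, 1.0, 1.0, 9.0])
dists = np.array([
    [0.25, 0.25, 0.25, 0.25],   # uniform sampling
    [0.10, 0.10, 0.10, 0.70],   # hypothetical prior favoring the last point
])
uniform_only = mixture_probs(np.array([1.0, 0.0]), dists)
mixed = mixture_probs(np.array([0.5, 0.5]), dists)

rng = np.random.default_rng(0)
idx = rng.choice(len(values), size=1000, p=mixed)
estimate = is_estimate(values, mixed, idx)
```

Because each sampled value is divided by `n * probs[idx]`, the estimator stays unbiased for any mixture weights with full support; the weights only change its variance, which is the quantity an adaptive scheme would try to reduce.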

AReS and MaRS - Adversarial and MMD-Minimizing Regression for SDEs

Feb 22, 2019
Gabriele Abbati, Philippe Wenk, Stefan Bauer, Michael A Osborne, Andreas Krause, Bernhard Schölkopf

Stochastic differential equations are an important modeling class in many disciplines. Consequently, there exist many methods relying on various discretization and numerical integration schemes. In this paper, we propose a novel, probabilistic model for estimating the drift and diffusion given noisy observations of the underlying stochastic system. Using state-of-the-art adversarial and moment matching inference techniques, we circumvent the use of the discretization schemes as seen in classical approaches. This yields significant improvements in parameter estimation accuracy and robustness given random initial guesses. On four commonly used benchmark systems, we demonstrate the performance of our algorithms compared to state-of-the-art solutions based on extended Kalman filtering and Gaussian processes.

Multi-Player Bandits: The Adversarial Case

Feb 21, 2019
Pragnya Alatur, Kfir Y. Levy, Andreas Krause

We consider a setting where multiple players sequentially choose among a common set of actions (arms). Motivated by a cognitive radio networks application, we assume that players incur a loss upon colliding, and that communication between players is not possible. Existing approaches assume that the system is stationary. Yet this assumption is often violated in practice, e.g., due to signal strength fluctuations. In this work, we design the first Multi-player Bandit algorithm that provably works in arbitrarily changing environments, where the losses of the arms may even be chosen by an adversary. This resolves an open problem posed by Rosenski, Shamir, and Szlak (2016).

ODIN: ODE-Informed Regression for Parameter and State Inference in Time-Continuous Dynamical Systems

Feb 17, 2019
Philippe Wenk, Gabriele Abbati, Stefan Bauer, Michael A Osborne, Andreas Krause, Bernhard Schölkopf

Parameter inference in ordinary differential equations is an important problem in many applied sciences and in engineering, especially in a data-scarce setting. In this work, we introduce a novel generative modeling approach based on constrained Gaussian processes and use it to create a computationally and data efficient algorithm for state and parameter inference. In an extensive set of experiments, our approach outperforms its competitors both in terms of accuracy and computational cost for parameter inference. It also shows promising results for the much more challenging problem of model selection.

Adaptive Sequence Submodularity

Feb 15, 2019
Marko Mitrovic, Ehsan Kazemi, Moran Feldman, Andreas Krause, Amin Karbasi

In many machine learning applications, one needs to interactively select a sequence of items (e.g., recommending movies based on a user's feedback) or make sequential decisions in certain orders (e.g., guiding an agent through a series of states). Not only do sequences already pose a dauntingly large search space, but we must take into account past observations, as well as the uncertainty of future outcomes. Without further structure, finding an optimal sequence is notoriously challenging, if not completely intractable. In this paper, we introduce adaptive sequence submodularity, a rich framework that generalizes the notion of submodularity to adaptive policies that explicitly consider sequential dependencies between items. We show that once such dependencies are encoded by a directed graph, an adaptive greedy policy is guaranteed to achieve a constant-factor approximation, where the constant naturally depends on the structural properties of the underlying graph. Additionally, to demonstrate the practical utility of our results, we run experiments on Amazon product recommendation and Wikipedia link prediction tasks.

Adaptive and Safe Bayesian Optimization in High Dimensions via One-Dimensional Subspaces

Feb 08, 2019
Johannes Kirschner, Mojmír Mutný, Nicole Hiller, Rasmus Ischebeck, Andreas Krause

Bayesian optimization is known to be difficult to scale to high dimensions, because the acquisition step requires solving a non-convex optimization problem in the same search space. In order to scale the method and keep its benefits, we propose an algorithm (LineBO) that restricts the problem to a sequence of iteratively chosen one-dimensional sub-problems. We show that our algorithm converges globally and obtains a fast local rate when the function is strongly convex. Further, if the objective has an invariant subspace, our method automatically adapts to the effective dimension without changing the algorithm. Our method scales well to high dimensions and makes use of a global Gaussian process model. When combined with the SafeOpt algorithm to solve the sub-problems, we obtain the first safe Bayesian optimization algorithm with theoretical guarantees applicable in high-dimensional settings. We evaluate our method on multiple synthetic benchmarks, where we obtain competitive performance. Further, we deploy our algorithm to optimize the beam intensity of the Swiss Free Electron Laser with up to 40 parameters while satisfying safe operation constraints.
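The restriction to one-dimensional sub-problems can be sketched as follows. This is only an illustration of the line-restriction idea: it replaces the Gaussian process model and acquisition step with a plain grid search along each line, picks directions at random, and treats the task as minimization, all of which are assumptions of the example rather than details taken from the paper.

```python
import numpy as np

def line_search_step(f, x, direction, radius=1.0, n_grid=41):
    """Minimize f over x + t * direction for t on a grid in [-radius, radius]."""
    ts = np.linspace(-radius, radius, n_grid)
    candidates = x[None, :] + ts[:, None] * direction[None, :]
    values = np.array([f(c) for c in candidates])
    best = int(np.argmin(values))
    # Keep the current point unless a candidate strictly improves on it.
    return candidates[best] if values[best] < f(x) else x

def random_line_descent(f, x0, n_iters=200, seed=0):
    """Solve a sequence of 1-D sub-problems along random unit directions."""
    rng = np.random.default_rng(seed)
    x = np.asarray(x0, dtype=float)
    history = [f(x)]
    for _ in range(n_iters):
        direction = rng.normal(size=x.shape)
        direction /= np.linalg.norm(direction)
        x = line_search_step(f, x, direction)
        history.append(f(x))
    return x, history

def quadratic(x):
    """Strongly convex toy objective."""
    return float(np.sum(x ** 2))

x_final, history = random_line_descent(quadratic, np.ones(10))
```

Each iteration only ever searches a single scalar `t`, so the cost of the inner problem does not grow with the dimension of `x`; in the method described above that inner problem would be handled by Bayesian optimization (or SafeOpt for the safe variant) instead of a grid.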
