Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Pushmeet Kohli

Leveraging Grammar and Reinforcement Learning for Neural Program Synthesis

May 22, 2018
Rudy Bunel, Matthew Hausknecht, Jacob Devlin, Rishabh Singh, Pushmeet Kohli

Figure 1 for Leveraging Grammar and Reinforcement Learning for Neural Program Synthesis

Figure 2 for Leveraging Grammar and Reinforcement Learning for Neural Program Synthesis

Figure 3 for Leveraging Grammar and Reinforcement Learning for Neural Program Synthesis

Figure 4 for Leveraging Grammar and Reinforcement Learning for Neural Program Synthesis

Program synthesis is the task of automatically generating a program consistent with a specification. Recent years have seen proposal of a number of neural approaches for program synthesis, many of which adopt a sequence generation paradigm similar to neural machine translation, in which sequence-to-sequence models are trained to maximize the likelihood of known reference programs. While achieving impressive results, this strategy has two key limitations. First, it ignores Program Aliasing: the fact that many different programs may satisfy a given specification (especially with incomplete specifications such as a few input-output examples). By maximizing the likelihood of only a single reference program, it penalizes many semantically correct programs, which can adversely affect the synthesizer performance. Second, this strategy overlooks the fact that programs have a strict syntax that can be efficiently checked. To address the first limitation, we perform reinforcement learning on top of a supervised model with an objective that explicitly maximizes the likelihood of generating semantically correct programs. For addressing the second limitation, we introduce a training procedure that directly maximizes the probability of generating syntactically correct programs that fulfill the specification. We show that our contributions lead to improved accuracy of the models, especially in cases where the training data is limited.

* ICLR 2018

Via

Access Paper or Ask Questions

Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning

May 21, 2018
Jakob Foerster, Nantas Nardelli, Gregory Farquhar, Triantafyllos Afouras, Philip H. S. Torr, Pushmeet Kohli, Shimon Whiteson

Figure 1 for Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning

Figure 2 for Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning

Figure 3 for Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning

Figure 4 for Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning

Many real-world problems, such as network packet routing and urban traffic control, are naturally modeled as multi-agent reinforcement learning (RL) problems. However, existing multi-agent RL methods typically scale poorly in the problem size. Therefore, a key challenge is to translate the success of deep learning on single-agent RL to the multi-agent setting. A major stumbling block is that independent Q-learning, the most popular multi-agent RL method, introduces nonstationarity that makes it incompatible with the experience replay memory on which deep Q-learning relies. This paper proposes two methods that address this problem: 1) using a multi-agent variant of importance sampling to naturally decay obsolete data and 2) conditioning each agent's value function on a fingerprint that disambiguates the age of the data sampled from the replay memory. Results on a challenging decentralised variant of StarCraft unit micromanagement confirm that these methods enable the successful combination of experience replay with multi-agent RL.

* Camera-ready version, International Conference of Machine Learning 2017; updated to fix print-breaking image

Via

Access Paper or Ask Questions

Batched Large-scale Bayesian Optimization in High-dimensional Spaces

May 16, 2018
Zi Wang, Clement Gehring, Pushmeet Kohli, Stefanie Jegelka

Figure 1 for Batched Large-scale Bayesian Optimization in High-dimensional Spaces

Figure 2 for Batched Large-scale Bayesian Optimization in High-dimensional Spaces

Figure 3 for Batched Large-scale Bayesian Optimization in High-dimensional Spaces

Figure 4 for Batched Large-scale Bayesian Optimization in High-dimensional Spaces

Bayesian optimization (BO) has become an effective approach for black-box function optimization problems when function evaluations are expensive and the optimum can be achieved within a relatively small number of queries. However, many cases, such as the ones with high-dimensional inputs, may require a much larger number of observations for optimization. Despite an abundance of observations thanks to parallel experiments, current BO techniques have been limited to merely a few thousand observations. In this paper, we propose ensemble Bayesian optimization (EBO) to address three current challenges in BO simultaneously: (1) large-scale observations; (2) high dimensional input spaces; and (3) selections of batch queries that balance quality and diversity. The key idea of EBO is to operate on an ensemble of additive Gaussian process models, each of which possesses a randomized strategy to divide and conquer. We show unprecedented, previously impossible results of scaling up BO to tens of thousands of observations within minutes of computation.

* Proceedings of the 21st International Conference on Artificial Intelligence and Statistics (AISTATS) 2018, Lanzarote, Spain

Via

Access Paper or Ask Questions

Can Neural Networks Understand Logical Entailment?

Feb 23, 2018
Richard Evans, David Saxton, David Amos, Pushmeet Kohli, Edward Grefenstette

Figure 1 for Can Neural Networks Understand Logical Entailment?

Figure 2 for Can Neural Networks Understand Logical Entailment?

Figure 3 for Can Neural Networks Understand Logical Entailment?

Figure 4 for Can Neural Networks Understand Logical Entailment?

We introduce a new dataset of logical entailments for the purpose of measuring models' ability to capture and exploit the structure of logical expressions against an entailment prediction task. We use this task to compare a series of architectures which are ubiquitous in the sequence-processing literature, in addition to a new model class---PossibleWorldNets---which computes entailment as a "convolution over possible worlds". Results show that convolutional networks present the wrong inductive bias for this class of problems relative to LSTM RNNs, tree-structured neural networks outperform LSTM RNNs due to their enhanced ability to exploit the syntax of logic, and PossibleWorldNets outperform all benchmarks.

* Published at ICLR 2018 (main conference)

Via

Access Paper or Ask Questions

Batched High-dimensional Bayesian Optimization via Structural Kernel Learning

Jan 06, 2018
Zi Wang, Chengtao Li, Stefanie Jegelka, Pushmeet Kohli

Figure 1 for Batched High-dimensional Bayesian Optimization via Structural Kernel Learning

Figure 2 for Batched High-dimensional Bayesian Optimization via Structural Kernel Learning

Figure 3 for Batched High-dimensional Bayesian Optimization via Structural Kernel Learning

Figure 4 for Batched High-dimensional Bayesian Optimization via Structural Kernel Learning

Optimization of high-dimensional black-box functions is an extremely challenging problem. While Bayesian optimization has emerged as a popular approach for optimizing black-box functions, its applicability has been limited to low-dimensional problems due to its computational and statistical challenges arising from high-dimensional settings. In this paper, we propose to tackle these challenges by (1) assuming a latent additive structure in the function and inferring it properly for more efficient and effective BO, and (2) performing multiple evaluations in parallel to reduce the number of iterations required by the method. Our novel approach learns the latent structure with Gibbs sampling and constructs batched queries using determinantal point processes. Experimental validations on both synthetic and real-world functions demonstrate that the proposed method outperforms the existing state-of-the-art approaches.

* Proceedings of the 34th International Conference on Machine Learning, Sydney, Australia, PMLR 70, 2017

Via

Access Paper or Ask Questions

Learning Disentangled Representations with Semi-Supervised Deep Generative Models

Nov 13, 2017
N. Siddharth, Brooks Paige, Jan-Willem van de Meent, Alban Desmaison, Noah D. Goodman, Pushmeet Kohli, Frank Wood, Philip H. S. Torr

Figure 1 for Learning Disentangled Representations with Semi-Supervised Deep Generative Models

Figure 2 for Learning Disentangled Representations with Semi-Supervised Deep Generative Models

Figure 3 for Learning Disentangled Representations with Semi-Supervised Deep Generative Models

Figure 4 for Learning Disentangled Representations with Semi-Supervised Deep Generative Models

Variational autoencoders (VAEs) learn representations of data by jointly training a probabilistic encoder and decoder network. Typically these models encode all features of the data into a single variable. Here we are interested in learning disentangled representations that encode distinct aspects of the data into separate variables. We propose to learn such representations using model architectures that generalise from standard VAEs, employing a general graphical model structure in the encoder and decoder. This allows us to train partially-specified models that make relatively strong assumptions about a subset of interpretable variables and rely on the flexibility of neural networks to learn representations for the remaining variables. We further define a general objective for semi-supervised learning in this model class, which can be approximated using an importance sampling procedure. We evaluate our framework's ability to learn disentangled representations, both by qualitative exploration of its generative capacity, and quantitative evaluation of its discriminative ability on a variety of models and datasets.

* Accepted for publication at NIPS 2017

Via

Access Paper or Ask Questions

Zero-Shot Task Generalization with Multi-Task Deep Reinforcement Learning

Nov 07, 2017
Junhyuk Oh, Satinder Singh, Honglak Lee, Pushmeet Kohli

Figure 1 for Zero-Shot Task Generalization with Multi-Task Deep Reinforcement Learning

Figure 2 for Zero-Shot Task Generalization with Multi-Task Deep Reinforcement Learning

Figure 3 for Zero-Shot Task Generalization with Multi-Task Deep Reinforcement Learning

Figure 4 for Zero-Shot Task Generalization with Multi-Task Deep Reinforcement Learning

As a step towards developing zero-shot task generalization capabilities in reinforcement learning (RL), we introduce a new RL problem where the agent should learn to execute sequences of instructions after learning useful skills that solve subtasks. In this problem, we consider two types of generalizations: to previously unseen instructions and to longer sequences of instructions. For generalization over unseen instructions, we propose a new objective which encourages learning correspondences between similar subtasks by making analogies. For generalization over sequential instructions, we present a hierarchical architecture where a meta controller learns to use the acquired skills for executing the instructions. To deal with delayed reward, we propose a new neural architecture in the meta controller that learns when to update the subtask, which makes learning more efficient. Experimental results on a stochastic 3D domain show that the proposed ideas are crucial for generalization to longer instructions as well as unseen instructions.

* ICML 2017

Via

Access Paper or Ask Questions

Semantic Code Repair using Neuro-Symbolic Transformation Networks

Oct 30, 2017
Jacob Devlin, Jonathan Uesato, Rishabh Singh, Pushmeet Kohli

Figure 1 for Semantic Code Repair using Neuro-Symbolic Transformation Networks

Figure 2 for Semantic Code Repair using Neuro-Symbolic Transformation Networks

Figure 3 for Semantic Code Repair using Neuro-Symbolic Transformation Networks

Figure 4 for Semantic Code Repair using Neuro-Symbolic Transformation Networks

We study the problem of semantic code repair, which can be broadly defined as automatically fixing non-syntactic bugs in source code. The majority of past work in semantic code repair assumed access to unit tests against which candidate repairs could be validated. In contrast, the goal here is to develop a strong statistical model to accurately predict both bug locations and exact fixes without access to information about the intended correct behavior of the program. Achieving such a goal requires a robust contextual repair model, which we train on a large corpus of real-world source code that has been augmented with synthetically injected bugs. Our framework adopts a two-stage approach where first a large set of repair candidates are generated by rule-based processors, and then these candidates are scored by a statistical model using a novel neural network architecture which we refer to as Share, Specialize, and Compete. Specifically, the architecture (1) generates a shared encoding of the source code using an RNN over the abstract syntax tree, (2) scores each candidate repair using specialized network modules, and (3) then normalizes these scores together so they can compete against one another in comparable probability space. We evaluate our model on a real-world test set gathered from GitHub containing four common categories of bugs. Our model is able to predict the exact correct repair 41\% of the time with a single guess, compared to 13\% accuracy for an attentional sequence-to-sequence model.

Via

Access Paper or Ask Questions

Neural Program Meta-Induction

Oct 11, 2017
Jacob Devlin, Rudy Bunel, Rishabh Singh, Matthew Hausknecht, Pushmeet Kohli

Figure 1 for Neural Program Meta-Induction

Figure 2 for Neural Program Meta-Induction

Figure 3 for Neural Program Meta-Induction

Figure 4 for Neural Program Meta-Induction

Most recently proposed methods for Neural Program Induction work under the assumption of having a large set of input/output (I/O) examples for learning any underlying input-output mapping. This paper aims to address the problem of data and computation efficiency of program induction by leveraging information from related tasks. Specifically, we propose two approaches for cross-task knowledge transfer to improve program induction in limited-data scenarios. In our first proposal, portfolio adaptation, a set of induction models is pretrained on a set of related tasks, and the best model is adapted towards the new task using transfer learning. In our second approach, meta program induction, a $k$-shot learning approach is used to make a model generalize to new tasks without additional training. To test the efficacy of our methods, we constructed a new benchmark of programs written in the Karel programming language. Using an extensive experimental evaluation on the Karel benchmark, we demonstrate that our proposals dramatically outperform the baseline induction method that does not use knowledge transfer. We also analyze the relative performance of the two approaches and study conditions in which they perform best. In particular, meta induction outperforms all existing approaches under extreme data sparsity (when a very small number of examples are available), i.e., fewer than ten. As the number of available I/O examples increase (i.e. a thousand or more), portfolio adapted program induction becomes the best approach. For intermediate data sizes, we demonstrate that the combined method of adapted meta program induction has the strongest performance.

* 8 Pages + 1 page appendix

Via

Access Paper or Ask Questions

DeepContext: Context-Encoding Neural Pathways for 3D Holistic Scene Understanding

Aug 16, 2017
Yinda Zhang, Mingru Bai, Pushmeet Kohli, Shahram Izadi, Jianxiong Xiao

Figure 1 for DeepContext: Context-Encoding Neural Pathways for 3D Holistic Scene Understanding

Figure 2 for DeepContext: Context-Encoding Neural Pathways for 3D Holistic Scene Understanding

Figure 3 for DeepContext: Context-Encoding Neural Pathways for 3D Holistic Scene Understanding

Figure 4 for DeepContext: Context-Encoding Neural Pathways for 3D Holistic Scene Understanding

While deep neural networks have led to human-level performance on computer vision tasks, they have yet to demonstrate similar gains for holistic scene understanding. In particular, 3D context has been shown to be an extremely important cue for scene understanding - yet very little research has been done on integrating context information with deep models. This paper presents an approach to embed 3D context into the topology of a neural network trained to perform holistic scene understanding. Given a depth image depicting a 3D scene, our network aligns the observed scene with a predefined 3D scene template, and then reasons about the existence and location of each object within the scene template. In doing so, our model recognizes multiple objects in a single forward pass of a 3D convolutional neural network, capturing both global scene and local object information simultaneously. To create training data for this 3D network, we generate partly hallucinated depth images which are rendered by replacing real objects with a repository of CAD models of the same object category. Extensive experiments demonstrate the effectiveness of our algorithm compared to the state-of-the-arts. Source code and data are available at http://deepcontext.cs.princeton.edu.

* Accepted by ICCV2017

Via

Access Paper or Ask Questions