Yewen Pu

Neurosymbolic Transformers for Multi-Agent Communication

Jan 05, 2021
Jeevana Priya Inala, Yichen Yang, James Paulos, Yewen Pu, Osbert Bastani, Vijay Kumar, Martin Rinard, Armando Solar-Lezama

We study the problem of inferring communication structures that can solve cooperative multi-agent planning problems while minimizing the amount of communication. We quantify the amount of communication as the maximum degree of the communication graph; this metric captures settings where agents have limited bandwidth. Minimizing communication is challenging due to the combinatorial nature of both the decision space and the objective; for instance, we cannot solve this problem by training neural networks using gradient descent. We propose a novel algorithm that synthesizes a control policy that combines a programmatic communication policy used to generate the communication graph with a transformer policy network used to choose actions. Our algorithm first trains the transformer policy, which implicitly generates a "soft" communication graph; then, it synthesizes a programmatic communication policy that "hardens" this graph, forming a neurosymbolic transformer. Our experiments demonstrate how our approach can synthesize policies that generate low-degree communication graphs while maintaining near-optimal performance.
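The step from a "soft" to a "hard" communication graph can be illustrated with a minimal sketch. It assumes a simplification that is not the paper's method: instead of synthesizing a programmatic communication policy, each agent simply keeps the `k` other agents with the largest attention weight. The names `harden` and `scores`, the score values, and the choice to bound only the number of agents each agent listens to are all hypothetical.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of raw attention scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def harden(attention_scores, k):
    """For each agent i, keep the k other agents with the largest soft weight.

    Assumption: top-k selection stands in for the synthesized programmatic
    policy; it only bounds how many agents each agent listens to.
    """
    n = len(attention_scores)
    graph = {}
    for i in range(n):
        others = [j for j in range(n) if j != i]
        weights = softmax([attention_scores[i][j] for j in others])
        ranked = sorted(zip(others, weights), key=lambda p: p[1], reverse=True)
        graph[i] = sorted(j for j, _ in ranked[:k])
    return graph

# Hypothetical raw attention scores for four agents (row i attends to column j).
scores = [
    [0.0, 2.0, 0.1, 1.0],
    [0.5, 0.0, 3.0, 0.2],
    [1.5, 0.3, 0.0, 0.4],
    [0.1, 0.2, 0.9, 0.0],
]
hard_graph = harden(scores, k=2)
```

Here agent 0 would listen only to agents 1 and 3; the discrete choice of neighbors is what makes the objective combinatorial rather than differentiable.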

Representing Partial Programs with Blended Abstract Semantics

Dec 23, 2020
Maxwell Nye, Yewen Pu, Matthew Bowers, Jacob Andreas, Joshua B. Tenenbaum, Armando Solar-Lezama

Synthesizing programs from examples requires searching over a vast, combinatorial space of possible programs. In this search process, a key challenge is representing the behavior of a partially written program before it can be executed, to judge if it is on the right track and predict where to search next. We introduce a general technique for representing partially written programs in a program synthesis engine. We take inspiration from the technique of abstract interpretation, in which an approximate execution model is used to determine if an unfinished program will eventually satisfy a goal specification. Here we learn an approximate execution model implemented as a modular neural network. By constructing compositional program representations that implicitly encode the interpretation semantics of the underlying programming language, we can represent partial programs using a flexible combination of concrete execution state and learned neural representations, using the learned approximate semantics when concrete semantics are not known (in unfinished parts of the program). We show that these hybrid neuro-symbolic representations enable execution-guided synthesizers to use more powerful language constructs, such as loops and higher-order functions, and can be used to synthesize programs more accurately for a given search budget than pure neural approaches in several domains.
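For intuition, classical abstract interpretation over a partial program can be sketched as follows. This is not the learned neural semantics described above: it is a hand-written interval analysis over a toy expression language, and the tuple encoding, the `HOLE` marker, and the assumption that every hole is filled by an integer in 0..9 are all hypothetical.

```python
HOLE = "?"  # marks an unfinished part of the program

def abstract_eval(expr, hole_range=(0, 9)):
    """Return (lo, hi) bounds on the value of a possibly unfinished expression.

    Assumption: every hole will eventually be filled by an integer
    in hole_range.
    """
    if expr == HOLE:
        return hole_range
    if isinstance(expr, int):
        return (expr, expr)
    op, left, right = expr
    (a, b), (c, d) = abstract_eval(left, hole_range), abstract_eval(right, hole_range)
    if op == "+":
        return (a + c, b + d)
    if op == "*":
        products = [a * c, a * d, b * c, b * d]
        return (min(products), max(products))
    raise ValueError(f"unknown operator: {op}")

def may_reach(expr, target):
    """A partial program is worth extending only if target lies in its bounds."""
    lo, hi = abstract_eval(expr)
    return lo <= target <= hi

# 3 + 2 * ?  can take any value in [3, 21].
partial = ("+", 3, ("*", 2, HOLE))
```

A search procedure could discard `partial` when the target output is 30 but keep it when the target is 11, without ever running a finished program.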

Fusion 360 Gallery: A Dataset and Environment for Programmatic CAD Reconstruction

Oct 05, 2020
Karl D. D. Willis, Yewen Pu, Jieliang Luo, Hang Chu, Tao Du, Joseph G. Lambourne, Armando Solar-Lezama, Wojciech Matusik

Parametric computer-aided design (CAD) is a standard paradigm used for the design of manufactured objects. CAD designers perform modeling operations, such as sketch and extrude, to form a construction sequence that makes up a final design. Despite the pervasiveness of parametric CAD and growing interest from the research community, a dataset of human-designed 3D CAD construction sequences has not been available to date. In this paper we present the Fusion 360 Gallery reconstruction dataset and environment for learning CAD reconstruction. We provide a dataset of 8,625 designs, comprising sequential sketch and extrude modeling operations, together with a complementary environment called the Fusion 360 Gym, to assist with performing CAD reconstruction. We outline a standard CAD reconstruction task, together with evaluation metrics, and present results from a novel method using neurally guided search to recover a construction sequence from raw geometry.

Program Synthesis with Pragmatic Communication

Jul 09, 2020
Yewen Pu, Kevin Ellis, Marta Kryven, Josh Tenenbaum, Armando Solar-Lezama

Program synthesis techniques construct or infer programs from user-provided specifications, such as input-output examples. Yet most specifications, especially those given by end-users, leave the synthesis problem radically ill-posed, because many programs may simultaneously satisfy the specification. Prior work resolves this ambiguity by using various inductive biases, such as a preference for simpler programs. This work introduces a new inductive bias derived by modeling the program synthesis task as rational communication, drawing insights from recursive reasoning models of pragmatics. Given a specification, we score a candidate program both on its consistency with the specification and on whether a rational speaker would choose this particular specification to communicate that program. We develop efficient algorithms for such an approach when learning from input-output examples, and build a pragmatic program synthesizer over a simple grid-like layout domain. A user study finds that end-user participants communicate more effectively with the pragmatic program synthesizer than with a non-pragmatic one.
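The recursive reasoning can be sketched on a toy domain. The example below assumes a uniform prior, a single input-output example as the specification, and three hypothetical threshold programs; it computes exact probabilities by enumeration and is not the efficient algorithm the paper develops.

```python
from fractions import Fraction

# Hypothetical program space and specification space.
programs = {
    "x > 0": lambda x: x > 0,
    "x > 1": lambda x: x > 1,
    "x > 2": lambda x: x > 2,
}
inputs = [1, 2]
specs = [(x, out) for x in inputs for out in (True, False)]

def consistent(name, spec):
    x, out = spec
    return programs[name](x) == out

def normalize(weights):
    total = sum(weights.values())
    return {k: (v / total if total else Fraction(0)) for k, v in weights.items()}

def literal_listener(spec):
    """Uniform over all programs consistent with the spec."""
    return normalize({p: Fraction(int(consistent(p, spec))) for p in programs})

def speaker(program):
    """Prefers specs that lead the literal listener to the intended program."""
    return normalize({s: literal_listener(s)[program] for s in specs})

def pragmatic_listener(spec):
    """Scores a program by how likely a rational speaker is to pick this spec."""
    return normalize({p: speaker(p)[spec] for p in programs})

literal = literal_listener((2, True))
pragmatic = pragmatic_listener((2, True))
```

Given the example `(2, True)`, the literal listener cannot separate `x > 0` from `x > 1`, while the pragmatic listener favors `x > 1`: a speaker who meant `x > 0` had the unambiguous example `(1, True)` available.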

* The second author and the third author contributed equally to this work

Write, Execute, Assess: Program Synthesis with a REPL

Jun 09, 2019
Kevin Ellis, Maxwell Nye, Yewen Pu, Felix Sosa, Josh Tenenbaum, Armando Solar-Lezama

We present a neural program synthesis approach that integrates components that write, execute, and assess code to navigate the search space of possible programs. We equip the search process with an interpreter, or read-eval-print loop (REPL), which immediately executes partially written programs, exposing their semantics. The REPL addresses a basic challenge of program synthesis: tiny changes in syntax can lead to huge changes in semantics. We train a pair of models: a policy that proposes the new piece of code to write, and a value function that assesses the prospects of the code written so far. At test time we can combine these models with a Sequential Monte Carlo algorithm. We apply our approach to two domains: synthesizing text editing programs and inferring 2D and 3D graphics programs.
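How a policy, a value function, and a REPL might fit into a Sequential Monte Carlo loop can be sketched on a toy domain. Everything concrete here is an assumption made for illustration: the two-operation language, the uniform random `policy`, and the hand-written `value_fn` stand in for the trained models, and the particle counts are arbitrary.

```python
import math
import random

# Hypothetical toy language: a program is a list of ops applied to 0.
OPS = {"+1": lambda v: v + 1, "*2": lambda v: v * 2}

def execute(program):
    """The 'REPL': run a partial program and expose its current value."""
    value = 0
    for op in program:
        value = OPS[op](value)
    return value

def policy(program, rng):
    """Stand-in for a learned policy: propose the next op uniformly at random."""
    return rng.choice(sorted(OPS))

def value_fn(program, target):
    """Stand-in for a learned value function: closer executions score higher."""
    return math.exp(-abs(target - execute(program)))

def smc_search(target, n_particles=50, max_steps=8, seed=0):
    rng = random.Random(seed)
    particles = [[] for _ in range(n_particles)]
    for _ in range(max_steps):
        # Write: extend every particle with one op proposed by the policy.
        particles = [p + [policy(p, rng)] for p in particles]
        # Execute: stop as soon as some particle produces the target.
        for p in particles:
            if execute(p) == target:
                return p
        # Assess: resample particles in proportion to their value.
        weights = [value_fn(p, target) for p in particles]
        particles = [list(p) for p in rng.choices(particles, weights=weights, k=n_particles)]
    return None

found = smc_search(target=10)
```

The search may return `None` when no particle reaches the target within the step budget; resampling concentrates the particle population on partial programs whose executed state looks promising.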

* The first four authors contributed equally to this work

Selecting Representative Examples for Program Synthesis

Jun 07, 2018
Yewen Pu, Zachery Miranda, Armando Solar-Lezama, Leslie Pack Kaelbling

Program synthesis is a class of regression problems where one seeks a solution, in the form of a source-code program, mapping the inputs to their corresponding outputs exactly. Due to its precise and combinatorial nature, program synthesis is commonly formulated as a constraint satisfaction problem, where input-output examples are encoded as constraints and solved with a constraint solver. A key challenge of this formulation is scalability: while constraint solvers work well with a few well-chosen examples, a large set of examples can incur significant overhead in both time and memory. We describe a method to discover a subset of examples that is both small and representative: the subset is constructed iteratively, using a neural network to predict the probability of unchosen examples conditioned on the chosen examples in the subset, and greedily adding the least probable example. We empirically evaluate the representativeness of the subsets constructed by our method, and demonstrate such subsets can significantly improve synthesis time and stability.
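The greedy loop can be sketched as follows. The paper uses a neural network to predict the probability of an unchosen example; this sketch substitutes a hypothetical stand-in, `predict_prob`, which counts agreeing hypotheses in a small enumerated hypothesis space. The stopping `threshold` and the threshold-function domain are also assumptions.

```python
def predict_prob(example, chosen, hypotheses):
    """Stand-in for the neural predictor: the fraction of hypotheses that are
    consistent with the chosen examples and also agree with this example."""
    x, y = example
    alive = [h for h in hypotheses if all(h(cx) == cy for cx, cy in chosen)]
    if not alive:
        return 0.0
    return sum(h(x) == y for h in alive) / len(alive)

def select_representative(examples, hypotheses, threshold=1.0):
    """Greedily add the least probable remaining example until every
    remaining example is predicted with probability >= threshold."""
    chosen = []
    remaining = list(examples)
    while remaining:
        probs = [predict_prob(e, chosen, hypotheses) for e in remaining]
        lowest = min(range(len(remaining)), key=probs.__getitem__)
        if probs[lowest] >= threshold:
            break
        chosen.append(remaining.pop(lowest))
    return chosen

# Hypothetical domain: threshold functions x >= t, with true threshold t = 3.
hypotheses = [lambda x, t=t: x >= t for t in range(6)]
examples = [(x, x >= 3) for x in range(6)]
subset = select_representative(examples, hypotheses)
```

In this toy domain the two examples on either side of the decision boundary are enough to make the other four predictable, so only they would be passed on to a constraint solver.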

Verifiable Reinforcement Learning via Policy Extraction

May 22, 2018
Osbert Bastani, Yewen Pu, Armando Solar-Lezama

While deep reinforcement learning has successfully solved many challenging control tasks, its real-world applicability has been limited by the inability to ensure the safety of learned policies. We propose an approach to verifiable reinforcement learning by training decision tree policies, which can represent complex policies (since they are nonparametric), yet can be efficiently verified using existing techniques (since they are highly structured). The challenge is that decision tree policies are difficult to train. We propose VIPER, an algorithm that combines ideas from model compression and imitation learning to learn decision tree policies guided by a DNN policy (called the oracle) and its Q-function, and show that it substantially outperforms two baselines. We use VIPER to (i) learn a provably robust decision tree policy for a variant of Atari Pong with a symbolic state space, (ii) learn a decision tree policy for a toy game based on Pong that provably never loses, and (iii) learn a provably stable decision tree policy for cart-pole. In each case, the decision tree policy achieves performance equal to that of the original DNN policy.

Learning to Acquire Information

Jul 11, 2017
Yewen Pu, Leslie P Kaelbling, Armando Solar-Lezama

We consider the problem of diagnosis, where a set of simple observations is used to infer a potentially complex hidden hypothesis. Finding the optimal subset of observations is intractable in general, so we focus on the problem of active diagnosis, where the agent selects the next most-informative observation based on the results of previous observations. We show that under the assumption of uniform observation entropy, one can build an implication model which directly predicts the outcome of the potential next observation conditioned on the results of past observations, and selects the observation with the maximum entropy. This approach enjoys reduced computational complexity by bypassing the complicated hypothesis space, and can be trained on observation data alone, learning how to query without knowledge of the hidden hypothesis.
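The selection rule can be sketched for binary observations. The implication model is replaced here by a hypothetical lookup table, `predicted`, that returns the probability of a positive outcome; in the setting described above that probability would come from a model trained on observation data and conditioned on `history`.

```python
import math

def binary_entropy(p):
    """Entropy in bits of a binary outcome with P(true) = p."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -(p * math.log2(p) + (1 - p) * math.log2(1 - p))

def next_observation(candidates, predict_true_prob, history):
    """Pick the candidate whose predicted outcome is most uncertain."""
    return max(
        candidates,
        key=lambda obs: binary_entropy(predict_true_prob(obs, history)),
    )

# Hypothetical model output: P(outcome is true | past observations).
predicted = {"a": 0.9, "b": 0.5, "c": 0.2}
choice = next_observation(
    candidates=["a", "b", "c"],
    predict_true_prob=lambda obs, history: predicted[obs],
    history=[],
)
```

Observation `b` would be queried next because its outcome is a coin flip under the model; the hypothesis space is never enumerated.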

sk_p: a neural program corrector for MOOCs

Jul 11, 2016
Yewen Pu, Karthik Narasimhan, Armando Solar-Lezama, Regina Barzilay

We present a novel technique for automatic program correction in MOOCs, capable of fixing both syntactic and semantic errors without manual, problem-specific correction strategies. Given an incorrect student program, it generates candidate programs from a distribution of likely corrections, and checks each candidate for correctness against a test suite. The key observation is that in MOOCs many programs share similar code fragments, and the seq2seq neural network model, used in the natural-language processing task of machine translation, can be modified and trained to recover these fragments. Experiments show that our scheme can correct 29% of all incorrect submissions and outperforms a state-of-the-art approach that requires manual, problem-specific correction strategies.
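The generate-and-check loop can be sketched as follows. The candidate list here is hand-written and merely stands in for samples from the seq2seq model; the function name `solve`, the test suite, and the use of `exec` without sandboxing are assumptions made for illustration.

```python
def passes_tests(source, tests, func_name="solve"):
    """Run a candidate program and check it against every test case.

    Assumption: the candidate defines a function called func_name. A real
    grader would sandbox this step instead of calling exec directly.
    """
    namespace = {}
    try:
        exec(source, namespace)
        func = namespace[func_name]
        return all(func(*args) == expected for args, expected in tests)
    except Exception:
        # Syntax errors, missing functions, and runtime errors all count as failures.
        return False

def correct(candidates, tests):
    """Return the first candidate that passes the test suite, or None."""
    for source in candidates:
        if passes_tests(source, tests):
            return source
    return None

# Hypothetical candidates, ordered as if by model likelihood.
candidates = [
    "def solve(x):\n    return x * x +\n",  # syntactic error
    "def solve(x):\n    return x + x\n",    # semantic error
    "def solve(x):\n    return x * x\n",    # correct
]
tests = [((2,), 4), ((3,), 9)]
fixed = correct(candidates, tests)
```

The test suite is what filters out both kinds of error: the first candidate fails to parse, and the second parses but returns 6 for input 3.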
