Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Kavitha Srinivas

Generalized Planning in PDDL Domains with Pretrained Large Language Models

May 18, 2023

Tom Silver, Soham Dan, Kavitha Srinivas, Joshua B. Tenenbaum, Leslie Pack Kaelbling, Michael Katz

Figure 1 for Generalized Planning in PDDL Domains with Pretrained Large Language Models

Figure 2 for Generalized Planning in PDDL Domains with Pretrained Large Language Models

Figure 3 for Generalized Planning in PDDL Domains with Pretrained Large Language Models

Figure 4 for Generalized Planning in PDDL Domains with Pretrained Large Language Models

Abstract:Recent work has considered whether large language models (LLMs) can function as planners: given a task, generate a plan. We investigate whether LLMs can serve as generalized planners: given a domain and training tasks, generate a program that efficiently produces plans for other tasks in the domain. In particular, we consider PDDL domains and use GPT-4 to synthesize Python programs. We also consider (1) Chain-of-Thought (CoT) summarization, where the LLM is prompted to summarize the domain and propose a strategy in words before synthesizing the program; and (2) automated debugging, where the program is validated with respect to the training tasks, and in case of errors, the LLM is re-prompted with four types of feedback. We evaluate this approach in seven PDDL domains and compare it to four ablations and four baselines. Overall, we find that GPT-4 is a surprisingly powerful generalized planner. We also conclude that automated debugging is very important, that CoT summarization has non-uniform impact, that GPT-4 is far superior to GPT-3.5, and that just two training tasks are often sufficient for strong generalization.

Via

Access Paper or Ask Questions

A Vision for Semantically Enriched Data Science

Mar 02, 2023

Udayan Khurana, Kavitha Srinivas, Sainyam Galhotra, Horst Samulowitz

Abstract:The recent efforts in automation of machine learning or data science has achieved success in various tasks such as hyper-parameter optimization or model selection. However, key areas such as utilizing domain knowledge and data semantics are areas where we have seen little automation. Data Scientists have long leveraged common sense reasoning and domain knowledge to understand and enrich data for building predictive models. In this paper we discuss important shortcomings of current data science and machine learning solutions. We then envision how leveraging "semantic" understanding and reasoning on data in combination with novel tools for data science automation can help with consistent and explainable data augmentation and transformation. Additionally, we discuss how semantics can assist data scientists in a new manner by helping with challenges related to trust, bias, and explainability in machine learning. Semantic annotation can also help better explore and organize large data sources.

* arXiv admin note: substantial text overlap with arXiv:2205.08018

Via

Access Paper or Ask Questions

Serenity: Library Based Python Code Analysis for Code Completion and Automated Machine Learning

Jan 05, 2023

Wenting Zhao, Ibrahim Abdelaziz, Julian Dolby, Kavitha Srinivas, Mossad Helali, Essam Mansour

Figure 1 for Serenity: Library Based Python Code Analysis for Code Completion and Automated Machine Learning

Figure 2 for Serenity: Library Based Python Code Analysis for Code Completion and Automated Machine Learning

Figure 3 for Serenity: Library Based Python Code Analysis for Code Completion and Automated Machine Learning

Figure 4 for Serenity: Library Based Python Code Analysis for Code Completion and Automated Machine Learning

Abstract:Dynamically typed languages such as Python have become very popular. Among other strengths, Python's dynamic nature and its straightforward linking to native code have made it the de-facto language for many research areas such as Artificial Intelligence. This flexibility, however, makes static analysis very hard. While creating a sound, or a soundy, analysis for Python remains an open problem, we present in this work Serenity, a framework for static analysis of Python that turns out to be sufficient for some tasks. The Serenity framework exploits two basic mechanisms: (a) reliance on dynamic dispatch at the core of language translation, and (b) extreme abstraction of libraries, to generate an abstraction of the code. We demonstrate the efficiency and usefulness of Serenity's analysis in two applications: code completion and automated machine learning. In these two applications, we demonstrate that such analysis has a strong signal, and can be leveraged to establish state-of-the-art performance, comparable to neural models and dynamic analysis respectively.

Via

Access Paper or Ask Questions

Exploring Code Style Transfer with Neural Networks

Sep 13, 2022

Karl Munson, Anish Savla, Chih-Kai Ting, Serenity Wade, Kiran Kate, Kavitha Srinivas

Figure 1 for Exploring Code Style Transfer with Neural Networks

Figure 2 for Exploring Code Style Transfer with Neural Networks

Figure 3 for Exploring Code Style Transfer with Neural Networks

Figure 4 for Exploring Code Style Transfer with Neural Networks

Abstract:Style is a significant component of natural language text, reflecting a change in the tone of text while keeping the underlying information the same. Even though programming languages have strict syntax rules, they also have style. Code can be written with the same functionality but using different language features. However, programming style is difficult to quantify, and thus as part of this work, we define style attributes, specifically for Python. To build a definition of style, we utilized hierarchical clustering to capture a style definition without needing to specify transformations. In addition to defining style, we explore the capability of a pre-trained code language model to capture information about code style. To do this, we fine-tuned pre-trained code-language models and evaluated their performance in code style transfer tasks.

Via

Access Paper or Ask Questions

A Survey on Semantics in Automated Data Science

May 16, 2022

Udayan Khurana, Kavitha Srinivas, Horst Samulowitz

Figure 1 for A Survey on Semantics in Automated Data Science

Abstract:Data Scientists leverage common sense reasoning and domain knowledge to understand and enrich data for building predictive models. In recent years, we have witnessed a surge in tools and techniques for {\em automated machine learning}. While data scientists can employ various such tools to help with model building, many other aspects such as {\em feature engineering} that require semantic understanding of concepts, remain manual to a large extent. In this paper we discuss important shortcomings of current automated data science solutions and machine learning. We discuss how leveraging basic semantic reasoning on data in combination with novel tools for data science automation can help with consistent and explainable data augmentation and transformation. Moreover, semantics can assist data scientists in a new manner by helping with challenges related to {\em trust}, {\em bias}, and {\em explainability}.

Via

Access Paper or Ask Questions

Federated Data Science to Break Down Silos

Nov 25, 2021

Essam Mansour, Kavitha Srinivas, Katja Hose

Figure 1 for Federated Data Science to Break Down Silos

Figure 2 for Federated Data Science to Break Down Silos

Figure 3 for Federated Data Science to Break Down Silos

Abstract:Similar to Open Data initiatives, data science as a community has launched initiatives for sharing not only data but entire pipelines, derivatives, artifacts, etc. (Open Data Science). However, the few efforts that exist focus on the technical part on how to facilitate sharing, conversion, etc. This vision paper goes a step further and proposes KEK, an open federated data science platform that does not only allow for sharing data science pipelines and their (meta)data but also provides methods for efficient search and, in the ideal case, even allows for combining and defining pipelines across platforms in a federated manner. In doing so, KEK addresses the so far neglected challenge of actually finding artifacts that are semantically related and that can be combined to achieve a certain goal.

* Accepted at SIGMOD Record

Via

Access Paper or Ask Questions

A Scalable AutoML Approach Based on Graph Neural Networks

Oct 29, 2021

Mossad Helali, Essam Mansour, Ibrahim Abdelaziz, Julian Dolby, Kavitha Srinivas

Figure 1 for A Scalable AutoML Approach Based on Graph Neural Networks

Figure 2 for A Scalable AutoML Approach Based on Graph Neural Networks

Figure 3 for A Scalable AutoML Approach Based on Graph Neural Networks

Figure 4 for A Scalable AutoML Approach Based on Graph Neural Networks

Abstract:AutoML systems build machine learning models automatically by performing a search over valid data transformations and learners, along with hyper-parameter optimization for each learner. We present a system called KGpip for the selection of transformations and learners, which (1) builds a database of datasets and corresponding historically used pipelines using effective static analysis instead of the typical use of actual runtime information, (2) uses dataset embeddings to find similar datasets in the database based on its content instead of metadata-based features, (3) models AutoML pipeline creation as a graph generation problem, to succinctly characterize the diverse pipelines seen for a single dataset. KGpip is designed as a sub-component for AutoML systems. We demonstrate this ability via integrating KGpip with two AutoML systems and show that it does significantly enhance the performance of existing state-of-the-art systems.

* 15 pages, 10 figures

Via

Access Paper or Ask Questions

Can Machines Read Coding Manuals Yet? -- A Benchmark for Building Better Language Models for Code Understanding

Sep 15, 2021

Ibrahim Abdelaziz, Julian Dolby, Jamie McCusker, Kavitha Srinivas

Figure 1 for Can Machines Read Coding Manuals Yet? -- A Benchmark for Building Better Language Models for Code Understanding

Figure 2 for Can Machines Read Coding Manuals Yet? -- A Benchmark for Building Better Language Models for Code Understanding

Figure 3 for Can Machines Read Coding Manuals Yet? -- A Benchmark for Building Better Language Models for Code Understanding

Figure 4 for Can Machines Read Coding Manuals Yet? -- A Benchmark for Building Better Language Models for Code Understanding

Abstract:Code understanding is an increasingly important application of Artificial Intelligence. A fundamental aspect of understanding code is understanding text about code, e.g., documentation and forum discussions. Pre-trained language models (e.g., BERT) are a popular approach for various NLP tasks, and there are now a variety of benchmarks, such as GLUE, to help improve the development of such models for natural language understanding. However, little is known about how well such models work on textual artifacts about code, and we are unaware of any systematic set of downstream tasks for such an evaluation. In this paper, we derive a set of benchmarks (BLANCA - Benchmarks for LANguage models on Coding Artifacts) that assess code understanding based on tasks such as predicting the best answer to a question in a forum post, finding related forum posts, or predicting classes related in a hierarchy from class documentation. We evaluate the performance of current state-of-the-art language models on these tasks and show that there is a significant improvement on each task from fine tuning. We also show that multi-task training over BLANCA tasks helps build better language models for code understanding.

Via

Access Paper or Ask Questions

Learning to Guide a Saturation-Based Theorem Prover

Jun 07, 2021

Ibrahim Abdelaziz, Maxwell Crouse, Bassem Makni, Vernon Austil, Cristina Cornelio, Shajith Ikbal, Pavan Kapanipathi, Ndivhuwo Makondo, Kavitha Srinivas, Michael Witbrock(+1 more)

Figure 1 for Learning to Guide a Saturation-Based Theorem Prover

Figure 2 for Learning to Guide a Saturation-Based Theorem Prover

Figure 3 for Learning to Guide a Saturation-Based Theorem Prover

Figure 4 for Learning to Guide a Saturation-Based Theorem Prover

Abstract:Traditional automated theorem provers have relied on manually tuned heuristics to guide how they perform proof search. Recently, however, there has been a surge of interest in the design of learning mechanisms that can be integrated into theorem provers to improve their performance automatically. In this work, we introduce TRAIL, a deep learning-based approach to theorem proving that characterizes core elements of saturation-based theorem proving within a neural framework. TRAIL leverages (a) an effective graph neural network for representing logical formulas, (b) a novel neural representation of the state of a saturation-based theorem prover in terms of processed clauses and available actions, and (c) a novel representation of the inference selection process as an attention-based action policy. We show through a systematic analysis that these components allow TRAIL to significantly outperform previous reinforcement learning-based theorem provers on two standard benchmark datasets (up to 36% more theorems proved). In addition, to the best of our knowledge, TRAIL is the first reinforcement learning-based approach to exceed the performance of a state-of-the-art traditional theorem prover on a standard theorem proving benchmark (solving up to 17% more problems).

Via

Access Paper or Ask Questions

Graph4Code: A Machine Interpretable Knowledge Graph for Code

Feb 21, 2020

Kavitha Srinivas, Ibrahim Abdelaziz, Julian Dolby, James P. McCusker

Figure 1 for Graph4Code: A Machine Interpretable Knowledge Graph for Code

Figure 2 for Graph4Code: A Machine Interpretable Knowledge Graph for Code

Figure 3 for Graph4Code: A Machine Interpretable Knowledge Graph for Code

Figure 4 for Graph4Code: A Machine Interpretable Knowledge Graph for Code

Abstract:Knowledge graphs have proven to be extremely useful in powering diverse applications in semantic search, natural language understanding, and even image classification. Graph4Code attempts to build well structured knowledge graphs about program code to similarly revolutionize diverse applications such as code search, code understanding, refactoring, bug detection, and code automation. We build such a graph by applying a set of generic code analysis techniques to Python code on the web. Since use of popular Python modules is ubiquitous in code, calls to functions in Python modules serve as key nodes of the knowledge graph. The edges in the graph are based on 1) function usage in the wild (e.g., which other function tends to call this one, or which function tends to precede this one, as gleaned from program analysis), 2) documentation about the function (e.g., code documentation, usage documentation, or forum discussions such as StackOverflow), and 3) program specific features such as class hierarchies. We use the Whyis knowledge graph management framework to make the graph easily extensible. We apply these techniques to 1.3M Python files drawn from GitHub, and associated documentation on the web for over 400 popular libraries, as well as StackOverflow posts about the same set of libraries. This knowledge graph will be made available soon to the larger community for use.

Via

Access Paper or Ask Questions