Binghong Chen

How to Design Sample and Computationally Efficient VQA Models

Mar 22, 2021
Karan Samel, Zelin Zhao, Binghong Chen, Kuan Wang, Robin Luo, Le Song

In multi-modal reasoning tasks such as visual question answering (VQA), many modeling and training paradigms have been tested. Previous models propose different methods for the vision and language tasks, but which ones perform best while being sample and computationally efficient? Based on our experiments, we find that representing the text as probabilistic programs and the images as object-level scene graphs best satisfies these desiderata. We extend existing models to leverage these soft programs and scene graphs to train on question-answer pairs in an end-to-end manner. Empirical results demonstrate that this differentiable end-to-end program executor is able to maintain state-of-the-art accuracy while being sample and computationally efficient.

* 20 pages, 5 figures

Retro*: Learning Retrosynthetic Planning with Neural Guided A* Search

Jun 29, 2020
Binghong Chen, Chengtao Li, Hanjun Dai, Le Song

Retrosynthetic planning is a critical task in organic chemistry which identifies a series of reactions that can lead to the synthesis of a target product. The vast number of possible chemical transformations makes the search space very large, and retrosynthetic planning is challenging even for experienced chemists. However, existing methods either require expensive return estimation by rollout with high variance, or optimize for search speed rather than solution quality. In this paper, we propose Retro*, a neural-based A*-like algorithm that finds high-quality synthetic routes efficiently. It maintains the search as an AND-OR tree, and learns a neural search bias with off-policy data. Then, guided by this neural network, it performs best-first search efficiently during new planning episodes. Experiments on benchmark USPTO datasets show that our proposed method outperforms existing state-of-the-art methods with respect to both success rate and solution quality, while also being more efficient.

* Presented at ICML 2020
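
To make the "A*-like best-first search" idea in the abstract above concrete, here is a minimal sketch of generic A*-style search on a toy graph. It is not the Retro* algorithm: it searches a plain graph rather than an AND-OR tree, and the heuristic is a hand-written table standing in for the learned neural network. The names `best_first_search`, `GRAPH`, and `H` are hypothetical and chosen only for illustration.

```python
import heapq


def best_first_search(start, goal, neighbors, heuristic):
    """Expand the frontier node with the lowest (cost so far + heuristic)."""
    # Each frontier entry: (priority, cost so far, node, path to node).
    frontier = [(heuristic(start), 0.0, start, [start])]
    best_cost = {start: 0.0}
    while frontier:
        _, cost, node, path = heapq.heappop(frontier)
        if node == goal:
            return cost, path
        if cost > best_cost.get(node, float("inf")):
            continue  # stale entry: a cheaper route to this node was found
        for nxt, step_cost in neighbors(node):
            new_cost = cost + step_cost
            if new_cost < best_cost.get(nxt, float("inf")):
                best_cost[nxt] = new_cost
                priority = new_cost + heuristic(nxt)
                heapq.heappush(frontier, (priority, new_cost, nxt, path + [nxt]))
    return None  # goal unreachable


# Toy graph (assumption): node -> list of (neighbor, step cost).
GRAPH = {
    "A": [("B", 1.0), ("C", 4.0)],
    "B": [("C", 1.0), ("D", 5.0)],
    "C": [("D", 1.0)],
    "D": [],
}
# Hand-written estimate of remaining cost; a learned model would go here.
H = {"A": 2.0, "B": 2.0, "C": 1.0, "D": 0.0}

result = best_first_search("A", "D", lambda n: GRAPH[n], lambda n: H[n])
print(result)
```

In this sketch the heuristic only changes the order in which nodes are expanded. With a heuristic that never overestimates the remaining cost, as in the toy table, the first path returned is also a cheapest one.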

GLAD: Learning Sparse Graph Recovery

Jun 01, 2019
Harsh Shrivastava, Xinshi Chen, Binghong Chen, Guanghui Lan, Srinivas Aluru, Le Song

Recovering sparse conditional independence graphs from data is a fundamental problem in machine learning with wide applications. A popular formulation of the problem is an $\ell_1$ regularized maximum likelihood estimation. Many convex optimization algorithms have been designed to solve this formulation to recover the graph structure. Recently, there has been a surge of interest in learning algorithms directly from data, and in this case, learning to map the empirical covariance to the sparse precision matrix. However, this is a challenging task, since the symmetric positive definiteness (SPD) and sparsity of the matrix are not easy to enforce in learned algorithms, and a direct mapping from data to precision matrix may contain many parameters. We propose a deep learning architecture, GLAD, which uses an Alternating Minimization (AM) algorithm as our model inductive bias, and learns the model parameters via supervised learning. We show that GLAD learns a very compact and effective model for recovering sparse graphs from data.
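
For reference, the $\ell_1$ regularized maximum likelihood formulation mentioned in the abstract above is commonly written as the graphical lasso objective below. The symbols are assumptions not given in the abstract: $\widehat{\Sigma}$ is the empirical covariance, $\Theta$ is the precision matrix constrained to be positive definite, and $\rho \ge 0$ is the regularization weight. Conventions differ on whether the diagonal entries of $\Theta$ are included in the penalty.

$$
\widehat{\Theta} = \arg\min_{\Theta \succ 0} \; -\log\det\Theta + \operatorname{tr}\!\left(\widehat{\Sigma}\,\Theta\right) + \rho\,\lVert \Theta \rVert_1
$$

The first two terms are the negative Gaussian log-likelihood up to constants, and the $\ell_1$ term encourages zeros in $\Theta$, which correspond to missing edges in the conditional independence graph.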

Learning to Plan via Neural Exploration-Exploitation Trees

Mar 26, 2019
Binghong Chen, Bo Dai, Le Song

Sampling-based planning algorithms such as RRT and its variants are powerful tools for path planning problems in high-dimensional continuous state and action spaces. While these algorithms perform systematic exploration of the state space, they do not fully exploit past planning experiences from similar environments. In this paper, we design a meta path planning algorithm, called Neural Exploration-Exploitation Trees (NEXT), which can utilize prior experience to drastically reduce the sample requirement for solving new path planning problems. More specifically, NEXT contains a novel neural architecture which can learn from experience the dependency between task structures and promising path search directions. This learned prior is then integrated with a UCB-type algorithm to achieve an online balance between exploration and exploitation when solving a new problem. Empirically, we show that NEXT can complete the planning tasks with very small search trees and significantly outperforms previous state-of-the-art methods on several benchmark problems.

* 25 pages, 60 figures
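
As a rough illustration of the "UCB-type" balance between exploration and exploitation mentioned in the abstract above, the sketch below implements the classic UCB1 selection rule for a small set of candidates. It is a textbook bandit rule, not the selection rule of NEXT itself. The function name `ucb_select`, the exploration constant `c`, and the example numbers are assumptions made for illustration.

```python
import math


def ucb_select(values, counts, c=1.0):
    """Pick the index maximizing value estimate + exploration bonus (UCB1).

    values[i] is the current value estimate of candidate i and counts[i]
    is how often it has been tried. Untried candidates are picked first.
    """
    total = sum(counts)
    best_index, best_score = None, -math.inf
    for i, (value, n) in enumerate(zip(values, counts)):
        if n == 0:
            return i  # always try an unvisited candidate before scoring
        bonus = c * math.sqrt(math.log(total) / n)
        score = value + bonus
        if score > best_score:
            best_index, best_score = i, score
    return best_index


# Candidate 0 looks better on average but has been tried 10 times;
# candidate 1 has been tried once, so its exploration bonus is larger.
print(ucb_select([0.5, 0.4], [10, 1]))
# With c=0.0 the bonus vanishes and the choice is purely greedy.
print(ucb_select([0.5, 0.4], [10, 1], c=0.0))
```

In a learned planner, the value estimates would come from a trained model rather than running averages; the bonus term is what keeps rarely tried directions from being ignored.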

A Communication-Efficient Parallel Method for Group-Lasso

Dec 07, 2016
Binghong Chen, Jun Zhu

Group-Lasso (gLasso) identifies important explanatory factors in predicting the response variable by considering the grouping structure over input variables. However, most existing algorithms for gLasso do not scale to large datasets, which are becoming the norm in many applications. In this paper, we present a divide-and-conquer based parallel algorithm (DC-gLasso) to scale up gLasso in regression tasks with grouping structures. DC-gLasso only needs two iterations to collect and aggregate the local estimates on subsets of the data, and provably recovers the true model under certain conditions. We further extend it to deal with overlaps between groups. Empirical results on a wide range of synthetic and real-world datasets show that DC-gLasso can significantly improve time efficiency without sacrificing regression accuracy.

* 7 pages
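
To show how a group penalty selects or discards whole groups of variables, as described in the abstract above, here is a minimal sketch of group soft-thresholding, the proximal operator of the Euclidean norm that many group-lasso solvers apply to each group of coefficients. It is a standard building block and not the DC-gLasso algorithm. The function name `group_soft_threshold` and the threshold `t` are assumptions made for illustration.

```python
import math


def group_soft_threshold(group, t):
    """Shrink a coefficient group toward zero by t in Euclidean norm.

    If the group's norm is at most t, the whole group is set to zero,
    which is how the group penalty removes entire groups at once.
    """
    norm = math.sqrt(sum(x * x for x in group))
    if norm <= t:
        return [0.0] * len(group)
    scale = 1.0 - t / norm
    return [scale * x for x in group]


# A group with norm 5 is shrunk but kept; a group with norm 0.5 is dropped.
kept = group_soft_threshold([3.0, 4.0], 1.0)
dropped = group_soft_threshold([0.3, 0.4], 1.0)
print(kept, dropped)
```

Unlike the elementwise soft-thresholding used for the ordinary lasso, the shrinkage here depends on the norm of the whole group, so the coefficients in a group are kept or zeroed together.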
