Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jianshu Chen

Incorporating Structured Commonsense Knowledge in Story Completion

Nov 01, 2018
Jiaao Chen, Jianshu Chen, Zhou Yu

Figure 1 for Incorporating Structured Commonsense Knowledge in Story Completion

Figure 2 for Incorporating Structured Commonsense Knowledge in Story Completion

Figure 3 for Incorporating Structured Commonsense Knowledge in Story Completion

Figure 4 for Incorporating Structured Commonsense Knowledge in Story Completion

The ability to select an appropriate story ending is the first step towards perfect narrative comprehension. Story ending prediction requires not only the explicit clues within the context, but also the implicit knowledge (such as commonsense) to construct a reasonable and consistent story. However, most previous approaches do not explicitly use background commonsense knowledge. We present a neural story ending selection model that integrates three types of information: narrative sequence, sentiment evolution and commonsense knowledge. Experiments show that our model outperforms state-of-the-art approaches on a public dataset, ROCStory Cloze Task , and the performance gain from adding the additional commonsense knowledge is significant.

Via

Access Paper or Ask Questions

M-Walk: Learning to Walk over Graphs using Monte Carlo Tree Search

Nov 01, 2018
Yelong Shen, Jianshu Chen, Po-Sen Huang, Yuqing Guo, Jianfeng Gao

Figure 1 for M-Walk: Learning to Walk over Graphs using Monte Carlo Tree Search

Figure 2 for M-Walk: Learning to Walk over Graphs using Monte Carlo Tree Search

Figure 3 for M-Walk: Learning to Walk over Graphs using Monte Carlo Tree Search

Figure 4 for M-Walk: Learning to Walk over Graphs using Monte Carlo Tree Search

Learning to walk over a graph towards a target node for a given query and a source node is an important problem in applications such as knowledge base completion (KBC). It can be formulated as a reinforcement learning (RL) problem with a known state transition model. To overcome the challenge of sparse rewards, we develop a graph-walking agent called M-Walk, which consists of a deep recurrent neural network (RNN) and Monte Carlo Tree Search (MCTS). The RNN encodes the state (i.e., history of the walked path) and maps it separately to a policy and Q-values. In order to effectively train the agent from sparse rewards, we combine MCTS with the neural policy to generate trajectories yielding more positive rewards. From these trajectories, the network is improved in an off-policy manner using Q-learning, which modifies the RNN policy via parameter sharing. Our proposed RL algorithm repeatedly applies this policy-improvement step to learn the model. At test time, MCTS is combined with the neural policy to predict the target node. Experimental results on several graph-walking benchmarks show that M-Walk is able to learn better policies than other RL-based methods, which are mainly based on policy gradients. M-Walk also outperforms traditional KBC baselines.

* Yelong Shen, Jianshu Chen and Po-Sen Huang contributed equally to the paper. Published at NIPS 2018

Via

Access Paper or Ask Questions

P-MCGS: Parallel Monte Carlo Acyclic Graph Search

Oct 28, 2018
Chen Yu, Jianshu Chen, Jie Zhong, Ji Liu

Figure 1 for P-MCGS: Parallel Monte Carlo Acyclic Graph Search

Figure 2 for P-MCGS: Parallel Monte Carlo Acyclic Graph Search

Figure 3 for P-MCGS: Parallel Monte Carlo Acyclic Graph Search

Figure 4 for P-MCGS: Parallel Monte Carlo Acyclic Graph Search

Recently, there have been great interests in Monte Carlo Tree Search (MCTS) in AI research. Although the sequential version of MCTS has been studied widely, its parallel counterpart still lacks systematic study. This leads us to the following questions: \emph{how to design efficient parallel MCTS (or more general cases) algorithms with rigorous theoretical guarantee? Is it possible to achieve linear speedup?} In this paper, we consider the search problem on a more general acyclic one-root graph (namely, Monte Carlo Graph Search (MCGS)), which generalizes MCTS. We develop a parallel algorithm (P-MCGS) to assign multiple workers to investigate appropriate leaf nodes simultaneously. Our analysis shows that P-MCGS algorithm achieves linear speedup and that the sample complexity is comparable to its sequential counterpart.

Via

Access Paper or Ask Questions

XL-NBT: A Cross-lingual Neural Belief Tracking Framework

Aug 25, 2018
Wenhu Chen, Jianshu Chen, Yu Su, Xin Wang, Dong Yu, Xifeng Yan, William Yang Wang

Figure 1 for XL-NBT: A Cross-lingual Neural Belief Tracking Framework

Figure 2 for XL-NBT: A Cross-lingual Neural Belief Tracking Framework

Figure 3 for XL-NBT: A Cross-lingual Neural Belief Tracking Framework

Figure 4 for XL-NBT: A Cross-lingual Neural Belief Tracking Framework

Task-oriented dialog systems are becoming pervasive, and many companies heavily rely on them to complement human agents for customer service in call centers. With globalization, the need for providing cross-lingual customer support becomes more urgent than ever. However, cross-lingual support poses great challenges---it requires a large amount of additional annotated data from native speakers. In order to bypass the expensive human annotation and achieve the first step towards the ultimate goal of building a universal dialog system, we set out to build a cross-lingual state tracking framework. Specifically, we assume that there exists a source language with dialog belief tracking annotations while the target languages have no annotated dialog data of any form. Then, we pre-train a state tracker for the source language as a teacher, which is able to exploit easy-to-access parallel data. We then distill and transfer its own knowledge to the student state tracker in target languages. We specifically discuss two types of common parallel resources: bilingual corpus and bilingual dictionary, and design different transfer learning strategies accordingly. Experimentally, we successfully use English state tracker as the teacher to transfer its knowledge to both Italian and German trackers and achieve promising results.

* 13 pages, 5 figures, 3 tables, accepted to EMNLP 2018 conference

Via

Access Paper or Ask Questions

SBEED: Convergent Reinforcement Learning with Nonlinear Function Approximation

Jun 05, 2018
Bo Dai, Albert Shaw, Lihong Li, Lin Xiao, Niao He, Zhen Liu, Jianshu Chen, Le Song

Figure 1 for SBEED: Convergent Reinforcement Learning with Nonlinear Function Approximation

Figure 2 for SBEED: Convergent Reinforcement Learning with Nonlinear Function Approximation

Figure 3 for SBEED: Convergent Reinforcement Learning with Nonlinear Function Approximation

When function approximation is used, solving the Bellman optimality equation with stability guarantees has remained a major open problem in reinforcement learning for decades. The fundamental difficulty is that the Bellman operator may become an expansion in general, resulting in oscillating and even divergent behavior of popular algorithms like Q-learning. In this paper, we revisit the Bellman equation, and reformulate it into a novel primal-dual optimization problem using Nesterov's smoothing technique and the Legendre-Fenchel transformation. We then develop a new algorithm, called Smoothed Bellman Error Embedding, to solve this optimization problem where any differentiable function class may be used. We provide what we believe to be the first convergence guarantee for general nonlinear function approximation, and analyze the algorithm's sample complexity. Empirically, our algorithm compares favorably to state-of-the-art baselines in several benchmark control problems.

* 28 pages, 13 figures. To appear at the 35th International Conference on Machine Learning (ICML 2018)

Via

Access Paper or Ask Questions

A Learning-to-Infer Method for Real-Time Power Grid Topology Identification

Oct 21, 2017
Yue Zhao, Jianshu Chen, H. Vincent Poor

Figure 1 for A Learning-to-Infer Method for Real-Time Power Grid Topology Identification

Figure 2 for A Learning-to-Infer Method for Real-Time Power Grid Topology Identification

Figure 3 for A Learning-to-Infer Method for Real-Time Power Grid Topology Identification

Figure 4 for A Learning-to-Infer Method for Real-Time Power Grid Topology Identification

Identifying arbitrary topologies of power networks in real time is a computationally hard problem due to the number of hypotheses that grows exponentially with the network size. A new "Learning-to-Infer" variational inference method is developed for efficient inference of every line status in the network. Optimizing the variational model is transformed to and solved as a discriminative learning problem based on Monte Carlo samples generated with power flow simulations. A major advantage of the developed Learning-to-Infer method is that the labeled data used for training can be generated in an arbitrarily large amount fast and at very little cost. As a result, the power of offline training is fully exploited to learn very complex classifiers for effective real-time topology identification. The proposed methods are evaluated in the IEEE 30, 118 and 300 bus systems. Excellent performance in identifying arbitrary power network topologies in real time is achieved even with relatively simple variational models and a reasonably small amount of data.

Via

Access Paper or Ask Questions

Stochastic Variance Reduction Methods for Policy Evaluation

Jun 09, 2017
Simon S. Du, Jianshu Chen, Lihong Li, Lin Xiao, Dengyong Zhou

Figure 1 for Stochastic Variance Reduction Methods for Policy Evaluation

Figure 2 for Stochastic Variance Reduction Methods for Policy Evaluation

Figure 3 for Stochastic Variance Reduction Methods for Policy Evaluation

Figure 4 for Stochastic Variance Reduction Methods for Policy Evaluation

Policy evaluation is a crucial step in many reinforcement-learning procedures, which estimates a value function that predicts states' long-term value under a given policy. In this paper, we focus on policy evaluation with linear function approximation over a fixed dataset. We first transform the empirical policy evaluation problem into a (quadratic) convex-concave saddle point problem, and then present a primal-dual batch gradient method, as well as two stochastic variance reduction methods for solving the problem. These algorithms scale linearly in both sample size and feature dimension. Moreover, they achieve linear convergence even when the saddle-point problem has only strong concavity in the dual variables but no strong convexity in the primal variables. Numerical experiments on benchmark problems demonstrate the effectiveness of our methods.

* Accepted by ICML 2017

Via

Access Paper or Ask Questions

Unsupervised Sequence Classification using Sequential Output Statistics

May 26, 2017
Yu Liu, Jianshu Chen, Li Deng

Figure 1 for Unsupervised Sequence Classification using Sequential Output Statistics

Figure 2 for Unsupervised Sequence Classification using Sequential Output Statistics

Figure 3 for Unsupervised Sequence Classification using Sequential Output Statistics

Figure 4 for Unsupervised Sequence Classification using Sequential Output Statistics

We consider learning a sequence classifier without labeled data by using sequential output statistics. The problem is highly valuable since obtaining labels in training data is often costly, while the sequential output statistics (e.g., language models) could be obtained independently of input data and thus with low or no cost. To address the problem, we propose an unsupervised learning cost function and study its properties. We show that, compared to earlier works, it is less inclined to be stuck in trivial solutions and avoids the need for a strong generative model. Although it is harder to optimize in its functional form, a stochastic primal-dual gradient method is developed to effectively solve the problem. Experiment results on real-world datasets demonstrate that the new unsupervised learning method gives drastically lower errors than other baseline methods. Specifically, it reaches test errors about twice of those obtained by fully supervised learning.

* All authors contributed equally to the paper. 17 pages, 7 figures and 2 tables

Via

Access Paper or Ask Questions

Character-level Deep Conflation for Business Data Analytics

Feb 08, 2017
Zhe Gan, P. D. Singh, Ameet Joshi, Xiaodong He, Jianshu Chen, Jianfeng Gao, Li Deng

Figure 1 for Character-level Deep Conflation for Business Data Analytics

Figure 2 for Character-level Deep Conflation for Business Data Analytics

Figure 3 for Character-level Deep Conflation for Business Data Analytics

Figure 4 for Character-level Deep Conflation for Business Data Analytics

Connecting different text attributes associated with the same entity (conflation) is important in business data analytics since it could help merge two different tables in a database to provide a more comprehensive profile of an entity. However, the conflation task is challenging because two text strings that describe the same entity could be quite different from each other for reasons such as misspelling. It is therefore critical to develop a conflation model that is able to truly understand the semantic meaning of the strings and match them at the semantic level. To this end, we develop a character-level deep conflation model that encodes the input text strings from character level into finite dimension feature vectors, which are then used to compute the cosine similarity between the text strings. The model is trained in an end-to-end manner using back propagation and stochastic gradient descent to maximize the likelihood of the correct association. Specifically, we propose two variants of the deep conflation model, based on long-short-term memory (LSTM) recurrent neural network (RNN) and convolutional neural network (CNN), respectively. Both models perform well on a real-world business analytics dataset and significantly outperform the baseline bag-of-character (BoC) model.

* Accepted for publication, at ICASSP 2017

Via

Access Paper or Ask Questions

Deep Reinforcement Learning with a Combinatorial Action Space for Predicting Popular Reddit Threads

Sep 17, 2016
Ji He, Mari Ostendorf, Xiaodong He, Jianshu Chen, Jianfeng Gao, Lihong Li, Li Deng

Figure 1 for Deep Reinforcement Learning with a Combinatorial Action Space for Predicting Popular Reddit Threads

Figure 2 for Deep Reinforcement Learning with a Combinatorial Action Space for Predicting Popular Reddit Threads

Figure 3 for Deep Reinforcement Learning with a Combinatorial Action Space for Predicting Popular Reddit Threads

Figure 4 for Deep Reinforcement Learning with a Combinatorial Action Space for Predicting Popular Reddit Threads

We introduce an online popularity prediction and tracking task as a benchmark task for reinforcement learning with a combinatorial, natural language action space. A specified number of discussion threads predicted to be popular are recommended, chosen from a fixed window of recent comments to track. Novel deep reinforcement learning architectures are studied for effective modeling of the value function associated with actions comprised of interdependent sub-actions. The proposed model, which represents dependence between sub-actions through a bi-directional LSTM, gives the best performance across different experimental configurations and domains, and it also generalizes well with varying numbers of recommendation requests.

* To be published in EMNLP 2016, 11 pages

Via

Access Paper or Ask Questions