Jiahai Wang

Neural Relational Inference with Efficient Message Passing Mechanisms

Jan 23, 2021

Siyuan Chen, Jiahai Wang, Guoqing Li

Abstract: Many complex processes can be viewed as dynamical systems of interacting agents. In many cases, only the state sequences of individual agents are observed, while the interacting relations and the dynamical rules are unknown. The neural relational inference (NRI) model adopts graph neural networks that pass messages over a latent graph to jointly learn the relations and the dynamics based on the observed data. However, NRI infers the relations independently and suffers from error accumulation in multi-step prediction during dynamics learning. In addition, relation reconstruction without prior knowledge becomes more difficult in more complex systems. This paper introduces efficient message passing mechanisms to the graph neural networks with structural prior knowledge to address these problems. A relation interaction mechanism is proposed to capture the coexistence of all relations, and a spatio-temporal message passing mechanism is proposed to use historical information to alleviate error accumulation. Additionally, structural prior knowledge, with symmetry as a special case, is introduced for better relation prediction in more complex systems. The experimental results on simulated physics systems show that the proposed method outperforms existing state-of-the-art methods.

* Accepted by AAAI 2021, 13 pages, 9 figures, 4 tables
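
As a rough illustration of what "passing messages over a latent graph" means, the sketch below runs one node-to-edge and edge-to-node round over a small graph. It is not the paper's model: the function name `message_passing_step`, the difference-based message, and the fixed edge probabilities are all assumptions chosen to keep the example readable, and nothing here is learned.

```python
# Illustrative sketch only: a hand-written node-to-edge / edge-to-node pass
# over a latent graph. All values are fixed numbers, not learned parameters.

def message_passing_step(states, edge_probs):
    """Run one round of message passing.

    states:     list of node states (one float per agent).
    edge_probs: dict mapping (sender, receiver) to the assumed probability
                that an interaction exists on that directed edge.
    Returns the list of updated node states.
    """
    incoming = [0.0] * len(states)
    for (sender, receiver), prob in edge_probs.items():
        # Node-to-edge: build a message from the two endpoint states.
        message = states[sender] - states[receiver]
        # Edge-to-node: accumulate messages weighted by the edge probability.
        incoming[receiver] += prob * message
    # Residual update: each node moves by its aggregated incoming message.
    return [s + m for s, m in zip(states, incoming)]


states = [0.0, 1.0, 4.0]
edge_probs = {(0, 1): 1.0, (1, 0): 1.0, (2, 1): 0.5}
new_states = message_passing_step(states, edge_probs)
print(new_states)  # [1.0, 1.5, 4.0]
```

Node 2 has no incoming edge, so its state is unchanged, while node 1 combines a full-strength message from node 0 with a half-strength message from node 2.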

Neural Deepfake Detection with Factual Structure of Text

Oct 15, 2020

Wanjun Zhong, Duyu Tang, Zenan Xu, Ruize Wang, Nan Duan, Ming Zhou, Jiahai Wang, Jian Yin

Abstract: Deepfake detection, the task of automatically discriminating machine-generated text, is increasingly critical with recent advances in natural language generative models. Existing approaches to deepfake detection typically represent documents with coarse-grained representations. However, they struggle to capture the factual structure of documents, which is a discriminative factor between machine-generated and human-written text according to our statistical analysis. To address this, we propose a graph-based model that utilizes the factual structure of a document for deepfake detection of text. Our approach represents the factual structure of a given document as an entity graph, which is further utilized to learn sentence representations with a graph neural network. Sentence representations are then composed into a document representation for making predictions, where consistent relations between neighboring sentences are sequentially modeled. Results of experiments on two public deepfake datasets show that our approach significantly improves strong base models built with RoBERTa. Model analysis further indicates that our model can distinguish the difference in the factual structure between machine-generated text and human-written text.

* EMNLP 2020; 10 pages
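
One simple way to picture an entity graph is to link entities that appear in the same sentence. The sketch below does only that. The abstract does not say how edges are defined, so the co-occurrence rule, the function name `build_entity_graph`, and the pre-extracted entity lists are assumptions for illustration.

```python
from itertools import combinations

# Hypothetical sketch: entities are assumed to be already extracted per
# sentence, and two entities are linked when they share a sentence.

def build_entity_graph(sentence_entities):
    """Return an adjacency dict {entity: set of co-occurring entities}."""
    graph = {}
    for entities in sentence_entities:
        unique = sorted(set(entities))
        for entity in unique:
            graph.setdefault(entity, set())
        # Connect every pair of entities mentioned in this sentence.
        for left, right in combinations(unique, 2):
            graph[left].add(right)
            graph[right].add(left)
    return graph


sentence_entities = [
    ["Paris", "France"],
    ["France", "Macron"],
    ["Berlin"],
]
entity_graph = build_entity_graph(sentence_entities)
```

In this toy document, "France" bridges the first two sentences, while "Berlin" is an isolated node.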

LogicalFactChecker: Leveraging Logical Operations for Fact Checking with Graph Module Network

Apr 28, 2020

Wanjun Zhong, Duyu Tang, Zhangyin Feng, Nan Duan, Ming Zhou, Ming Gong, Linjun Shou, Daxin Jiang, Jiahai Wang, Jian Yin

Abstract: Verifying the correctness of a textual statement requires not only semantic reasoning about the meaning of words, but also symbolic reasoning about logical operations like count, superlative, aggregation, etc. In this work, we propose LogicalFactChecker, a neural network approach capable of leveraging logical operations for fact checking. It achieves state-of-the-art performance on TABFACT, a large-scale benchmark dataset built for verifying a textual statement with semi-structured tables. This is achieved by a graph module network built upon the Transformer-based architecture. With a textual statement and a table as the input, LogicalFactChecker automatically derives a program (a.k.a. logical form) of the statement in a semantic parsing manner. A heterogeneous graph is then constructed to capture not only the structures of the table and the program, but also the connections between inputs with different modalities. Such a graph reveals the related contexts of each word in the statement, the table and the program. The graph is used to obtain graph-enhanced contextual representations of words in the Transformer-based architecture. After that, a program-driven module network is further introduced to exploit the hierarchical structure of the program, where semantic compositionality is dynamically modeled along the program structure with a set of function-specific modules. Ablation experiments suggest that both the heterogeneous graph and the module network are important to obtain strong results.

* 13 pages; 7 figures; Accepted by ACL 2020 as a long paper
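
To make the idea of a program with function-specific modules more concrete, here is a purely symbolic toy interpreter that evaluates a nested program over a small table. It is not the neural module network from the paper: the operation names (`filter_eq`, `count`, `max`, `eq`), the `all_rows` keyword, and the table itself are hypothetical, and each "module" is an ordinary Python function rather than a learned component.

```python
# Hypothetical mini-interpreter: operation names and table are made up.

TABLE = [
    {"player": "Ann", "team": "Red", "points": 12},
    {"player": "Bob", "team": "Blue", "points": 7},
    {"player": "Cy", "team": "Red", "points": 9},
]

# One function per operation, looked up by name while walking the program.
MODULES = {
    "filter_eq": lambda rows, column, value: [r for r in rows if r[column] == value],
    "count": lambda rows: len(rows),
    "max": lambda rows, column: max(r[column] for r in rows),
    "eq": lambda left, right: left == right,
}


def execute(program, table):
    """Recursively evaluate a nested-tuple program against a table."""
    if program == "all_rows":
        return table
    if not isinstance(program, tuple):
        return program  # a literal such as a column name or a number
    op, *args = program
    return MODULES[op](*(execute(arg, table) for arg in args))


# Statement: "Two players are on the Red team."
program = ("eq", ("count", ("filter_eq", "all_rows", "team", "Red")), 2)
verdict = execute(program, TABLE)
print(verdict)  # True
```

The evaluation order follows the tree shape of the program, which is the sense in which composition happens "along the program structure".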

A Heterogeneous Graph with Factual, Temporal and Logical Knowledge for Question Answering Over Dynamic Contexts

Apr 25, 2020

Wanjun Zhong, Duyu Tang, Nan Duan, Ming Zhou, Jiahai Wang, Jian Yin

Abstract: We study question answering over a dynamic textual environment. Although neural network models achieve impressive accuracy via learning from input-output examples, they rarely leverage various types of knowledge and are generally not interpretable. In this work, we propose a graph-based approach, where a heterogeneous graph is automatically built with factual knowledge of the context, temporal knowledge of the past states, and logical knowledge that combines human-curated knowledge bases and rule bases. We develop a graph neural network over the constructed graph, and train the model in an end-to-end manner. Experimental results on a benchmark dataset show that the injection of various types of knowledge improves a strong neural network baseline. An additional benefit of our approach is that the graph itself naturally serves as a rationale behind the decision making.

* 9 pages

A Novel Framework with Information Fusion and Neighborhood Enhancement for User Identity Linkage

Mar 16, 2020

Siyuan Chen, Jiahai Wang, Xin Du, Yanqing Hu

Abstract: User identity linkage across social networks is an essential problem for cross-network data mining. Since network structure, profile and content information describe different aspects of users, it is critical to learn effective user representations that integrate heterogeneous information. This paper proposes a novel framework with INformation FUsion and Neighborhood Enhancement (INFUNE) for user identity linkage. The information fusion component adopts a group of encoders and decoders to fuse heterogeneous information and generate discriminative node embeddings for preliminary matching. Then, these embeddings are fed to the neighborhood enhancement component, a novel graph neural network, to produce adaptive neighborhood embeddings that reflect the overlapping degree of neighborhoods of varying candidate user pairs. The node embeddings and neighborhood embeddings are then weighted by importance for the final prediction. The proposed method is evaluated on real-world social network data. The experimental results show that INFUNE significantly outperforms existing state-of-the-art methods.

* 8 pages, 7 figures, accepted by ECAI 2020

MODRL/D-AM: Multiobjective Deep Reinforcement Learning Algorithm Using Decomposition and Attention Model for Multiobjective Optimization

Feb 13, 2020

Hong Wu, Jiahai Wang, Zizhen Zhang

Abstract: Recently, a deep reinforcement learning method was proposed to solve multiobjective optimization problems. In this method, the multiobjective optimization problem is decomposed into a number of single-objective optimization subproblems and all the subproblems are optimized in a collaborative manner. Each subproblem is modeled with a pointer network and the model is trained with reinforcement learning. However, when the pointer network extracts the features of an instance, it ignores the underlying structure information of the input nodes. Thus, this paper proposes a multiobjective deep reinforcement learning method using decomposition and an attention model to solve multiobjective optimization problems. In our method, each subproblem is solved by an attention model, which can exploit the structure features as well as node features of input nodes. Experimental results on the multiobjective travelling salesman problem show that the proposed algorithm achieves better performance than the previous method.
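
The decomposition step can be sketched with a weighted sum, one common way to turn several objectives into a single one. The abstract does not name the scalarization used, so the weighted-sum choice, the two-coordinate-set formulation of the bi-objective travelling salesman problem, and the helper names below are assumptions for illustration only.

```python
import math

# Minimal sketch, assuming weighted-sum scalarization of two tour lengths.

def weight_vectors(num_subproblems):
    """Evenly spaced weights (w1, w2) with w1 + w2 = 1, one per subproblem."""
    if num_subproblems == 1:
        return [(0.5, 0.5)]
    step = 1.0 / (num_subproblems - 1)
    return [(i * step, 1.0 - i * step) for i in range(num_subproblems)]


def tour_length(tour, coords):
    """Length of a closed tour under one set of city coordinates."""
    return sum(
        math.dist(coords[tour[i]], coords[tour[(i + 1) % len(tour)]])
        for i in range(len(tour))
    )


def scalarized_cost(tour, coords_a, coords_b, weights):
    """Single-objective cost of one subproblem of a bi-objective TSP."""
    w1, w2 = weights
    return w1 * tour_length(tour, coords_a) + w2 * tour_length(tour, coords_b)


coords_a = [(0, 0), (1, 0), (1, 1), (0, 1)]  # objective 1: unit square
coords_b = [(0, 0), (2, 0), (2, 2), (0, 2)]  # objective 2: doubled square
tour = [0, 1, 2, 3]
costs = [scalarized_cost(tour, coords_a, coords_b, w) for w in weight_vectors(3)]
print(costs)  # [8.0, 6.0, 4.0]
```

Each weight vector defines one subproblem, and solving all of them yields a set of trade-off solutions between the two objectives.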

Utilizing BERT Intermediate Layers for Aspect Based Sentiment Analysis and Natural Language Inference

Feb 12, 2020

Youwei Song, Jiahai Wang, Zhiwei Liang, Zhiyue Liu, Tao Jiang

Abstract: Aspect based sentiment analysis aims to identify the sentimental tendency towards a given aspect in text. Fine-tuning of pretrained BERT performs excellently on this task and achieves state-of-the-art performance. Existing BERT-based works only utilize the last output layer of BERT and ignore the semantic knowledge in the intermediate layers. This paper explores the potential of utilizing BERT intermediate layers to enhance the performance of fine-tuning of BERT. To the best of our knowledge, no existing work has explored this direction. To show the generality, we also apply this approach to a natural language inference task. Experimental results demonstrate the effectiveness and generality of the proposed approach.

* 5 pages, 2 figures
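
One way to use intermediate layers is to pool one vector per layer into a single representation instead of keeping only the last layer. The abstract does not say how the layers are combined, so the dot-product attention pooling below, the `query` vector, and the toy two-dimensional layer vectors are assumptions; a real model would take these vectors from BERT's hidden states.

```python
import math

# Hedged sketch: softmax attention over per-layer vectors. The query would
# be a learned parameter in practice; here it is a fixed toy value.

def attention_pool(layer_vectors, query):
    """Return (pooled vector, attention weights) over a list of layer vectors."""
    scores = [sum(q * v for q, v in zip(query, vec)) for vec in layer_vectors]
    peak = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(score - peak) for score in scores]
    total = sum(exps)
    weights = [e / total for e in exps]
    dim = len(layer_vectors[0])
    pooled = [
        sum(w * vec[d] for w, vec in zip(weights, layer_vectors))
        for d in range(dim)
    ]
    return pooled, weights


layer_vectors = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]  # one vector per layer
pooled, weights = attention_pool(layer_vectors, query=[0.0, 0.0])
```

With an all-zero query every layer gets the same weight, so the result reduces to a plain average of the layers; a non-zero query shifts weight toward the layers it aligns with.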

A Deep Reinforcement Learning Algorithm Using Dynamic Attention Model for Vehicle Routing Problems

Feb 09, 2020

Bo Peng, Jiahai Wang, Zizhen Zhang

Abstract: Recent research shows that machine learning has the potential to learn better heuristics than those designed by humans for solving combinatorial optimization problems. The deep neural network is used to characterize the input instance for constructing a feasible solution incrementally. Recently, an attention model was proposed to solve routing problems. In this model, the state of an instance is represented by node features that are fixed over time. However, the state of an instance changes according to the decisions the model makes at different construction steps, and the node features should be updated correspondingly. Therefore, this paper presents a dynamic attention model with a dynamic encoder-decoder architecture, which enables the model to explore node features dynamically and exploit hidden structure information effectively at different construction steps. This paper focuses on a challenging NP-hard problem, the vehicle routing problem. The experiments indicate that our model outperforms the previous methods and also shows good generalization performance.

* 15 pages, 8 figures
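
The point that the instance state changes during construction can be seen in a small capacitated routing loop, where the set of feasible customers is recomputed after every decision. This is not the dynamic attention model: the "policy" below simply picks the first feasible customer, and the names `feasible_mask` and `construct_route` and the use of -1 for the depot are assumptions made for the sketch.

```python
# Illustrative sketch: the state (visited flags, remaining capacity) is
# refreshed at every construction step before the next choice is made.

def feasible_mask(demands, visited, remaining_capacity):
    """True where a customer is unvisited and fits the remaining capacity."""
    return [
        (not seen) and demand <= remaining_capacity
        for demand, seen in zip(demands, visited)
    ]


def construct_route(demands, capacity):
    """Build a route step by step; -1 marks a return to the depot."""
    if any(demand > capacity for demand in demands):
        raise ValueError("a single demand exceeds the vehicle capacity")
    visited = [False] * len(demands)
    remaining = capacity
    route = []
    while not all(visited):
        mask = feasible_mask(demands, visited, remaining)
        if not any(mask):
            route.append(-1)  # nothing fits: go back to the depot and refill
            remaining = capacity
            continue
        choice = mask.index(True)  # stand-in for a learned policy
        visited[choice] = True
        remaining -= demands[choice]
        route.append(choice)
    return route


route = construct_route(demands=[3, 4, 2, 5], capacity=7)
print(route)  # [0, 1, -1, 2, 3]
```

After customers 0 and 1 are served the vehicle is full, so no remaining customer is feasible until it returns to the depot. A model with fixed node features would not see that change.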

CatGAN: Category-aware Generative Adversarial Networks with Hierarchical Evolutionary Learning for Category Text Generation

Nov 20, 2019

Zhiyue Liu, Jiahai Wang, Zhiwei Liang

Abstract: Generating multiple categories of texts is a challenging task and is drawing increasing attention. Since generative adversarial nets (GANs) have shown competitive results on general text generation, they are extended for category text generation in some previous works. However, the complicated model structures and learning strategies limit their performance and exacerbate the training instability. This paper proposes a category-aware GAN (CatGAN) which consists of an efficient category-aware model for category text generation and a hierarchical evolutionary learning algorithm for training our model. The category-aware model directly measures the gap between real samples and generated samples on each category, and reducing this gap guides the model to generate high-quality category samples. The Gumbel-Softmax relaxation further frees our model from complicated learning strategies for updating CatGAN on discrete data. Moreover, focusing only on sample quality normally leads to the mode collapse problem, so a hierarchical evolutionary learning algorithm is introduced to stabilize the training procedure and obtain the trade-off between quality and diversity while training CatGAN. Experimental results demonstrate that CatGAN outperforms most of the existing state-of-the-art methods.

* 15 pages, 4 figures. Accepted by AAAI 2020
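
The Gumbel-Softmax relaxation mentioned in the abstract replaces a discrete token choice with a smooth approximation of a one-hot vector. The sketch below is a generic version of that technique in plain Python, not CatGAN's implementation; the function name, the clamping constant, and the example logits are assumptions.

```python
import math
import random

# Generic Gumbel-Softmax sketch: add Gumbel noise to the logits, divide by
# a temperature, and apply a softmax.

def gumbel_softmax(logits, temperature, rng):
    """Draw a relaxed one-hot sample from the categorical given by logits."""
    noisy = []
    for logit in logits:
        # Clamp the uniform draw away from 0 and 1 so both logs are finite.
        u = min(max(rng.random(), 1e-12), 1.0 - 1e-12)
        gumbel = -math.log(-math.log(u))
        noisy.append((logit + gumbel) / temperature)
    peak = max(noisy)  # subtract the maximum for numerical stability
    exps = [math.exp(value - peak) for value in noisy]
    total = sum(exps)
    return [e / total for e in exps]


logits = [2.0, 0.5, 0.1]
soft = gumbel_softmax(logits, temperature=1.0, rng=random.Random(0))
sharp = gumbel_softmax(logits, temperature=0.1, rng=random.Random(0))
```

Both calls use the same seed and therefore the same noise, so they pick the same category, but the lower temperature pushes the output closer to a one-hot vector.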

Reasoning Over Semantic-Level Graph for Fact Checking

Sep 13, 2019

Wanjun Zhong, Jingjing Xu, Duyu Tang, Zenan Xu, Nan Duan, Ming Zhou, Jiahai Wang, Jian Yin

Abstract: We study fact-checking in this paper, which aims to verify a textual claim given textual evidence (e.g., retrieved sentences from Wikipedia). Existing studies typically either concatenate retrieved sentences as a single string or use feature fusion on top of sentence features, while ignoring semantic-level information including the participants, location, and temporality of an event occurring in a sentence and the relationships among multiple events. Such semantic-level information is crucial for understanding the relational structure of evidence and the deep reasoning procedure over it. In this paper, we address this issue by proposing a graph-based reasoning framework, called the Dynamic REAsoning Machine (DREAM) framework. We first construct a semantic-level graph, where nodes are extracted by semantic role labeling toolkits and are connected by inner- and inter-sentence edges. Given the automatically constructed graph, we use XLNet as the backbone of our approach and propose a graph-based contextual word representation learning module and a graph-based reasoning module to leverage the information of graphs. The first module is designed by considering a claim as a sequence, in which case we use the graph structure to re-define the relative distance of words. On top of this, we propose the second module by considering both the claim and the evidence as graphs and use a graph neural network to capture the semantic relationship at a more abstract level. We conduct experiments on FEVER, a large-scale benchmark dataset for fact-checking. Results show that both of the graph-based modules improve performance. Our system is the state-of-the-art system on the public leaderboard in terms of both accuracy and FEVER score.

* 8 pages, 4 figures
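
A simple reading of "using the graph structure to re-define the relative distance of words" is to measure distance in graph hops rather than in sequence positions. The sketch below computes hop distances with breadth-first search. The abstract does not give the exact definition, so the shortest-path interpretation, the function name, and the toy edges are assumptions.

```python
from collections import deque

# Hedged sketch: hop distance over an undirected word graph via BFS.

def graph_distances(num_words, edges, source):
    """Return {word index: hop distance from source} for reachable words."""
    neighbors = {i: set() for i in range(num_words)}
    for a, b in edges:
        neighbors[a].add(b)
        neighbors[b].add(a)
    distance = {source: 0}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        for nxt in neighbors[node]:
            if nxt not in distance:
                distance[nxt] = distance[node] + 1
                queue.append(nxt)
    return distance


# Six words in a row, plus one hypothetical semantic edge linking word 0 to word 5.
sequence_edges = [(i, i + 1) for i in range(5)]
semantic_edges = [(0, 5)]
distances = graph_distances(6, sequence_edges + semantic_edges, source=0)
```

Word 5 is five positions away from word 0 in the sequence but only one hop away in the graph, which is the kind of shortcut a graph-based distance can expose.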
