
Title: Interpretable and Low-Resource Entity Matching via Decoupling Feature Learning from Decision Making

Jun 08, 2021

Zijun Yao, Chengjiang Li, Tiansi Dong, Xin Lv, Jifan Yu, Lei Hou, Juanzi Li, Yichi Zhang, Zelin Dai


Abstract: Entity Matching (EM) aims at recognizing entity records that denote the same real-world object. Neural EM models learn vector representations of entity descriptions and match entities end-to-end. Though robust, these methods require substantial resources for training and lack interpretability. In this paper, we propose a novel EM framework that consists of Heterogeneous Information Fusion (HIF) and Key Attribute Tree (KAT) Induction to decouple feature representation from matching decision. Using self-supervised learning and the masking mechanism of pre-trained language modeling, HIF learns the embeddings of noisy attribute values through inter-attribute attention on unlabeled data. Using a set of comparison features and a limited amount of annotated data, KAT Induction learns an efficient decision tree that can be interpreted by generating entity matching rules whose structure is advocated by domain experts. Experiments on 6 public datasets and 3 industrial datasets show that our method is highly efficient and outperforms state-of-the-art (SOTA) EM models in most cases. Our code and datasets can be obtained from https://github.com/THU-KEG/HIF-KAT.
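The decoupling described in the abstract (compute per-attribute comparison features first, then learn a small decision tree whose paths read as matching rules) can be illustrated with a minimal sketch. This is not the authors' HIF or KAT implementation: token-level Jaccard similarity stands in for comparisons over learned embeddings, the tree learner is a plain Gini-based splitter, and every name here (`jaccard`, `compare`, `induce`, `rules`, `predict`, the toy records) is hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


def jaccard(a: str, b: str) -> float:
    """Token-level Jaccard similarity (an assumed stand-in comparison feature)."""
    ta, tb = set(a.lower().split()), set(b.lower().split())
    if not ta and not tb:
        return 1.0
    return len(ta & tb) / len(ta | tb)


def compare(rec_a: dict, rec_b: dict, attrs: list) -> list:
    """One similarity score per attribute for a pair of records."""
    return [jaccard(rec_a.get(k, ""), rec_b.get(k, "")) for k in attrs]


def gini(labels: list) -> float:
    """Gini impurity of a list of binary labels."""
    p = sum(labels) / len(labels)
    return 2 * p * (1 - p)


@dataclass
class Node:
    attr: Optional[int] = None       # index of the attribute tested at this node
    threshold: float = 0.0
    left: Optional["Node"] = None    # branch for similarity < threshold
    right: Optional["Node"] = None   # branch for similarity >= threshold
    label: Optional[bool] = None     # set only on leaves


def induce(X: list, y: list, depth: int = 2) -> Node:
    """Greedy depth-limited tree induction over comparison features."""
    if depth == 0 or len(set(y)) <= 1:
        return Node(label=sum(y) * 2 >= len(y))
    best = None
    for j in range(len(X[0])):
        for t in sorted({row[j] for row in X}):
            lo = [lab for row, lab in zip(X, y) if row[j] < t]
            hi = [lab for row, lab in zip(X, y) if row[j] >= t]
            if not lo or not hi:
                continue
            score = (len(lo) * gini(lo) + len(hi) * gini(hi)) / len(y)
            if best is None or score < best[0]:
                best = (score, j, t)
    if best is None:  # no split separates the data
        return Node(label=sum(y) * 2 >= len(y))
    _, j, t = best
    lo_idx = [i for i, row in enumerate(X) if row[j] < t]
    hi_idx = [i for i, row in enumerate(X) if row[j] >= t]
    return Node(
        attr=j,
        threshold=t,
        left=induce([X[i] for i in lo_idx], [y[i] for i in lo_idx], depth - 1),
        right=induce([X[i] for i in hi_idx], [y[i] for i in hi_idx], depth - 1),
    )


def rules(node: Node, attrs: list, conds: tuple = ()) -> list:
    """Render every root-to-leaf path that predicts a match as a readable rule."""
    if node.label is not None:
        return [" AND ".join(conds) or "TRUE"] if node.label else []
    name, t = attrs[node.attr], node.threshold
    return (rules(node.left, attrs, conds + (f"{name} < {t:.2f}",))
            + rules(node.right, attrs, conds + (f"{name} >= {t:.2f}",)))


def predict(node: Node, x: list) -> bool:
    """Follow the tree for one feature vector."""
    while node.label is None:
        node = node.right if x[node.attr] >= node.threshold else node.left
    return node.label


# Toy labeled pairs (made up for illustration; 1 = same real-world entity).
ATTRS = ["title", "brand"]
PAIRS = [
    ({"title": "apple iphone 12 64gb", "brand": "apple"},
     {"title": "iphone 12 64gb apple", "brand": "apple"}, 1),
    ({"title": "apple iphone 12", "brand": "apple"},
     {"title": "apple ipad pro", "brand": "apple"}, 0),
    ({"title": "sony wh-1000xm4 headphones", "brand": "sony"},
     {"title": "sony wh-1000xm4 wireless headphones", "brand": "sony"}, 1),
    ({"title": "dell xps 13", "brand": "dell"},
     {"title": "hp spectre 13", "brand": "hp"}, 0),
]
X = [compare(a, b, ATTRS) for a, b, _ in PAIRS]
y = [lab for _, _, lab in PAIRS]
tree = induce(X, y)
match_rules = rules(tree, ATTRS)
```

On this toy data the induced tree splits on title similarity alone, so `match_rules` holds a single human-readable condition. The point of the design is that the decision step stays small and inspectable regardless of how the comparison features were produced.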
