Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Dongming Lei

HiExpan: Task-Guided Taxonomy Construction by Hierarchical Tree Expansion

Oct 17, 2019

Jiaming Shen, Zeqiu Wu, Dongming Lei, Chao Zhang, Xiang Ren, Michelle T. Vanni, Brian M. Sadler, Jiawei Han

Figure 1 for HiExpan: Task-Guided Taxonomy Construction by Hierarchical Tree Expansion

Figure 2 for HiExpan: Task-Guided Taxonomy Construction by Hierarchical Tree Expansion

Figure 3 for HiExpan: Task-Guided Taxonomy Construction by Hierarchical Tree Expansion

Figure 4 for HiExpan: Task-Guided Taxonomy Construction by Hierarchical Tree Expansion

Abstract:Taxonomies are of great value to many knowledge-rich applications. As the manual taxonomy curation costs enormous human effects, automatic taxonomy construction is in great demand. However, most existing automatic taxonomy construction methods can only build hypernymy taxonomies wherein each edge is limited to expressing the "is-a" relation. Such a restriction limits their applicability to more diverse real-world tasks where the parent-child may carry different relations. In this paper, we aim to construct a task-guided taxonomy from a domain-specific corpus and allow users to input a "seed" taxonomy, serving as the task guidance. We propose an expansion-based taxonomy construction framework, namely HiExpan, which automatically generates key term list from the corpus and iteratively grows the seed taxonomy. Specifically, HiExpan views all children under each taxonomy node forming a coherent set and builds the taxonomy by recursively expanding all these sets. Furthermore, HiExpan incorporates a weakly-supervised relation extraction module to extract the initial children of a newly-expanded node and adjusts the taxonomy tree by optimizing its global structure. Our experiments on three real datasets from different domains demonstrate the effectiveness of HiExpan for building task-guided taxonomies.

* KDD 2018 accepted

Via

Access Paper or Ask Questions

SetExpan: Corpus-Based Set Expansion via Context Feature Selection and Rank Ensemble

Oct 17, 2019

Jiaming Shen, Zeqiu Wu, Dongming Lei, Jingbo Shang, Xiang Ren, Jiawei Han

Figure 1 for SetExpan: Corpus-Based Set Expansion via Context Feature Selection and Rank Ensemble

Figure 2 for SetExpan: Corpus-Based Set Expansion via Context Feature Selection and Rank Ensemble

Figure 3 for SetExpan: Corpus-Based Set Expansion via Context Feature Selection and Rank Ensemble

Figure 4 for SetExpan: Corpus-Based Set Expansion via Context Feature Selection and Rank Ensemble

Abstract:Corpus-based set expansion (i.e., finding the "complete" set of entities belonging to the same semantic class, based on a given corpus and a tiny set of seeds) is a critical task in knowledge discovery. It may facilitate numerous downstream applications, such as information extraction, taxonomy induction, question answering, and web search. To discover new entities in an expanded set, previous approaches either make one-time entity ranking based on distributional similarity, or resort to iterative pattern-based bootstrapping. The core challenge for these methods is how to deal with noisy context features derived from free-text corpora, which may lead to entity intrusion and semantic drifting. In this study, we propose a novel framework, SetExpan, which tackles this problem, with two techniques: (1) a context feature selection method that selects clean context features for calculating entity-entity distributional similarity, and (2) a ranking-based unsupervised ensemble method for expanding entity set based on denoised context features. Experiments on three datasets show that SetExpan is robust and outperforms previous state-of-the-art methods in terms of mean average precision.

* ECMLPKDD 2017 accepted

Via

Access Paper or Ask Questions