Topics as Entity Clusters: Entity-based Topics from Language Models and Graph Neural Networks

Jan 06, 2023

Topic models aim to reveal the latent structure behind a corpus and are typically applied to a bag-of-words representation of documents. In this setting, most of the vocabulary is either irrelevant for uncovering the underlying topics or strongly related to relevant concepts, which hurts the interpretability of the resulting topics. Furthermore, the limited expressiveness of such models and their dependency on language demand considerable computational resources. Hence, we propose a novel approach to cluster-based topic modeling that employs conceptual entities. Entities are language-agnostic representations of real-world concepts that are rich in relational information. To this end, we extract vector representations of entities from (i) an encyclopedic corpus using a language model and (ii) a knowledge base using a graph neural network. We demonstrate that our approach consistently outperforms other state-of-the-art topic models across coherence metrics, and we find that the explicit knowledge encoded in the graph-based embeddings yields more coherent topics than the implicit knowledge encoded in the contextualized embeddings of language models.
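
As a rough illustration of what cluster-based topic modeling over entities could look like, the sketch below groups a handful of entity vectors into clusters and then describes a document by the share of its entities that falls into each cluster. Everything in it is an assumption made for illustration: the entity names, the two-dimensional vectors, the use of plain k-means, and the helper names `cluster_entities` and `document_topics` are hypothetical and are not taken from the paper, whose embeddings would come from a language model or a graph neural network.

```python
import math

# Hypothetical toy embeddings; real entity vectors would be produced by a
# language model or a graph neural network and have many more dimensions.
ENTITY_VECTORS = {
    "football": (0.90, 0.10),
    "goalkeeper": (0.80, 0.20),
    "stadium": (0.85, 0.05),
    "election": (0.10, 0.90),
    "parliament": (0.20, 0.80),
    "senator": (0.05, 0.95),
}


def cluster_entities(vectors, k, iterations=10):
    """Plain k-means (an assumed stand-in for the clustering step)."""
    names = list(vectors)
    # Deterministic initialisation: evenly spaced entities serve as seeds.
    centroids = [vectors[names[i * len(names) // k]] for i in range(k)]
    assignment = {}
    for _ in range(iterations):
        # Assign every entity to its nearest centroid.
        for name in names:
            assignment[name] = min(
                range(k), key=lambda c: math.dist(vectors[name], centroids[c])
            )
        # Move each centroid to the mean of its members.
        for c in range(k):
            members = [vectors[n] for n in names if assignment[n] == c]
            if members:
                centroids[c] = tuple(sum(dim) / len(members) for dim in zip(*members))
    return assignment


def document_topics(doc_entities, assignment, k):
    """Share of a document's known entities that falls into each cluster."""
    counts = [0] * k
    for entity in doc_entities:
        if entity in assignment:
            counts[assignment[entity]] += 1
    total = sum(counts)
    return [count / total for count in counts] if total else counts


assignment = cluster_entities(ENTITY_VECTORS, k=2)
topic_shares = document_topics(["football", "stadium", "senator"], assignment, k=2)
```

In this toy setup each cluster plays the role of a topic, and because documents are reduced to entity identifiers rather than words, the same clusters could in principle be applied to text in any language once its entities have been linked.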

* 12 pages, 1 figure  