Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Keyword Assisted Embedded Topic Model

Nov 22, 2021

Bahareh Harandizadeh, J. Hunter Priniski, Fred Morstatter

Figure 1 for Keyword Assisted Embedded Topic Model

Figure 2 for Keyword Assisted Embedded Topic Model

Figure 3 for Keyword Assisted Embedded Topic Model

Figure 4 for Keyword Assisted Embedded Topic Model

Share this with someone who'll enjoy it:

Abstract:By illuminating latent structures in a corpus of text, topic models are an essential tool for categorizing, summarizing, and exploring large collections of documents. Probabilistic topic models, such as latent Dirichlet allocation (LDA), describe how words in documents are generated via a set of latent distributions called topics. Recently, the Embedded Topic Model (ETM) has extended LDA to utilize the semantic information in word embeddings to derive semantically richer topics. As LDA and its extensions are unsupervised models, they aren't defined to make efficient use of a user's prior knowledge of the domain. To this end, we propose the Keyword Assisted Embedded Topic Model (KeyETM), which equips ETM with the ability to incorporate user knowledge in the form of informative topic-level priors over the vocabulary. Using both quantitative metrics and human responses on a topic intrusion task, we demonstrate that KeyETM produces better topics than other guided, generative models in the literature.

* 8 pages, 5 figures, WSDM 2022 Conference

View paper on

Share this with someone who'll enjoy it:

Title:Keyword Assisted Embedded Topic Model

Paper and Code