Kölbl L, Grottke M (2020)
Publication Type: Book chapter / Article in edited volumes
Publication year: 2020
Publisher: Springer
Edited Volumes: Reliability and Statistical Computing
Series: Springer Series in Reliability Engineering
Pages Range: 193-206
DOI: 10.1007/978-3-030-43412-0_12
With topic modeling methods, such as Latent Dirichlet Allocation (LDA), we can find topics in large text collections. To efficiently employ this information, there is a need for a method that automatically analyzes the topics with respect to their usefulness for applications like the detection of new innovations. This paper presents a novel method to automatically evaluate topics produced by LDA. The new approach puts the focus on finding topics with topic words that are not only coherent, but also specific. By using the documents associated with each word to calculate background topics, a baseline can be set for each topic word that helps assess whether its context fits the topic well. Experiments indicate that the resulting topics are more manageable in terms of their interpretability. Moreover, we show that the approach can be used to detect weak signals.
APA:
Kölbl, L., & Grottke, M. (2020). Obtaining More Specific Topics and Detecting Weak Signals by Topic Word Selection. In Hoang Pham (Eds.), Reliability and Statistical Computing. (pp. 193-206). Springer.
MLA:
Kölbl, Laura, and Michael Grottke. "Obtaining More Specific Topics and Detecting Weak Signals by Topic Word Selection." Reliability and Statistical Computing. Ed. Hoang Pham, Springer, 2020. 193-206.
BibTeX: Download