A Distributional Approach to Open Questions in Market Research

Journal article

Publication details

Authors: Evert S, Greiner P, Baigger F, Lang B
Journal: Computers in Industry
Year of publication: 2016
Volume: 78
Pages: 16-28
ISSN: 0166-3615
Language: English


Free-text responses to open questions are a rich and valuable resource in modern-day market research, but often pose problems for a traditional analysis, which requires prohibitively expensive manual coding of topic categories. The Klugator Engine (TKE) is a system for semi-automatic identification, exploration and visualization of topics and sentiment in large collections of such free-text responses or other short text fragments. The system utilizes state-of-the-art techniques of natural language processing and machine learning to transform textual input into a structured corpus, complemented by automatically determined polarity scores for individual responses. Statistical and distributional methods are then applied in order to identify semantic clusters of responses, label each topic cluster with a set of salient keywords, and evaluate the sentiment associated with the topic. This process can run in fully automated fashion, but it also offers the opportunity of interactive parameter tuning and refinement guided by the end user. Results are presented in a concise graphical visualization supported by detailed tables with numerical information. Embedded in RogTCS, the Rogator Text Clustering Solution, TKE enables customers to obtain a good overview of the main topics in a text collection comprising thousands of responses within 20 minutes of interactive exploration. An evaluation study based on a data set of more than 60,000 word tokens has shown good agreement with the topics identified by manual coding, rendering TKE a powerful tool for the analysis of unstructured textual data.

FAU authors / FAU editors

Evert, Stefan Prof. Dr.
Lehrstuhl für Korpus- und Computerlinguistik
Greiner, Paul
Lehrstuhl für Korpus- und Computerlinguistik


Corpus tools and language technology applications
Lehrstuhl für Korpus- und Computerlinguistik


Evert, S., Greiner, P., Baigger, F., & Lang, B. (2016). A Distributional Approach to Open Questions in Market Research. Computers in Industry, 78, 16-28. https://dx.doi.org/10.1016/j.compind.2015.10.008

Evert, Stefan, et al. "A Distributional Approach to Open Questions in Market Research." Computers in Industry 78 (2016): 16-28.


Last updated 2018-12-23 at 16:10