A Distributional Approach to Open Questions in Market Research

Journal article
(Original article)


Publication Details

Author(s): Evert S, Greiner P, Baigger F, Lang B
Journal: Computers in Industry
Publication year: 2016
Volume: 78
Pages range: 16-28
ISSN: 0166-3615
Language: English


Abstract


Free-text responses to open questions are a rich and valuable resource in modern-day market research, but often pose problems for a traditional analysis, which requires prohibitively expensive manual coding of topic categories. The Klugator Engine (TKE) is a system for semi-automatic identification, exploration and visualization of topics and sentiment in large collections of such free-text responses or other short text fragments. The system utilizes state-of-the-art techniques of natural language processing and machine learning to transform textual input into a structured corpus, complemented by automatically determined polarity scores for individual responses. Statistical and distributional methods are then applied in order to identify semantic clusters of responses, label each topic cluster with a set of salient keywords, and evaluate the sentiment associated with the topic. This process can run in fully automated fashion, but it also offers the opportunity of interactive parameter tuning and refinement guided by the end user. Results are presented in a concise graphical visualization supported by detailed tables with numerical information. Embedded in RogTCS, the Rogator Text Clustering Solution, TKE enables customers to obtain a good overview of the main topics in a text collection comprising thousands of responses within 20 minutes of interactive exploration. An evaluation study based on a data set of more than 60,000 word tokens has shown good agreement with the topics identified by manual coding, rendering TKE a powerful tool for the analysis of unstructured textual data.



FAU Authors / FAU Editors

Evert, Stefan Prof. Dr.
Lehrstuhl für Korpus- und Computerlinguistik
Greiner, Paul
Lehrstuhl für Korpus- und Computerlinguistik


Research Fields

Corpus tools and language technology
Lehrstuhl für Korpus- und Computerlinguistik


How to cite

APA:
Evert, S., Greiner, P., Baigger, F., & Lang, B. (2016). A Distributional Approach to Open Questions in Market Research. Computers in Industry, 78, 16-28. https://dx.doi.org/10.1016/j.compind.2015.10.008

MLA:
Evert, Stefan, et al. "A Distributional Approach to Open Questions in Market Research." Computers in Industry 78 (2016): 16-28.

BibTeX: 

Last updated on 2018-23-12 at 16:10