Alzheimer's Detection from English to Spanish Using Acoustic and Linguistic Embeddings

Pérez Toro PA, Klumpp P, Hernandez A, Arias Vergara T, Lillo P, Slachevsky A, García AM, Schuster M, Maier A, Nöth E, Orozco Arroyave JR (2022)


Publication Type: Conference contribution

Publication year: 2022

Publisher: International Speech Communication Association

Book Volume: 2022-September

Page Range: 2483-2487

Conference Proceedings Title: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Event location: Incheon, KR

DOI: 10.21437/Interspeech.2022-10883

Abstract

Cross-lingual approaches are growing in popularity in machine learning, where large amounts of data are required to generalize well. A major obstacle is the limited availability of clinical speech data, most of which is in English. For instance, few Alzheimer's Disease (AD) corpora in other languages can be found in the literature. Despite the phonological and phonemic differences between Spanish and English, the two languages also share similarities, e.g., around 40% of all words in English have a related word in Spanish. In this work, we investigate the feasibility of combining information from English and Spanish to discriminate AD. Two datasets were considered: part of the Pitt Corpus, which is composed of English speakers, and a Spanish AD dataset composed of speakers from Chile. We based our analysis on well-known acoustic (Wav2Vec) and word (BERT, RoBERTa) embeddings using different classifiers. Strong language dependencies were found, even when using multilingual representations. We observed that linguistic information was more important for classifying English AD (F-score = 0.76) and acoustic information for Spanish AD (F-score = 0.80). Using knowledge transferred from English to Spanish achieved F-scores of up to 0.85 for discriminating AD.

How to cite

APA:

Pérez Toro, P.A., Klumpp, P., Hernandez, A., Arias Vergara, T., Lillo, P., Slachevsky, A., ... Orozco Arroyave, J.R. (2022). Alzheimer's Detection from English to Spanish Using Acoustic and Linguistic Embeddings. In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH (pp. 2483-2487). Incheon, KR: International Speech Communication Association.

MLA:

Pérez Toro, Paula Andrea, et al. "Alzheimer's Detection from English to Spanish Using Acoustic and Linguistic Embeddings." Proceedings of the 23rd Annual Conference of the International Speech Communication Association, INTERSPEECH 2022, Incheon: International Speech Communication Association, 2022. 2483-2487.