Automatic speech analysis for detecting cognitive decline of older adults

BackgroundSpeech analysis has been expected to help as a screening tool for early detection of Alzheimer's disease (AD) and mild-cognitively impairment (MCI). Acoustic features and linguistic features are usually used in speech analysis. However, no studies have yet determined which type of fea...

Full description

Saved in:
Bibliographic Details
Main Authors: Lihe Huang (Author), Hao Yang (Author), Yiran Che (Author), Jingjing Yang (Author)
Format: Book
Published: Frontiers Media S.A., 2024-08-01T00:00:00Z.
Subjects:
Online Access:Connect to this object online.
Tags: Add Tag
No Tags, Be the first to tag this record!

MARC

LEADER 00000 am a22000003u 4500
001 doaj_0f7fc2d1f2d54d44b9f823faecc5e3dd
042 |a dc 
100 1 0 |a Lihe Huang  |e author 
700 1 0 |a Lihe Huang  |e author 
700 1 0 |a Hao Yang  |e author 
700 1 0 |a Hao Yang  |e author 
700 1 0 |a Yiran Che  |e author 
700 1 0 |a Yiran Che  |e author 
700 1 0 |a Jingjing Yang  |e author 
700 1 0 |a Jingjing Yang  |e author 
245 0 0 |a Automatic speech analysis for detecting cognitive decline of older adults 
260 |b Frontiers Media S.A.,   |c 2024-08-01T00:00:00Z. 
500 |a 2296-2565 
500 |a 10.3389/fpubh.2024.1417966 
520 |a BackgroundSpeech analysis has been expected to help as a screening tool for early detection of Alzheimer's disease (AD) and mild-cognitively impairment (MCI). Acoustic features and linguistic features are usually used in speech analysis. However, no studies have yet determined which type of features provides better screening effectiveness, especially in the large aging population of China.ObjectiveFirstly, to compare the screening effectiveness of acoustic features, linguistic features, and their combination using the same dataset. Secondly, to develop Chinese automated diagnosis model using self-collected natural discourse data obtained from native Chinese speakers.MethodsA total of 92 participants from communities in Shanghai, completed MoCA-B and a picture description task based on the Cookie Theft under the guidance of trained operators, and were divided into three groups including AD, MCI, and heathy control (HC) based on their MoCA-B score. Acoustic features (Pitches, Jitter, Shimmer, MFCCs, Formants) and linguistic features (part-of-speech, type-token ratio, information words, information units) are extracted. The machine algorithms used in this study included logistic regression, random forest (RF), support vector machines (SVM), Gaussian Naive Bayesian (GNB), and k-Nearest neighbor (kNN). The validation accuracies of the same ML model using acoustic features, linguistic features, and their combination were compared.ResultsThe accuracy with linguistic features is generally higher than acoustic features in training. The highest accuracy to differentiate HC and AD is 80.77% achieved by SVM, based on all the features extracted from the speech data, while the highest accuracy to differentiate HC and AD or MCI is 80.43% achieved by RF, based only on linguistic features.ConclusionOur results suggest the utility and validity of linguistic features in the automated diagnosis of cognitive impairment, and validated the applicability of automated diagnosis for Chinese language data. 
546 |a EN 
690 |a cognitive decline 
690 |a natural language processing 
690 |a machine learning 
690 |a automatic speech recognition 
690 |a language aging 
690 |a Public aspects of medicine 
690 |a RA1-1270 
655 7 |a article  |2 local 
786 0 |n Frontiers in Public Health, Vol 12 (2024) 
787 0 |n https://www.frontiersin.org/articles/10.3389/fpubh.2024.1417966/full 
787 0 |n https://doaj.org/toc/2296-2565 
856 4 1 |u https://doaj.org/article/0f7fc2d1f2d54d44b9f823faecc5e3dd  |z Connect to this object online.