Examining Analytic Practices in Latent Dirichlet Allocation Within Psychological Science: Scoping Review

Background: Topic modeling approaches allow researchers to analyze and represent written texts. One of the commonly used approaches in psychology is latent Dirichlet allocation (LDA), which is used for rapidly synthesizing patterns of text within "big data," but outputs can be sensitive to d...

Bibliographic Details
Main Authors: Lauryn J Hagg (Author), Stephanie S Merkouris (Author), Gypsy A O'Dea (Author), Lauren M Francis (Author), Christopher J Greenwood (Author), Matthew Fuller-Tyszkiewicz (Author), Elizabeth M Westrupp (Author), Jacqui A Macdonald (Author), George J Youssef (Author)
Format: Book
Published: JMIR Publications, 2022-11-01T00:00:00Z.
Subjects:
Online Access: Connect to this object online.

MARC

LEADER 00000 am a22000003u 4500
001 doaj_9d48bd9840e14c0581b8d480f77a4b7d
042 |a dc 
100 1 0 |a Lauryn J Hagg  |e author 
700 1 0 |a Stephanie S Merkouris  |e author 
700 1 0 |a Gypsy A O'Dea  |e author 
700 1 0 |a Lauren M Francis  |e author 
700 1 0 |a Christopher J Greenwood  |e author 
700 1 0 |a Matthew Fuller-Tyszkiewicz  |e author 
700 1 0 |a Elizabeth M Westrupp  |e author 
700 1 0 |a Jacqui A Macdonald  |e author 
700 1 0 |a George J Youssef  |e author 
245 0 0 |a Examining Analytic Practices in Latent Dirichlet Allocation Within Psychological Science: Scoping Review 
260 |b JMIR Publications,   |c 2022-11-01T00:00:00Z. 
500 |a 1438-8871 
500 |a 10.2196/33166 
520 |a Background: Topic modeling approaches allow researchers to analyze and represent written texts. One of the commonly used approaches in psychology is latent Dirichlet allocation (LDA), which is used for rapidly synthesizing patterns of text within "big data," but outputs can be sensitive to decisions made during the analytic pipeline and may not be suitable for certain scenarios such as short texts, and we highlight resources for alternative approaches. This review focuses on the complex analytical practices specific to LDA, which existing practical guides for training LDA models have not addressed. Objective: This scoping review used key analytical steps (data selection, data preprocessing, and data analysis) as a framework to understand the methodological approaches being used in psychology research using LDA. Methods: A total of 4 psychology and health databases were searched. Studies were included if they used LDA to analyze written words and focused on a psychological construct or issue. The data charting processes were constructed and employed based on common data selection, preprocessing, and data analysis steps. Results: A total of 68 studies were included. These studies explored a range of research areas and mostly sourced their data from social media platforms. Although some studies reported on preprocessing and data analysis steps taken, most studies did not provide sufficient detail for reproducibility. Furthermore, the debate surrounding the necessity of certain preprocessing and data analysis steps is revealed. Conclusions: Our findings highlight the growing use of LDA in psychological science. However, there is a need to improve analytical reporting standards and identify comprehensive and evidence-based best practice recommendations. To work toward this, we developed an LDA Preferred Reporting Checklist that will allow for consistent documentation of LDA analytic decisions and reproducible research outcomes. 
546 |a EN 
690 |a Computer applications to medicine. Medical informatics 
690 |a R858-859.7 
690 |a Public aspects of medicine 
690 |a RA1-1270 
655 7 |a article  |2 local 
786 0 |n Journal of Medical Internet Research, Vol 24, Iss 11, p e33166 (2022) 
787 0 |n https://www.jmir.org/2022/11/e33166 
787 0 |n https://doaj.org/toc/1438-8871 
856 4 1 |u https://doaj.org/article/9d48bd9840e14c0581b8d480f77a4b7d  |z Connect to this object online.