Development of an algorithm for finding pertussis episodes in a population-based electronic health record database
While tetanus-diphtheria-acellular pertussis (Tdap) vaccines for adolescents and adults were licensed in 2005 and immunization strategies proposed, the burden of pertussis in this population remains under-recognized mainly due to atypical disease presentation, undermining efforts to optimize protect...
Saved in:
Main Authors: | , , , , , , |
---|---|
Format: | Book |
Published: |
Taylor & Francis Group,
2023-01-01T00:00:00Z.
|
Subjects: | |
Online Access: | Connect to this object online. |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
MARC
LEADER | 00000 am a22000003u 4500 | ||
---|---|---|---|
001 | doaj_3f446fb9f0e24e01b9e0e9978bc7a6d3 | ||
042 | |a dc | ||
100 | 1 | 0 | |a Chathuri Daluwatte |e author |
700 | 1 | 0 | |a Maryia Dvaretskaya |e author |
700 | 1 | 0 | |a Sam Ekhtiari |e author |
700 | 1 | 0 | |a Paul Hayat |e author |
700 | 1 | 0 | |a Martin Montmerle |e author |
700 | 1 | 0 | |a Sachin Mathur |e author |
700 | 1 | 0 | |a Denis Macina |e author |
245 | 0 | 0 | |a Development of an algorithm for finding pertussis episodes in a population-based electronic health record database |
260 | |b Taylor & Francis Group, |c 2023-01-01T00:00:00Z. | ||
500 | |a 2164-5515 | ||
500 | |a 2164-554X | ||
500 | |a 10.1080/21645515.2023.2209455 | ||
520 | |a While tetanus-diphtheria-acellular pertussis (Tdap) vaccines for adolescents and adults were licensed in 2005 and immunization strategies proposed, the burden of pertussis in this population remains under-recognized mainly due to atypical disease presentation, undermining efforts to optimize protection through vaccination. We developed a machine learning algorithm to identify undiagnosed/misdiagnosed pertussis episodes in patients diagnosed with acute respiratory disease (ARD) using signs, diseases and symptoms from clinician notes and demographic information within electronic health-care records (Optum Humedica repository [2007-2019]). We used two patient cohorts aged ≥11 years to develop the model: a positive pertussis cohort (4,515 episodes in 4,316 patients) and a negative pertussis (ARD) cohort (4,573,445 episodes and patients), defined using ICD 9/10 codes. To improve contrast between positive pertussis and negative pertussis (ARD) episodes, only episodes with ≥7 symptoms were selected. LightGBM was used as the machine learning model for pertussis episode identification. Model validity was determined using laboratory-confirmed pertussis positive and negative cohorts. Model explainability was obtained using the Shapley additive explanations method. The predictive performance was as follows: area under the precision-recall curve, 0.24 (SD, 7 × 10−3); recall, 0.72 (SD, 4 × 10−3); precision, 0.012 (SD, 1 × 10−3); and specificity, 0.94 (SD, 7 × 10−3). The model applied to laboratory-confirmed positive and negative pertussis episodes had a specificity of 0.846. Predictive probability for pertussis increased with presence of whooping cough, whoop, and post-tussive vomiting in clinician notes, but decreased with gastrointestinal bleeding, sepsis, pulmonary symptoms, and fever. In conclusion, machine learning can help identify pertussis episodes among those diagnosed with ARD. | ||
546 | |a EN | ||
690 | |a algorithms | ||
690 | |a bordetella pertussis | ||
690 | |a diagnosis | ||
690 | |a electronic health record | ||
690 | |a machine learning | ||
690 | |a predictive modeling | ||
690 | |a Immunologic diseases. Allergy | ||
690 | |a RC581-607 | ||
690 | |a Therapeutics. Pharmacology | ||
690 | |a RM1-950 | ||
655 | 7 | |a article |2 local | |
786 | 0 | |n Human Vaccines & Immunotherapeutics, Vol 19, Iss 1 (2023) | |
787 | 0 | |n http://dx.doi.org/10.1080/21645515.2023.2209455 | |
787 | 0 | |n https://doaj.org/toc/2164-5515 | |
787 | 0 | |n https://doaj.org/toc/2164-554X | |
856 | 4 | 1 | |u https://doaj.org/article/3f446fb9f0e24e01b9e0e9978bc7a6d3 |z Connect to this object online. |