Functional data analysis to characterize disease patterns in frequent longitudinal data : application to bacterial vaginal microbiota patterns using weekly Nugent scores and identification of pattern-specific risk factors
© 2023. The Author(s)..
BACKGROUND: Technology advancement has allowed more frequent monitoring of biomarkers. The resulting data structure entails more frequent follow-ups compared to traditional longitudinal studies where the number of follow-up is often small. Such data allow explorations of the role of intra-person variability in understanding disease etiology and characterizing disease processes. A specific example was to characterize pathogenesis of bacterial vaginosis (BV) using weekly vaginal microbiota Nugent assay scores collected over 2 years in post-menarcheeal women from Rakai, Uganda, and to identify risk factors for each vaginal microbiota pattern to inform epidemiological and etiological understanding of the pathogenesis of BV.
METHODS: We use a fully data-driven approach to characterize the longitudinal patters of vaginal microbiota by considering the densely sampled Nugent scores to be random functions over time and performing dimension reduction by functional principal components. Extending a current functional data clustering method, we use a hierarchical functional clustering framework considering multiple data features to help identify clinically meaningful patterns of vaginal microbiota fluctuations. Additionally, multinomial logistic regression was used to identify risk factors for each vaginal microbiota pattern to inform epidemiological and etiological understanding of the pathogenesis of BV.
RESULTS: Using weekly Nugent scores over 2 years of 211 sexually active and post-menarcheal women in Rakai, four patterns of vaginal microbiota variation were identified: persistent with a BV state (high Nugent scores), persistent with normal ranged Nugent scores, large fluctuation of Nugent scores which however are predominantly in the BV state; large fluctuation of Nugent scores but predominantly the scores are in the normal state. Higher Nugent score at the start of an interval, younger age group of less than 20 years, unprotected source for bathing water, a woman's partner's being not circumcised, use of injectable/Norplant hormonal contraceptives for family planning were associated with higher odds of persistent BV in women.
CONCLUSION: The hierarchical functional data clustering method can be used for fully data driven unsupervised clustering of densely sampled longitudinal data to identify clinically informative clusters and risk-factors associated with each cluster.
Medienart: |
E-Artikel |
---|
Erscheinungsjahr: |
2023 |
---|---|
Erschienen: |
2023 |
Enthalten in: |
Zur Gesamtaufnahme - volume:23 |
---|---|
Enthalten in: |
BMC medical research methodology - 23(2023), 1 vom: 26. Okt., Seite 251 |
Sprache: |
Englisch |
---|
Beteiligte Personen: |
Biswas, Rahul [VerfasserIn] |
---|
Links: |
---|
Anmerkungen: |
Date Completed 30.10.2023 Date Revised 10.02.2024 published: Electronic Citation Status MEDLINE |
---|
doi: |
10.1186/s12874-023-02063-8 |
---|
funding: |
|
---|---|
Förderinstitution / Projekttitel: |
|
PPN (Katalog-ID): |
NLM363769927 |
---|
LEADER | 01000caa a22002652 4500 | ||
---|---|---|---|
001 | NLM363769927 | ||
003 | DE-627 | ||
005 | 20240210232812.0 | ||
007 | cr uuu---uuuuu | ||
008 | 231226s2023 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.1186/s12874-023-02063-8 |2 doi | |
028 | 5 | 2 | |a pubmed24n1286.xml |
035 | |a (DE-627)NLM363769927 | ||
035 | |a (NLM)37884907 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Biswas, Rahul |e verfasserin |4 aut | |
245 | 1 | 0 | |a Functional data analysis to characterize disease patterns in frequent longitudinal data |b application to bacterial vaginal microbiota patterns using weekly Nugent scores and identification of pattern-specific risk factors |
264 | 1 | |c 2023 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a ƒaComputermedien |b c |2 rdamedia | ||
338 | |a ƒa Online-Ressource |b cr |2 rdacarrier | ||
500 | |a Date Completed 30.10.2023 | ||
500 | |a Date Revised 10.02.2024 | ||
500 | |a published: Electronic | ||
500 | |a Citation Status MEDLINE | ||
520 | |a © 2023. The Author(s). | ||
520 | |a BACKGROUND: Technology advancement has allowed more frequent monitoring of biomarkers. The resulting data structure entails more frequent follow-ups compared to traditional longitudinal studies where the number of follow-up is often small. Such data allow explorations of the role of intra-person variability in understanding disease etiology and characterizing disease processes. A specific example was to characterize pathogenesis of bacterial vaginosis (BV) using weekly vaginal microbiota Nugent assay scores collected over 2 years in post-menarcheeal women from Rakai, Uganda, and to identify risk factors for each vaginal microbiota pattern to inform epidemiological and etiological understanding of the pathogenesis of BV | ||
520 | |a METHODS: We use a fully data-driven approach to characterize the longitudinal patters of vaginal microbiota by considering the densely sampled Nugent scores to be random functions over time and performing dimension reduction by functional principal components. Extending a current functional data clustering method, we use a hierarchical functional clustering framework considering multiple data features to help identify clinically meaningful patterns of vaginal microbiota fluctuations. Additionally, multinomial logistic regression was used to identify risk factors for each vaginal microbiota pattern to inform epidemiological and etiological understanding of the pathogenesis of BV | ||
520 | |a RESULTS: Using weekly Nugent scores over 2 years of 211 sexually active and post-menarcheal women in Rakai, four patterns of vaginal microbiota variation were identified: persistent with a BV state (high Nugent scores), persistent with normal ranged Nugent scores, large fluctuation of Nugent scores which however are predominantly in the BV state; large fluctuation of Nugent scores but predominantly the scores are in the normal state. Higher Nugent score at the start of an interval, younger age group of less than 20 years, unprotected source for bathing water, a woman's partner's being not circumcised, use of injectable/Norplant hormonal contraceptives for family planning were associated with higher odds of persistent BV in women | ||
520 | |a CONCLUSION: The hierarchical functional data clustering method can be used for fully data driven unsupervised clustering of densely sampled longitudinal data to identify clinically informative clusters and risk-factors associated with each cluster | ||
650 | 4 | |a Journal Article | |
650 | 4 | |a Research Support, N.I.H., Extramural | |
650 | 4 | |a Research Support, Non-U.S. Gov't | |
650 | 4 | |a Functional data clustering | |
650 | 4 | |a Intra-person variability | |
650 | 4 | |a Longitudinal data analysis | |
650 | 4 | |a Unsupervised learning | |
650 | 4 | |a Vaginal flora | |
700 | 1 | |a Thoma, Marie |e verfasserin |4 aut | |
700 | 1 | |a Kong, Xiangrong |e verfasserin |4 aut | |
773 | 0 | 8 | |i Enthalten in |t BMC medical research methodology |d 2001 |g 23(2023), 1 vom: 26. Okt., Seite 251 |w (DE-627)NLM111431921 |x 1471-2288 |7 nnns |
773 | 1 | 8 | |g volume:23 |g year:2023 |g number:1 |g day:26 |g month:10 |g pages:251 |
856 | 4 | 0 | |u http://dx.doi.org/10.1186/s12874-023-02063-8 |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a GBV_NLM | ||
951 | |a AR | ||
952 | |d 23 |j 2023 |e 1 |b 26 |c 10 |h 251 |