Using cluster analysis to reconstruct dengue exposure patterns from cross-sectional serological studies in Singapore

Background Dengue is a mosquito-borne viral disease caused by one of four serotypes (DENV1-4). Infection provides long-term homologous immunity against reinfection with the same serotype. Plaque reduction neutralization test (PRNT) is the gold standard to assess serotype-specific antibody levels. We analysed serotype-specific antibody levels obtained by PRNT in two serological surveys conducted in Singapore in 2009 and 2013 using cluster analysis, a machine learning technique that was used to identify the most common histories of DENV exposure. Methods We explored the use of five distinct clustering methods (i.e. agglomerative hierarchical, divisive hierarchical, K-means, K-medoids and model-based clustering) with varying number (from 4 to 10) of clusters for each method. Weighted rank aggregation, an evaluating technique for a set of internal validity metrics, was adopted to determine the optimal algorithm, comprising the optimal clustering method and the optimal number of clusters. Results The K-means algorithm with six clusters was selected as the algorithm with the highest weighted rank aggregation. The six clusters were characterised by (i) dominant DENV2 PRNT titres; (ii) co-dominant DENV1 and DENV2 titres with average DENV2 titre > average DENV1 titre; (iii) co-dominant DENV1 and DENV2 titres with average DENV1 titre > average DENV2 titre; (iv) low PRNT titres against DENV1-4; (v) intermediate PRNT titres against DENV1-4; and (vi) dominant DENV1-3 titres. Analyses of the relative size and age-stratification of the clusters by year of sample collection and the application of cluster analysis to the 2009 and 2013 datasets considered separately revealed the epidemic circulation of DENV2 and DENV3 between 2009 and 2013. Conclusion Cluster analysis is an unsupervised machine learning technique that can be applied to analyse PRNT antibody titres (without pre-established cut-off thresholds to indicate protection) to explore common patterns of DENV infection and infer the likely history of dengue exposure in a population..

Medienart:

E-Artikel

Erscheinungsjahr:

2020

Erschienen:

2020

Enthalten in:

Zur Gesamtaufnahme - volume:13

Enthalten in:

Parasites & vectors - 13(2020), 1 vom: 17. Jan.

Sprache:

Englisch

Beteiligte Personen:

Sangkaew, Sorawat [VerfasserIn]
Tan, Li Kiang [VerfasserIn]
Ng, Lee Ching [VerfasserIn]
Ferguson, Neil M. [VerfasserIn]
Dorigatti, Ilaria [VerfasserIn]

Links:

Volltext [kostenfrei]

Themen:

Cluster analysis
Dengue exposures
Serological survey

Anmerkungen:

© The Author(s) 2020

doi:

10.1186/s13071-020-3898-5

funding:

Förderinstitution / Projekttitel:

PPN (Katalog-ID):

SPR030467101