Clustering analysis for the evolutionary relationships of SARS-CoV-2 strains

© 2024. The Author(s)..

To explore the differences and relationships between the available SARS-CoV-2 strains and predict the potential evolutionary direction of these strains, we employ the hierarchical clustering analysis to investigate the evolutionary relationships between the SARS-CoV-2 strains utilizing the genomic sequences collected in China till January 7, 2023. We encode the sequences of the existing SARS-CoV-2 strains into numerical data through k-mer algorithm, then propose four methods to select the representative sample from each type of strains to comprise the dataset for clustering analysis. Three hierarchical clustering algorithms named Ward-Euclidean, Ward-Jaccard, and Average-Euclidean are introduced through combing the Euclidean and Jaccard distance with the Ward and Average linkage clustering algorithms embedded in the OriginPro software. Experimental results reveal that BF.28, BE.1.1.1, BA.5.3, and BA.5.6.4 strains exhibit distinct characteristics which are not observed in other types of SARS-CoV-2 strains, suggesting their being the majority potential sources which the future SARS-CoV-2 strains' evolution from. Moreover, BA.2.75, CH.1.1, BA.2, BA.5.1.3, BF.7, and B.1.1.214 strains demonstrate enhanced abilities in terms of immune evasion, transmissibility, and pathogenicity. Hence, closely monitoring the evolutionary trends of these strains is crucial to mitigate their impact on public health and society as far as possible.

Medienart:

E-Artikel

Erscheinungsjahr:

2024

Erschienen:

2024

Enthalten in:

Zur Gesamtaufnahme - volume:14

Enthalten in:

Scientific reports - 14(2024), 1 vom: 18. März, Seite 6428

Sprache:

Englisch

Beteiligte Personen:

Chen, Xiangzhong [VerfasserIn]
Wang, Mingzhao [VerfasserIn]
Liu, Xinglin [VerfasserIn]
Zhang, Wenjie [VerfasserIn]
Yan, Huan [VerfasserIn]
Lan, Xiang [VerfasserIn]
Xu, Yandi [VerfasserIn]
Tang, Sanyi [VerfasserIn]
Xie, Juanying [VerfasserIn]

Links:

Volltext

Themen:

Journal Article

Anmerkungen:

Date Completed 20.03.2024

Date Revised 23.03.2024

published: Electronic

Citation Status MEDLINE

doi:

10.1038/s41598-024-57001-5

funding:

Förderinstitution / Projekttitel:

PPN (Katalog-ID):

NLM369892828