Quality assessment and community detection methods for anonymized mobility data in the Italian Covid context
© 2024. The Author(s).
We discuss how to assess the reliability of partial, anonymized mobility data and compare two different methods to identify spatial communities based on movements: Greedy Modularity Clustering (GMC) and the novel Critical Variable Selection (CVS). These capture different aspects of mobility: direct population fluxes (GMC) and the probability for individuals to move between two nodes (CVS). As a test case, we consider movements of Italians before and during the SARS-CoV-2 pandemic, using Facebook users' data and publicly available information from the Italian National Institute of Statistics (Istat) to construct daily mobility networks at the interprovincial level. Using the Perron-Frobenius (PF) theorem, we show that the mean stochastic network has a stationary population density state comparable with data from Istat, and that this ceases to be the case if even a moderate amount of pruning is applied to the network. We then identify the first two national lockdowns through temporal clustering of the mobility networks, define two representative graphs for the lockdown and non-lockdown conditions, and perform optimal spatial community identification on both graphs using the GMC and CVS approaches. Despite the fundamental differences between the methods, the variation of information (VI) between their outputs shows that they return similar partitions of the Italian provincial networks in both situations. The information provided can be used to inform policy, for example, to define an optimal scale for lockdown measures. Our approach is general and can be applied to other countries or geographical scales.
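Two quantities named in the abstract can be illustrated with a small sketch: the stationary density of a row-stochastic mobility matrix, whose existence and uniqueness for an irreducible network is what the Perron-Frobenius theorem guarantees, and the variation of information between two partitions. This is not the authors' code. The function names and the three-node flow matrix are hypothetical, and power iteration is used here only as one simple way to reach the stationary state.

```python
import math
from collections import Counter


def row_normalize(flows):
    """Convert non-negative flow counts into a row-stochastic matrix."""
    matrix = []
    for row in flows:
        total = sum(row)
        if total <= 0:
            raise ValueError("every node needs at least one outgoing flow")
        matrix.append([x / total for x in row])
    return matrix


def stationary_density(P, tol=1e-12, max_iter=10_000):
    """Power iteration for the left eigenvector pi = pi P with sum(pi) = 1."""
    n = len(P)
    pi = [1.0 / n] * n
    for _ in range(max_iter):
        nxt = [sum(pi[i] * P[i][j] for i in range(n)) for j in range(n)]
        if max(abs(a - b) for a, b in zip(nxt, pi)) < tol:
            return nxt
        pi = nxt
    raise RuntimeError("power iteration did not converge")


def variation_of_information(labels_a, labels_b):
    """VI(A, B) = H(A|B) + H(B|A) in nats; 0 means identical partitions."""
    n = len(labels_a)
    if n == 0 or n != len(labels_b):
        raise ValueError("label lists must be non-empty and of equal length")
    count_a = Counter(labels_a)
    count_b = Counter(labels_b)
    joint = Counter(zip(labels_a, labels_b))
    vi = 0.0
    for (a, b), n_ab in joint.items():
        p_ab = n_ab / n
        vi -= p_ab * (math.log(n_ab / count_a[a]) + math.log(n_ab / count_b[b]))
    return vi


# Hypothetical toy flows between three nodes (rows: origin, columns: destination).
toy_flows = [[8, 2, 0], [1, 6, 3], [0, 3, 7]]
toy_density = stationary_density(row_normalize(toy_flows))
```

For the toy matrix, the stationary density works out to 0.2, 0.4 and 0.4. In a setting like the one described in the abstract, such a vector would be compared against census population shares, and a VI close to zero between two community assignments would indicate that the two methods partition the nodes similarly.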
Field | Value |
---|---|
Media type | E-article |
Year of publication | 2024 |
Published | 2024 |
Contained in | To the parent record - volume:14 |
Contained in | Scientific reports - 14(2024), 1, 26 Feb., page 4636 |
Language | English |
Contributors | Morand, Jules [author] |
Notes | Date Completed 28.02.2024; Date Revised 25.03.2024; published: Electronic; Citation Status MEDLINE |
DOI | 10.1038/s41598-024-54878-0 |
PPN (catalog ID) | NLM368993701 |
LEADER | 01000caa a22002652 4500 | ||
---|---|---|---|
001 | NLM368993701 | ||
003 | DE-627 | ||
005 | 20240326235551.0 | ||
007 | cr uuu---uuuuu | ||
008 | 240229s2024 xx |||||o 00| ||eng c | ||
024 | 7 | |a 10.1038/s41598-024-54878-0 |2 doi | |
028 | 5 | 2 | |a pubmed24n1348.xml |
035 | |a (DE-627)NLM368993701 | ||
035 | |a (NLM)38409411 | ||
040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
041 | |a eng | ||
100 | 1 | |a Morand, Jules |e verfasserin |4 aut | |
245 | 1 | 0 | |a Quality assessment and community detection methods for anonymized mobility data in the Italian Covid context |
264 | 1 | |c 2024 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a Computermedien |b c |2 rdamedia | ||
338 | |a Online-Ressource |b cr |2 rdacarrier | ||
500 | |a Date Completed 28.02.2024 | ||
500 | |a Date Revised 25.03.2024 | ||
500 | |a published: Electronic | ||
500 | |a Citation Status MEDLINE | ||
520 | |a © 2024. The Author(s). | ||
520 | |a We discuss how to assess the reliability of partial, anonymized mobility data and compare two different methods to identify spatial communities based on movements: Greedy Modularity Clustering (GMC) and the novel Critical Variable Selection (CVS). These capture different aspects of mobility: direct population fluxes (GMC) and the probability for individuals to move between two nodes (CVS). As a test case, we consider movements of Italians before and during the SARS-Cov2 pandemic, using Facebook users' data and publicly available information from the Italian National Institute of Statistics (Istat) to construct daily mobility networks at the interprovincial level. Using the Perron-Frobenius (PF) theorem, we show how the mean stochastic network has a stationary population density state comparable with data from Istat, and how this ceases to be the case if even a moderate amount of pruning is applied to the network. We then identify the first two national lockdowns through temporal clustering of the mobility networks, define two representative graphs for the lockdown and non-lockdown conditions and perform optimal spatial community identification on both graphs using the GMC and CVS approaches. Despite the fundamental differences in the methods, the variation of information (VI) between them assesses that they return similar partitions of the Italian provincial networks in both situations. The information provided can be used to inform policy, for example, to define an optimal scale for lockdown measures. Our approach is general and can be applied to other countries or geographical scales | ||
650 | 4 | |a Comparative Study | |
650 | 4 | |a Journal Article | |
650 | 7 | |a RNA, Viral |2 NLM | |
700 | 1 | |a Yip, Shoichi |e verfasserin |4 aut | |
700 | 1 | |a Velegrakis, Yannis |e verfasserin |4 aut | |
700 | 1 | |a Lattanzi, Gianluca |e verfasserin |4 aut | |
700 | 1 | |a Potestio, Raffaello |e verfasserin |4 aut | |
700 | 1 | |a Tubiana, Luca |e verfasserin |4 aut | |
773 | 0 | 8 | |i Enthalten in |t Scientific reports |d 2011 |g 14(2024), 1 vom: 26. Feb., Seite 4636 |w (DE-627)NLM215703936 |x 2045-2322 |7 nnns |
773 | 1 | 8 | |g volume:14 |g year:2024 |g number:1 |g day:26 |g month:02 |g pages:4636 |
856 | 4 | 0 | |u http://dx.doi.org/10.1038/s41598-024-54878-0 |3 Volltext |
912 | |a GBV_USEFLAG_A | ||
912 | |a GBV_NLM | ||
951 | |a AR | ||
952 | |d 14 |j 2024 |e 1 |b 26 |c 02 |h 4636 |