The machine learning and geostatistical approach for assessment of arsenic contamination levels using physicochemical properties of water

Arsenic contamination in groundwater due to natural or anthropogenic sources is responsible for carcinogenic and non-carcinogenic risks to humans and the ecosystem. The physicochemical properties of groundwater in the study area were determined in the laboratory using the samples collected across the Varanasi region of Uttar Pradesh, India. This paper analyses the physicochemical properties of water using machine learning, descriptive statistics, geostatistical and spatial analysis. Pearson correlation was used for feature selection and highly correlated features were selected for model creation. Hydrochemical facies of the study area were analyzed and the hyperparameters of machine learning models, i.e., multilayer perceptron, random forest (RF), naïve Bayes, and decision tree were optimized before training and testing the groundwater samples as high (1) or low (0) arsenic contamination levels based on the WHO 10 μg/L guideline value. The overall performance of the models was compared based on accuracy, sensitivity, and specificity value. Among all models, the RF algorithm outclasses other classifiers, as it has a high accuracy of 92.30%, a sensitivity of 100%, and a specificity of 75%. The accuracy result was compared to prior research, and the machine learning model may be used to continually monitor the amount of arsenic pollution in groundwater.

Medienart:

E-Artikel

Erscheinungsjahr:

2023

Erschienen:

2023

Enthalten in:

Zur Gesamtaufnahme - volume:88

Enthalten in:

Water science and technology : a journal of the International Association on Water Pollution Research - 88(2023), 3 vom: 24. Aug., Seite 595-614

Sprache:

Englisch

Beteiligte Personen:

Chattopadhyay, Arghya [VerfasserIn]
Singh, Anand Prakash [VerfasserIn]
Kumar, Siddharth [VerfasserIn]
Pati, Jayadeep [VerfasserIn]
Rakshit, Amitava [VerfasserIn]

Links:

Volltext

Themen:

Journal Article

Anmerkungen:

Date Revised 14.08.2023

published: Print

Citation Status Publisher

doi:

10.2166/wst.2023.231

funding:

Förderinstitution / Projekttitel:

PPN (Katalog-ID):

NLM360782639