Econ. Environ. Geol. 2022; 55(2): 127-135
Published online April 30, 2022
https://doi.org/10.9719/EEG.2022.55.2.127
© THE KOREAN SOCIETY OF ECONOMIC AND ENVIRONMENTAL GEOLOGY
Correspondence to : *Corresponding author : kwkim@gist.ac.kr
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided original work is properly cited.
Excessive presence of As level in groundwater is a major health problem worldwide. In the Red River Delta in Vietnam, several million residents possess a high risk of chronic As poisoning. The As releases into groundwater caused by natural process through microbially-driven reductive dissolution of Fe (III) oxides. It has been extracted by Red River residents using private tube wells for drinking and daily purposes because of their unawareness of the contamination. This long-term consumption of As-contaminated groundwater could lead to various health problems. Therefore, a predictive model would be useful to expose contamination risks of the wells in the Red River Delta Vietnam area. This study used four machine learning algorithms to predict the As probability of study sites in Red River Delta, Vietnam. The GBM was the best performing model with the accuracy, precision, sensitivity, and specificity of 98.7%, 100%, 95.2%, and 100%, respectively. In addition, it resulted the highest AUC of 92% and 96% for the PRC and ROC curves, with Eh and Fe as the most important variables. The partial dependence plot of As concentration on the model parameters showed that the probability of high level of As is related to the low number of wells’ depth, Eh, and SO4, along with high PO43− and NH4+. This condition triggers the reductive dissolution of iron phases, thus releasing As into groundwater.
Keywords groundwater arsenic, machine learning, predictive model, random forest, gradient boosting
Econ. Environ. Geol. 2022; 55(2): 127-135
Published online April 30, 2022 https://doi.org/10.9719/EEG.2022.55.2.127
Copyright © THE KOREAN SOCIETY OF ECONOMIC AND ENVIRONMENTAL GEOLOGY.
Zheina J. Ottong1, Reta L. Puspasari1, Daeung Yoon2, Kyoung-Woong Kim1,*
1School of Earth Sciences and Environmental Engineering, Gwangju Institute of Science and Technology (GIST), Gwangju 61005, South Korea
2Chonnam National University, Gwangju 61186, South Korea
Correspondence to:*Corresponding author : kwkim@gist.ac.kr
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided original work is properly cited.
Excessive presence of As level in groundwater is a major health problem worldwide. In the Red River Delta in Vietnam, several million residents possess a high risk of chronic As poisoning. The As releases into groundwater caused by natural process through microbially-driven reductive dissolution of Fe (III) oxides. It has been extracted by Red River residents using private tube wells for drinking and daily purposes because of their unawareness of the contamination. This long-term consumption of As-contaminated groundwater could lead to various health problems. Therefore, a predictive model would be useful to expose contamination risks of the wells in the Red River Delta Vietnam area. This study used four machine learning algorithms to predict the As probability of study sites in Red River Delta, Vietnam. The GBM was the best performing model with the accuracy, precision, sensitivity, and specificity of 98.7%, 100%, 95.2%, and 100%, respectively. In addition, it resulted the highest AUC of 92% and 96% for the PRC and ROC curves, with Eh and Fe as the most important variables. The partial dependence plot of As concentration on the model parameters showed that the probability of high level of As is related to the low number of wells’ depth, Eh, and SO4, along with high PO43− and NH4+. This condition triggers the reductive dissolution of iron phases, thus releasing As into groundwater.
Keywords groundwater arsenic, machine learning, predictive model, random forest, gradient boosting
Table 1 . Selection of the best hyperparameter from the confusion matrix to predict the test set..
TN | FP | FN | TP | |
---|---|---|---|---|
LR | 110 | 2 | 11 | 31 |
RF | 111 | 1 | 8 | 34 |
GBM | 112 | 0 | 2 | 40 |
XGB | 111 | 1 | 10 | 32 |
TN: True Negative, FP: False Positive, FN: False Negative, TP: True Positive.
Table 2 . Accuracy, precision, sensitivity, and specificity of GBM and RF in percent (%).
Accuracy | Precision | Sensitivity | Specificity | |
---|---|---|---|---|
GBM | 98.7 | 100 | 95.2 | 100 |
RF | 94.1 | 97.1 | 80.9 | 99.1 |
Kalaivanan K, Vellingiri J
Econ. Environ. Geol. 2024; 57(3): 329-342Ju Young Park, Sun Young Park, Jiyoung Choi, Sungil Kim, Yuri Kim, Bo Yeon Yi, Kyungbook Lee
Econ. Environ. Geol. 2024; 57(5): 529-537Kyoungeun Lee, Jaehyung Yu, Chanhyeok Park, Trung Hieu Pham
Econ. Environ. Geol. 2024; 57(4): 353-362