The potential of new ensemble machine learning models for effluent quality parameters prediction and related uncertainty
Biochemical oxygen demand
Predictive modelling
Ensemble Learning
DOI: 10.1016/j.psep.2020.04.045
Publication Date: 2020-05-04T20:21:33Z
AUTHORS (3)
ABSTRACT
Accurate simulation of wastewater effluent parameters is vital for reducing the operational costs of a wastewater treatment plant. A reliable predictive model is therefore necessary to achieve acceptable performance. This study presents a novel approach to predicting the effluent quality parameters of an industrial wastewater treatment plant in Qom province, Iran. Three new ensemble machine learning models, Ada Boost Regression (ABR), Gradient Boost Regression (GBR) and Random Forest Regression (RFR), are used to predict the effluent quality parameters Total Dissolved Solids (TDS), five-day Biochemical Oxygen Demand (BOD5), and Chemical Oxygen Demand (COD) at a daily scale. The gamma test technique is used to select the optimal predictive variables. The accuracy of the predictive models is assessed with several metric indices and visual performance indicators. Results show that the ABR model provides the best performance for predicting TDS (CC = 0.962, RMSE = 30.3 mg/l), while the GBR model offers better accuracy for simulating BOD5 (CC = 0.9, RMSE = 4.6 mg/l) and COD (CC = 0.75, RMSE = 9.6 mg/l). The uncertainty analysis indicates that the prediction results are more sensitive to model structure (R-factor = 0.52 for TDS, 0.89 for BOD and 1.06 for COD) than to the input variables (R-factor = 0.21 for TDS, 0.67 for BOD and 0.62 for COD).
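The abstract reports three kinds of figures: the correlation coefficient (CC), the root mean square error (RMSE) and the R-factor. The following is a minimal Python sketch of how such indices are commonly computed, not the authors' implementation. The function names and sample values are hypothetical. The R-factor definition used here (mean width of a prediction band divided by the standard deviation of the observations) is an assumption based on common usage in uncertainty analysis, since the abstract does not define it.

```python
import math
import statistics


def correlation_coefficient(observed, predicted):
    """Pearson correlation coefficient (CC) between two equal-length series."""
    mean_o = sum(observed) / len(observed)
    mean_p = sum(predicted) / len(predicted)
    cov = sum((o - mean_o) * (p - mean_p) for o, p in zip(observed, predicted))
    var_o = sum((o - mean_o) ** 2 for o in observed)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    return cov / math.sqrt(var_o * var_p)


def rmse(observed, predicted):
    """Root mean square error, in the same units as the data (e.g. mg/l)."""
    squared_errors = [(o - p) ** 2 for o, p in zip(observed, predicted)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


def r_factor(lower, upper, observed):
    """Assumed definition: mean prediction-band width / sample std of observations."""
    widths = [u - l for l, u in zip(lower, upper)]
    return (sum(widths) / len(widths)) / statistics.stdev(observed)


# Hypothetical daily values, for illustration only (not data from the study).
observed = [10.0, 12.0, 14.0, 16.0, 18.0]
predicted = [11.0, 12.0, 13.0, 17.0, 18.0]
lower = [p - 1.0 for p in predicted]  # assumed lower bound of the prediction band
upper = [p + 1.0 for p in predicted]  # assumed upper bound of the prediction band

print("CC =", round(correlation_coefficient(observed, predicted), 3))
print("RMSE =", round(rmse(observed, predicted), 3))
print("R-factor =", round(r_factor(lower, upper, observed), 3))
```

Under this definition, a smaller R-factor means a narrower prediction band relative to the spread of the observations. This matches the abstract's reading that the larger R-factors for model structure indicate the greater source of uncertainty.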
REFERENCES (66)
CITATIONS (143)