Evaluation of two statistical approaches for estimating pollutant loads at adjacent combined sewer overflow structures

Abstract : Quantifying pollutant loads from combined sewer overflows (CSOs) is necessary for assessing impacts of urban drainage on receiving water bodies. Based on data obtained at three adjacent CSO structures in the Louis Fargue catchment in Bordeaux, France, this study implements multiple linear regression (MLR) and random forest regression (RFR) approaches to develop statistical models for estimating emitted loads of total suspended solids (TSS). Comparison between hierarchical clustering selection and random selection of CSO events for model calibration is included in model development. The results indicate that selection of the model's explanatory variables depends on both the type of approach and the CSO structure. By using the cluster technique to select representative events for model calibration, model predictability is generally improved. For the available dataset, MLR may have advantages over RFR in terms of verification performance and lower range of error due to splitting events for calibration and verification. But RFR model uncertainty bands are considerably narrower than the MLR ones.
Document type :
Journal articles
Complete list of metadatas

https://hal.archives-ouvertes.fr/hal-01878119
Contributor : Jean-Luc Bertrand-Krajewski <>
Submitted on : Thursday, September 20, 2018 - 4:35:28 PM
Last modification on : Thursday, June 13, 2019 - 10:00:01 AM

Identifiers

Collections

Citation

Duy Khiem Ly, Thibaud Maruéjouls, Guillaume Binet, Xavier Litrico, Jean-Luc Bertrand-Krajewski. Evaluation of two statistical approaches for estimating pollutant loads at adjacent combined sewer overflow structures. Water Science and Technology, IWA Publishing, 2018, 78 (3), pp.699 - 707. ⟨10.2166/wst.2018.346⟩. ⟨hal-01878119⟩

Share

Metrics

Record views

34