Variable selection and data fusion for diesel cetane number prediction - Archive ouverte HAL Accéder directement au contenu
Article Dans Une Revue Fuel Année : 2023

Variable selection and data fusion for diesel cetane number prediction

Résumé

This study evaluates the potential of variable selection to improve the performance of data fusion modelling to estimate diesel cetane number from NIR spectroscopy information acquired on total effluent samples obtained from the hydrocracking process and their operating variables. The evaluation conducted in this research was divided into four steps. First, predictive models were developed using each data block separately. Next, seven variable selection methods were applied on the NIR block, and eleven methods were applied on the process variable block. Then, with each data set generated from the variable selection analysis, single prediction models were generated and compared with those developed in the first step. Finally, data fusion was performed once the best variable selection method was defined for each data block. Two data fusion models were generated, a first using all the variables in the two blocks and a second using only the previously selected variables. In addition, the potential of the sequential and orthogonalized covariance selection (SO-CovSel) method was also analyzed. The results showed that the data fusion modelling using all variables from each data block improves the estimation of the diesel cetane number compared to single models (about 20% reduction of the RMSEP). However, using variable selection analysis before data fusion significantly improves the estimation of this property and leads to greater model stability regarding the RMSE's and r′s (about 47% of the RMSEP). The Covariance Selection (CovSel) method was the most efficient in the NIR data block, while for the process variable data block, it was the sequential backward floating feature selection method (SBFFS) that gave the best performance. The advantages offered by the variable selection resulted not only in having a more accurate prediction of the property but also in improving the analysis and understanding of the process by determining the variables that significantly impact the property studied.
Fichier sous embargo
Fichier sous embargo
0 5 19
Année Mois Jours
Avant la publication
mardi 15 octobre 2024
Fichier sous embargo
mardi 15 octobre 2024
Connectez-vous pour demander l'accès au fichier

Dates et versions

hal-03760296 , version 1 (02-01-2023)

Identifiants

Citer

Jhon Buendía Garcia, Marion Lacoue-Negre, Julien Gornay, Sílvia Mas Garcia, Ryad Bendoula, et al.. Variable selection and data fusion for diesel cetane number prediction. Fuel, 2023, 332 (Part 2), pp.126297. ⟨10.1016/j.fuel.2022.126297⟩. ⟨hal-03760296⟩
78 Consultations
1 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More