A regularised logistic regression model with structured features for classification of geographical origin in olive oils

DOI: 10.1016/j.chemolab.2023.104819 Publication Date: 2023-04-10T05:43:25Z
ABSTRACT
Geographical origin of extra virgin olive oil is a factor that consumers may take into account when making purchasing decisions. Oils that are labelled to be from regions famous for olive cultivation may be assumed to be of higher quality. However, difficulties in the authentication of the geographical origin of olive oils arise due to the similarity in chemical compositions of the oils involved. Fourier-transform infrared (FTIR) spectroscopy has been found to be a viable technology for the classification of oil samples by geographical origin. However, classical methods involving dimension reduction before model fitting usually yield models that are more challenging to interpret. Sparse fused group lasso logistic regression (SFGL-LR) is used with FTIR spectroscopic data to discriminate between Greek and non-Greek organic extra-virgin olive oils. The prediction performance is also compared with that obtained by partial least squares linear discriminant analysis (PLS-LDA). While both methods give comparable good prediction performance, with more than 90% accuracy in classification, the SFGL-LR model demonstrates improvements in the interpretability of the model coefficients. ; RI 6/14 ZY
SUPPLEMENTAL MATERIAL
Coming soon ....
REFERENCES (56)
CITATIONS (5)