NFDI4DS | UHH-SEMS - Publication Details

Prediction-based variable selection for component-wise gradient boosting.

Methodology (stat.ME) FOS: Computer and information sciences Applications (stat.AP) 0101 mathematics Statistics - Applications 01 natural sciences Statistics - Methodology

DOI: 10.48550/arxiv.2302.13822 Publication Date: 2023-11-27

Abstract Supplemental Material References Cited by

AUTHORS (4)

Potts, Sophie

Bergherr, Elisabeth

Reinke, Constantin

Griesbach, Colin

ABSTRACT

Abstract Model-based component-wise gradient boosting is a popular tool for data-driven variable selection. In order to improve its prediction and selection qualities even further, several modifications of the original algorithm have been developed, that mainly focus on different stopping criteria, leaving the actual variable selection mechanism untouched. We investigate different prediction-based mechanisms for the variable selection step in model-based component-wise gradient boosting. These approaches include Akaikes Information Criterion (AIC) as well as a selection rule relying on the component-wise test error computed via cross-validation. We implemented the AIC and cross-validation routines for Generalized Linear Models and evaluated them regarding their variable selection properties and predictive performance. An extensive simulation study revealed improved selection properties whereas the prediction error could be lowered in a real world application with age-standardized COVID-19 incidence rates.

SUPPLEMENTAL MATERIAL

Coming soon ....

REFERENCES ()

CITATIONS ()

EXTERNAL LINKS

OPENAIRE - Products

PlumX Metrics

Prediction-based variable selection for component-wise gradient boosting.

RECOMMENDATIONS

FAIR ASSESSMENT

Coming soon ....

JUPYTER LAB

Coming soon ....