Best Subset Solution Path for Linear Dimension Reduction Models using Continuous Optimization

Methodology (stat.ME) FOS: Computer and information sciences Statistics - Other Statistics Other Statistics (stat.OT) 0101 mathematics Statistics - Computation 01 natural sciences Statistics - Methodology Computation (stat.CO)
DOI: 10.48550/arxiv.2403.20007 Publication Date: 2024-03-29
ABSTRACT
The selection of best variables is a challenging problem in supervised and unsupervised learning, especially high dimensional contexts where the number usually much larger than observations. In this paper, we focus on two multivariate statistical methods: principal components analysis partial least squares. Both approaches are popular linear dimension-reduction methods with numerous applications several fields including genomics, biology, environmental science, engineering. particular, these build components, new that combinations all original variables. A main drawback difficulty to interpret them when large. To define from most relevant variables, propose cast subset solution path method into component square frameworks. We offer alternative by exploiting continuous optimization algorithm for path. Empirical studies show efficacy our approach providing usage further exposed through real datasets. first dataset analyzed using principle while second based framework.
SUPPLEMENTAL MATERIAL
Coming soon ....
REFERENCES ()
CITATIONS ()
EXTERNAL LINKS
PlumX Metrics
RECOMMENDATIONS
FAIR ASSESSMENT
Coming soon ....
JUPYTER LAB
Coming soon ....