Deriving optimal data-analytic regimes from benchmarking studies

Data-analytic regimes Benchmarking Classification trees Additive clustering; Benchmarking; Classification trees; Comparison of methods; Data-analytic regimes; Supervised learning 0504 sociology Additive clustering 05 social sciences Additive clustering; Benchmarking; Classification trees; Comparison of methods; Data-analytic regimes; Supervised learning; Statistics and Probability; Computational Theory and Mathematics; Computational Mathematics; Applied Mathematics Comparison of methods Supervised learning
DOI: 10.1016/j.csda.2016.10.016 Publication Date: 2016-10-27T17:18:29Z
ABSTRACT
In benchmarking studies with simulated data sets in which two or more statistical methods are compared, over and above the search of a universally winning method, one may investigate how the winning method may vary over patterns of characteristics of the data or the data-generating mechanism. Interestingly, this problem bears strong formal similarities to the problem of looking for optimal treatment regimes in biostatistics when two or more treatment alternatives are available for the same medical problem or disease. It is outlined how optimal data-analytic regimes, that is to say, rules for optimally calling in statistical methods, can be derived from benchmarking studies with simulated data by means of supervised classification methods (e.g., classification trees). The approach is illustrated by means of analyses of data from a benchmarking study to compare two different algorithms for the estimation of a two-mode additive clustering model.
SUPPLEMENTAL MATERIAL
Coming soon ....
REFERENCES (25)
CITATIONS (6)
EXTERNAL LINKS
PlumX Metrics
RECOMMENDATIONS
FAIR ASSESSMENT
Coming soon ....
JUPYTER LAB
Coming soon ....