kNNDM: k-fold Nearest Neighbour Distance Matching Cross-Validation for map accuracy estimation
Loo
Cross-validation
Nearest neighbour
DOI:
10.5194/egusphere-2023-1308
Publication Date:
2023-07-05T05:54:13Z
AUTHORS (4)
ABSTRACT
Abstract. Random and spatial Cross-Validation (CV) methods are commonly used to evaluate machine learning-based prediction models, the obtained performance values often interpreted as map accuracy estimates. However, appropriateness of such approaches is currently subject controversy. For common case where no probability sample for validation purposes available, in Milà et al. (2022) we proposed Nearest Neighbour Distance Matching (NNDM) Leave-One-Out (LOO) CV method. This method produces a distribution geographical Distances (NND) between test train locations during that matches NND training locations. Hence, it creates predictive conditions comparable what required when predicting defined area. Although NNDM LOO produced largely reliable estimates our analysis, LOO-based method, cannot be applied large datasets found many studies. Here, propose novel k-fold strategy estimation inspired by concepts CV: (kNNDM) CV. The kNNDM algorithm tries find configuration Empirical Cumulative Distribution Function (ECDF) matched ECDF We tested simulation study with different sampling distributions compared other including performed similarly reasonably across patterns strong reductions computation time sizes. Furthermore, positive linear association quality match two ECDFs reliability provided advantages original while bypassing its size limitations.
SUPPLEMENTAL MATERIAL
Coming soon ....
REFERENCES (0)
CITATIONS (9)
EXTERNAL LINKS
PlumX Metrics
RECOMMENDATIONS
FAIR ASSESSMENT
Coming soon ....
JUPYTER LAB
Coming soon ....