Guidelines for the Creation of Analysis Ready Data

FOS: Computer and information sciences Statistics - Other Statistics Computer Science - Databases Physics - Data Analysis, Statistics and Probability Other Statistics (stat.OT) FOS: Physical sciences Databases (cs.DB) Data Analysis, Statistics and Probability (physics.data-an)
DOI: 10.48550/arxiv.2403.08127 Publication Date: 2024-07-01
ABSTRACT
49 pages, 3 figures, 3 tables, and 5 appendices<br/>Globally, there is an increased need for guidelines to produce high-quality data outputs for analysis. No framework currently exists that provides guidelines for a comprehensive approach to producing analysis ready data (ARD). Through critically reviewing and summarising current literature, this paper proposes such guidelines for the creation of ARD. The guidelines proposed in this paper inform ten steps in the generation of ARD: ethics, project documentation, data governance, data management, data storage, data discovery and collection, data cleaning, quality assurance, metadata, and data dictionary. These steps are illustrated through a substantive case study that aimed to create ARD for a digital spatial platform: the Australian Child and Youth Wellbeing Atlas (ACYWA).<br/>
SUPPLEMENTAL MATERIAL
Coming soon ....
REFERENCES ()
CITATIONS ()
EXTERNAL LINKS
PlumX Metrics
RECOMMENDATIONS
FAIR ASSESSMENT
Coming soon ....
JUPYTER LAB
Coming soon ....