Guidelines for the Creation of Analysis Ready Data
FOS: Computer and information sciences
Statistics - Other Statistics
Computer Science - Databases
Physics - Data Analysis, Statistics and Probability
Other Statistics (stat.OT)
FOS: Physical sciences
Databases (cs.DB)
Data Analysis, Statistics and Probability (physics.data-an)
DOI:
10.48550/arxiv.2403.08127
Publication Date:
2024-07-01
AUTHORS (7)
ABSTRACT
49 pages, 3 figures, 3 tables, and 5 appendices.
Globally, there is an increased need for guidelines to produce high-quality data outputs for analysis. No framework currently exists that provides guidelines for a comprehensive approach to producing analysis ready data (ARD). Through critically reviewing and summarising current literature, this paper proposes such guidelines for the creation of ARD. The guidelines proposed in this paper inform ten steps in the generation of ARD: ethics, project documentation, data governance, data management, data storage, data discovery and collection, data cleaning, quality assurance, metadata, and data dictionary. These steps are illustrated through a substantive case study that aimed to create ARD for a digital spatial platform: the Australian Child and Youth Wellbeing Atlas (ACYWA).
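As a rough illustration only, the ten steps named in the abstract could be tracked as a simple checklist. The sketch below is not taken from the paper: the `ArdChecklist` class and its method names are hypothetical, and treating a dataset as analysis ready once every step is marked complete is a simplifying assumption made for this example.

```python
from dataclasses import dataclass, field

# The ten steps named in the abstract, in the order they are listed there.
ARD_STEPS = (
    "ethics",
    "project documentation",
    "data governance",
    "data management",
    "data storage",
    "data discovery and collection",
    "data cleaning",
    "quality assurance",
    "metadata",
    "data dictionary",
)


@dataclass
class ArdChecklist:
    """Hypothetical tracker recording which ARD steps a project has completed."""

    completed: set = field(default_factory=set)

    def mark_done(self, step: str) -> None:
        # Reject names that are not among the ten steps from the abstract.
        if step not in ARD_STEPS:
            raise ValueError(f"unknown ARD step: {step!r}")
        self.completed.add(step)

    def outstanding(self) -> list:
        # Remaining steps, kept in the order given in the abstract.
        return [s for s in ARD_STEPS if s not in self.completed]

    def is_analysis_ready(self) -> bool:
        # Assumption for this sketch: ready means every step is marked done.
        return not self.outstanding()


checklist = ArdChecklist()
checklist.mark_done("ethics")
checklist.mark_done("metadata")
print(checklist.outstanding())
```

Keeping the steps in a single ordered tuple means the outstanding list always follows the order given in the abstract, whatever order the steps are completed in.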