Exploring Data Provenance in Handwritten Text Recognition Infrastructure: Sharing and Reusing Ground Truth Data, Referencing Models, and Acknowledging Contributions. Starting the Conversation on How We Could Get It Done
Ground truth
DOI:
10.46298/jdmdh.10403
Publication Date:
2024-03-18T15:05:16Z
AUTHORS (60)
ABSTRACT
This paper discusses best practices for sharing and reusing Ground Truth in Handwritten Text Recognition infrastructures, as well ways to reference acknowledge contributions the creation enrichment of data within these systems. We discuss how one can place a repository and, subsequently, inform others through HTR-United. Furthermore, we want suggest appropriate citation methods ATR data, models, made by volunteers. Moreover, when using digitised sources (digital facsimiles), it becomes increasingly important distinguish between physical object digital collection. These topics all relate proper acknowledgement labour put into digitising, transcribing, HTR data. also points broader issues surrounding use machine learning archival library contexts, community should begin record both provenance.
SUPPLEMENTAL MATERIAL
Coming soon ....
REFERENCES (0)
CITATIONS (1)
EXTERNAL LINKS
PlumX Metrics
RECOMMENDATIONS
FAIR ASSESSMENT
Coming soon ....
JUPYTER LAB
Coming soon ....