A metadata generation system for scanned scientific volumes
Digitization
DOI:
10.1145/1378889.1378918
Publication Date:
2008-06-17T13:49:02Z
AUTHORS (4)
ABSTRACT
Large scale digitization projects have been conducted at digital libraries to preserve cultural artifacts and provide permanent access. The increasing amount of digitized resources, including scanned books scientific publications, requires development tools methods that will efficiently analyze manage large collections resources. In this work, we tackle the problem extracting metadata from volumes journals. Our goal is extract information describing internal structures content volumes, which necessary for providing effective access functionalities library users. We propose automatically generating volume level, issue article level based on format text features extracted OCRed text. show performance our system bound historical documents nearly two centuries old. developed integrated it into an operational library, Internet Archive, real-world usage.
SUPPLEMENTAL MATERIAL
Coming soon ....
REFERENCES (24)
CITATIONS (12)
EXTERNAL LINKS
PlumX Metrics
RECOMMENDATIONS
FAIR ASSESSMENT
Coming soon ....
JUPYTER LAB
Coming soon ....