Page-Image Error in Large-Scale Digitization

Paul Conway

doi:10.2352/issn.2168-3204.2013.10.1.art00009

Abstract

This paper presents and interprets data on digitization error gathered from four 1,000 volume random samples that represent the full range of source volumes digitized by Google and the Internet Archive over a six year period and deposited in the HathiTrust Digital Library. The paper summarizes the research method for the project and then presents summary findings on the distribution of page-image error. The findings suggest that the imperfection of digital surrogates is a transparent and nearly ubiquitous attribute of large-scale digitization and one that introduces new complexity in preservation repositories. The paper concludes with suggestions for further research.

72010361

Archiving Conference

archiving

2161-8798

Society of Imaging Science and Technology

7003 Kilworth Lane, Springfield, VA 22151, USA

2161-8798(20130101)2013:1L.36;1-

ac_v2013n1/splitsection9.xml

/ist/ac/2013/00002013/00000001/art00009

Articles

Page-Image Error in Large-Scale Digitization

ConwayPaul

01012013

2013