The ability to incorporate and embed reusable preservation procedures within complex multimedia environments while minimizing the impact on production represents a substantial challenge.The San Diego Supercomputer Center has been researching and prototyping preservation cyberinfrastructure for over a decade. In this latest work, we explore a digital archiving framework which focuses on the design, development, automation and reuse of preservation processes (including accessioning, description, arrangement, storage, and preservation) which can be embedded into existing digital production environments. This current work is funded in part by a joint Library of Congress and National Science Foundation program called Digarch, “Digital Archiving and Long-Term Preservation.”We prototype this approach with a collection called “Conversations with History” which is produced in a distributed workflow environment (University of California Berkeley with Harry Kreisler and University of California TV with Lynn Burnstan). The collection includes video, audio, images, text, transcripts, web-based material, databases of administrative and descriptive metadata, and contains diverse types of data, created at multiple stages within the content production workflow. Additional partners include the University of California San Diego Libraries. The test collection is 3.5TB in size (7TB when replicated across two different storage devices).
Richard Marciano, Chien-Yi Hou, Lynn Burnstan, Harry Kreisler, Reagan Moore, Arcot Rajasekar, "Long-Term Preservation of Large-Scale Multimedia Collections: a Digital Preservation Workflow Approach" in Proc. IS&T Archiving 2006, 2006, pp 88 - 91, https://doi.org/10.2352/issn.2168-3204.2006.3.1.art00021