Regular
No keywords found
 Filters
Month and year
 
  11  0
Image
Page 2,  © Society for Imaging Science and Technology 2011
Volume 8
Issue 1

This paper explores issues surrounding historical archive materials that are currently not housed in a conventional archive, using a case study of biomedical illustrations created for the textbook, Grant's Atlas of Anatomy. The paper explores the following issues:• Resolving intellectual property issues when a commercial publisher owns the copyright to archival materials with high research value;• Determining what are the archival copies - the original drawings or the digital copies and where both the originals and the digital copies should be housed;• Ensuring long-term sustainability for digital archival materials that are not housed in a traditional archives.Our multi-disciplinary research project team is seeking solutions that meet the needs of all the stakeholders while ensuring the drawings are available for current and future generations of researchers. The answers lie in creative dialog between the rightsholders, the medical illustrator creator community, researchers and archivists. For the Grant's Atlas illustrations, the dialogue has at least begun.

Digital Library: ARCHIVING
Published Online: January  2011
  8  1
Image
Pages 3 - 7,  © Society for Imaging Science and Technology 2011
Volume 8
Issue 1

As institutions continue to grow their collections of web-harvested content, there is an ever increasing need for tools that organize, index and share this data. Even a modest web crawl consisting of a few web sites may generate millions of harvested documents. Repeating these crawls over time greatly expands the complexity of stored data. Identifying the scope of a crawl, the location of a page within a crawl and the differences over time between crawls becomes a challenging task. In this paper we will describe a software architecture in use at the University of Maryland designed to support research on quickly extracting information about the crawls, including statistical information, and on indexing web content. While designed to support research, many of the challenges addressed in this software exist at any site which has to manage large sets of time-spanning data.Our architecture consists of two components. The first is a database application for organizing WARC-based web data called a WarcManager. The WarcManager was designed to track URL location and to allow easy extraction of crawl statistics across collections of warc-stored data. It provides both a REST-based API to harvested data as well as a portal for viewing statistics across the collection. The second component is a high performance, http based, storage service called the Simple Web-Accessible Preservation(SWAP) system. The SWAP system is distributed, novel file placement and retrieval service. It has been designed to be minimally intrusive and to allow complete data recovery even in the absence of any SWAP software.These two components have been used to successfully support research into high performance indexing of web-based content. We will describe the implementation and performance characteristics of each component as well as possible real-world uses for the system.

Digital Library: ARCHIVING
Published Online: January  2011
  10  1
Image
Pages 14 - 19,  © Society for Imaging Science and Technology 2011
Volume 8
Issue 1

Data archives are built to last a lifetime. More than a lifetime. They are in place to protect all the data that man and machine create. And yet, today, the archiving storage technology itself is rushing ever faster into obsolesce and every year it becomes more difficult to move this Archived information into the next generation of storage and support expanding requirements for accessibility. The evolution of storage media used to be measured in years, now it is measured in months. So how do you plan for migrating exponentially increasing amounts of data [1] when the subsequent data migration may have to begin even before the current migration is complete?Planning a tiered data storage paradigm shift was never easy even when there was sufficient time to review options; to select the proper next step in what must be a never ending cycle of legacy data protection. But when time is short (and growing shorter), there must be migration strategies and methods established to provide not only for the next migration, but also setting the stage for all subsequent migrations. Considering strategies for migrating “like-technologies”, such as disk-to-disk or server-to-server, can often be staggered to avoid a more massive disruptive migration.But when a migration requires the core Data Management Archive Software (DMAS) technology be replaced, the impacts ripple across the total tiered data architecture with the potential to impact the user community relying on that data.This paper explores aspects of strategies about how a Tiered Data Management Environment (TDME) facing the prospect of replacing the underlying DMAS should design an approach for converting and re-casting the archive.

Digital Library: ARCHIVING
Published Online: January  2011
  8  0
Image
Page 20,  © Society for Imaging Science and Technology 2011
Volume 8
Issue 1

FamilySearch acquires digital images of records having genealogical significance from all over the world. These records are typically hand written and are often in the form of volumes of bound forms housed in small archives, parish and church record repositories, and in some cases government records archives. FamilySearch has been involved in capturing images of these records from all over the world since the 1930s. Until recently these images were captured using film cameras and preserved mostly on microfilm. In recent years the film cameras have been replaced with high resolution digital cameras.Over the years FamilySearch have placed more than 3.3 million rolls of microfilm in their records vault drilled into the side of a mountain in the canyons above Salt Lake City, UT, USA. They are now in the midst of an aggressive digitization effort with the intent to create digital copies of all the images from all microfilm rolls. They expect to complete this digitization effort by end of year 2020.In addition to the microfilm scanning project, FamilySearch are collecting genealogical records in digital form at a rate of greater than 127 million images per year. These are gathered using high resolution digital cameras in a purpose-built imaging workstation. The workstations can easily be disassembled, moved to a new area and reassembled to support digitization efforts at various areas around the world. FamilySearch currently ingest approximately 33 MB of digital image data from these distributed capture stations. By the year 2020 they expect this capture rate to more than double due to increased resolution and quality of the images, as well as due to the conversion to color imaging. It is likely to even be higher than that since more capture stations are very likely to be commissioned in that same time frame.The volume of data and its rate of ingest place large constraints on both the processes involved in digital curation and the design of infrastructure to support those processes. By the end of 2011, FamilySearch expects to be managing more than 403 million images. The preservation copies of these images will consume nearly 8 PB of storage for just one copy each. The project must support high scalability in several different dimensions. For example, the total storage demands require consideration of inexpensive media, while requiring maintenance of low error loss. Multiple copies of the digital objects will increase the storage requirements even more. In addition, the system must handle a very high data throughput rate in order to sustain ingest rates of greater than 100 million images per year.In this document we describe many of the details regarding how the scale issues noted above influenced FamilySearch's digital curation processes, as well as some of the infrastructure design considerations that were made to support them. We also describe how end user access is provided and why FamilySearch chose to separate end user access from preservation repository access.

Digital Library: ARCHIVING
Published Online: January  2011
  8  1
Image
Pages 21 - 25,  © Society for Imaging Science and Technology 2011
Volume 8
Issue 1

Digital curation is a fast-growing subfield of librarianship and archival management. The number of calls for digital curation positions that often require advanced knowledge of multimedia digitization practices, metadata standards, digital asset management systems and project management strategies has increased significantly over the past few years. Yet, Library and Information Science (LIS) programs rarely prepare students adequately in these disciplines. Meanwhile, digital departments are expected to sustain and grow digital collections in the face of system-wide budget reductions. Many departments are operating with fewer staff and resources than ever before. The Digital Services Department at the University of Houston Libraries (UHDS) is one such department. In response, the UHDS Internship Program was designed to provide a comprehensive digital curation experience for library school students while providing the UHDS Department with a steady, low-cost base of intern workers. This paper outlines in detail the steps taken to establish a mutually-beneficial digital curation internship program including motivations behind the decisions, results and lessons learned.

Digital Library: ARCHIVING
Published Online: January  2011
  7  0
Image
Pages 26 - 28,  © Society for Imaging Science and Technology 2011
Digital Library: ARCHIVING
Published Online: January  2011
  16  0
Image
Pages 34 - 39,  © Society for Imaging Science and Technology 2011
Volume 8
Issue 1

Preservation of digital information is a vexing problem for preservation agencies. Properties or functionalities of the preserved object can be lost over time. Digital preservation is an ongoing, proactive process of preserving information and its significance over time. In this article, we conduct a theoretical and an empirical study in Swedish preservation agencies to identify and define a set of essential information properties for long-term preservation. Among identified properties, the study confirmed that knowledge on how to guarantee quality of preserved digital information is low and showed that guiding quality principles need to be established. The need for a unified information quality assurance framework was also highly confirmed.

Digital Library: ARCHIVING
Published Online: January  2011
  9  0
Image
Pages 40 - 43,  © Society for Imaging Science and Technology 2011
Volume 8
Issue 1

A trusted digital repository requires the enforcement and validation of high-level policies. Within the integrated Rule-Oriented Data-management System (iRODS), these policies may be encoded as machine-actionable rules that control the execution of workflows at each storage location. Policies that manage the objects in the repository are augmented with policies to manage ingest and extraction of required metadata. This paper describes a system that provides a graphical environment for policy definition that specifies policy elements as modules, composes policies from these elements, and monitors the application of policies at ingest.

Digital Library: ARCHIVING
Published Online: January  2011
  6  0
Image
Pages 44 - 46,  © Society for Imaging Science and Technology 2011
Volume 8
Issue 1

FamilySearch has the goal to preserve and publish the worlds vital records. In a presentation last year, Archives members were shown how FamilySearch's dCam-X tool can speed up and improve the process of capturing digital images. The next step is to publish the records online with useful metadata, making them accessible to the public as quickly as possible.FamilySearch is committed to digitally publishing the records it captures. The key to doing this on a large scale is identifying, gathering and processing metatdata about the content of the records being captured.

Digital Library: ARCHIVING
Published Online: January  2011
  7  0
Image
Pages 47 - 50,  © Society for Imaging Science and Technology 2011
Volume 8
Issue 1

3D digitizing describes the variety of methods by which “digital copies” of physical objects can be made. These techniques can also be used in cultural heritage preservation and digitally archiving physical objects. Images play an important role in many ways in 3D digitizing processes, especially when the real visual appearance of the object should also be stored.This document describes the meanings and possibilities of image usage in 3D digitizing. It is based on studies made in “Viva3” project at Mikkeli University of Applied Sciences. The project was about testing and developing 3D digitizing processes for different kinds of cultural heritage subjects. Image based solutions and other usage of images in 3D processes were tested with several pilot cases.

Digital Library: ARCHIVING
Published Online: January  2011