Redaction of personal, private, and sensitive information from born-digital materials is increasingly important for repositories. Collection sizes are often too large to process without automation – the assistance of software designed specifically to identify and classify such information and present it in a format that facilitates redaction decisions. Distinguishing between items that may be redacted automatically and those that require manual intervention is similarly important. This paper examines the identification, organization, and redaction of private and sensitive information identified within born-digital materials, particularly those contained on disk images extracted from fixed and removable media carriers. We identify specific items of interest in file systems and individual file formats that may be targets for redaction, and present two approaches to managing and providing access to redacted materials using open source tools developed for the BitCurator Access project (bca-webtools) along with supporting digital forensics software.
Kam Woods, Christopher A. Lee, "Redacting Private and Sensitive Information in Born-Digital Collections" in Proc. IS&T Archiving 2015, 2015, pp 2 - 7, https://doi.org/10.2352/issn.2168-3204.2015.12.1.art00002