End-to-end digital preservation workflow: ingest, virus scan, normalization, METS/PREMIS metadata, AIP storage, and DIP access packages.
Archiving & digital preservation
Trustworthy long-term stewardship: OAIS-style processing, PREMIS/METS, BagIt transfer, finding aids, GLAM publishing, institutional and data repositories, and web archiving (WARC). Prefer tools with clear standards alignment and documented exit paths from proprietary hosts.
Tools in this category (16)
Multisite web platform for scholarly and cultural collections: linked open data, resource templates, modules, and IIIF-friendly patterns.
PHP/MySQL platform for online collections and exhibits: simple item-level Dublin Core metadata, themes, and a plugin ecosystem.
Web-based archival description application aligned with ISAD(G), ISAAR, RAD, and DACS—multi-level finding aids and authority records.
Archival information management for accessioning, description, locations, agents, and public discovery interfaces.
Institutional repository platform for research outputs, theses, datasets, and OAI-PMH harvesting, with an Angular UI from DSpace 7 onward.
Collection management and cataloguing for museums, archives, and libraries—highly configurable metadata profiles and media handling.
Linked data–capable digital object repository (API-X, versioning, fixity) often paired with Samvera/Hyrax for scholarly preservation.
Samvera Rails application providing deposit, workflow, discovery, and admin UI on top of Fedora repositories.
Turnkey research data management repository (Zenodo lineage): records, DOIs, OAI-PMH, permissions, and customizable deposit forms.
Library of Congress reference implementation for BagIt: payload manifests, tag manifests, and fetch.txt for transfer packaging.
Java desktop GUI from the Library of Congress for building valid BagIt packages with human-friendly validation feedback.
Artefactual command-line tool to schedule and report checksum audits against storage locations—pairs with Archivematica storage.
High-performance Python web archive replay stack (WARC) used by Webrecorder and many institutions for Wayback-style access.
Extensible, web-scale, archival-quality crawler produced by the Internet Archive for capturing sites into WARC files.
Harvard-led research data repository: datasets, files, citations, DOI workflows, and granular permissions for institutions.
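Several entries above revolve around BagIt packaging and checksum audits. As a rough illustration of what those tools automate, the sketch below writes a minimal bag (a `data/` payload directory, `bagit.txt`, and `manifest-sha256.txt`) and then re-checks the payload against the manifest. It uses only the Python standard library and is not the API of any tool listed here; the function names `make_minimal_bag` and `audit_bag` are hypothetical. It also skips tag manifests, `fetch.txt`, `bag-info.txt`, and the percent-encoding of unusual file names, so treat it as a teaching aid rather than a conformant implementation.

```python
import hashlib
import tempfile
from pathlib import Path


def sha256_of(path: Path) -> str:
    """Return the hex SHA-256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def make_minimal_bag(bag_dir: Path, payload: dict[str, bytes]) -> None:
    """Write payload files under data/ plus bagit.txt and a SHA-256 manifest.

    Hypothetical helper: `payload` maps relative file names to their bytes.
    """
    data_dir = bag_dir / "data"
    manifest_lines = []
    for name, content in sorted(payload.items()):
        target = data_dir / name
        target.parent.mkdir(parents=True, exist_ok=True)
        target.write_bytes(content)
        # Manifest entries are "<checksum> <path relative to the bag root>".
        manifest_lines.append(f"{sha256_of(target)}  data/{name}")
    # Bag declaration: the two required fields for BagIt 1.0.
    (bag_dir / "bagit.txt").write_text(
        "BagIt-Version: 1.0\nTag-File-Character-Encoding: UTF-8\n",
        encoding="utf-8",
    )
    (bag_dir / "manifest-sha256.txt").write_text(
        "\n".join(manifest_lines) + "\n", encoding="utf-8"
    )


def audit_bag(bag_dir: Path) -> list[str]:
    """Return payload paths that are missing or whose checksum has changed."""
    failures = []
    manifest = (bag_dir / "manifest-sha256.txt").read_text(encoding="utf-8")
    for line in manifest.splitlines():
        expected, rel_path = line.split(maxsplit=1)
        target = bag_dir / rel_path
        if not target.is_file() or sha256_of(target) != expected:
            failures.append(rel_path)
    return failures


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        bag = Path(tmp) / "example_bag"
        make_minimal_bag(bag, {"letter.txt": b"Dear archive,\n"})
        print("failures after packaging:", audit_bag(bag))
        (bag / "data" / "letter.txt").write_bytes(b"silently altered\n")
        print("failures after tampering:", audit_bag(bag))
```

For real transfers, the Library of Congress BagIt tools listed above are likely the better choice, since they also handle tag manifests and validation reporting that this sketch leaves out.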
