Metadata processing in CREDO long-term digital archive

Piotr Pałka, Tomasz Traczyk

Abstract


Long term archiving of digital resources depends, among others, on an ability of searching information, verifying its authenticity and its proper interpretation, in technical (data format, etc.), as well as semantic (understanding the information in proper context, etc.) sense. This can be achieved only by appropriate processing and storage of metadata, which describe archived resources. The paper presents processing and storage of metadata in CREDO long-term digital archive.

Keywords


digital archiving; long-term archiving; metadata

Full Text:

PDF (Polski)

References


Consultative Committee for Space Data Systems: Reference model for an open archival information system (OAIS). 2012, http://public.ccsds.org/pubs/650x0m2.pdf.

Consultative Committee for Space Data Systems: XML formatted data unit (XFDU) structure and construction rules. Recommended standard. CCSDS661.0-B-1. 2008, http://public.ccsds.org/Pubs/661x0b1.pdf.

Dublin Core Metadata Initiative: Dublin Core Metadata Element Set, Version 1.1. 2012, http://dublincore.org./documents/dces.

Exchangeable image file format for digital still cameras: Exif Version 2.31. Standard of the Camera & Imaging Products Association. CIPA DC-008-Translation-2016, http://www.cipa.jp/std/documents/e/DC-008-Translation-2016-E.pdf.

Ghosh P.: Google’s Vint Cerf warns of ‘digital Dark Age’. BBC News, http://www.bbc.com/news/science-environment-31450389.

International Press Telecommunications Council. IPTC Photo Metadata Standard. 2016, http://www.iptc.org/standards/photo-metadata/iptcstandard

JHOVE – JSTOR/Harvard Object Validation Tool, http://jhove.sourceforge.net.

Library of Congress. METS: Metadata Encoding and Transmission Standard, http://www.loc.gov/standards/mets/.

Marasek K., Walczak J., Traczyk T., Płoszajski G., Kaźmierski A.: Koncepcja elektro­nicznego archiwum wieczystego. Studia Informatica, Vol. 30, No. 2B, Gliwice 2009, s. 275÷307.

Metadata Extractor, https://github.com/drewnoakes/metadata-extractor.

Narodowe Centrum Badań i Rozwoju: DEMONSTRATOR+ Wsparcie badań naukowych i prac rozwojowych w skali demonstracyjnej, http://www.ncbr.gov.pl/programy-krajowe/demonstrator-wsparcie-badan-naukowych-i-prac-rozwojowych-w-skali-demonstracyjnej.

Pałka P., Śliwiński T., Traczyk T., Ogryczak W.: Persistence Management in Digital Document Repository. Communications in Computer and Information Science, Vol. 613, Springer, 2016, s. 668÷682.

Pater K., Traczyk T.: Opakowanie zasobów cyfrowych na potrzeby archiwizacji długoterminowej. Studia Informatica, Vol. 34, No. 2B(112), Gliwice 2013, s. 898÷103.

Płoszajski G. (red.): Standardy techniczne obiektów cyfrowych przy digitalizacji dziedzictwa kulturowego. Biblioteka Główna Politechniki Warszawskiej, 2008, http://bcpw.bg.pw.edu.pl/dlibra/docmetadata?id=1262.

The Apache Software Foundation: Apache Tika – a content analysis toolkit, https://tika.apache.org.

The National Archives UK: Digital Record Object Identification DROID: file format identification tool, http://www.nationalarchives.gov.uk/information-management/

manage-information/preserving-digital-records/droid.

The National Archives UK: PRONOM: the technical registry, http://www.nationalarchives.gov.uk/PRONOM/.

Traczyk T., Ogryczak W., Pałka P., Śliwiński T. (red.): Digital Preservation: Putting It to Work. Studies in Computational Intelligence, Vol. 700, Springer, 2017.




DOI: http://dx.doi.org/10.21936/si2017_v38.n2.804