The Importance of File Formats

Managing file formats is an important topic to consider in digital preservation. In a broader context, one needs to study the application and implications of digital file formats. A full listing of recommended formats from the Library of Congress is available online. The Library of Congress’s recommended formats are based on seven sustainability factors. These include:

  • Disclosure – specifications and tools for validating the integrity and accessibility of the format exist. You can now find out how information is encoded as bits or bytes.
  • Adoption – the format is widely used. If everyone is using it, tools will be available for migration and emulation.
  • Transparency – It’s easy to analyze the format using basic tools, such as human readability. Information is not encrypted or compressed.
  • Self-Documentation – the format allows you to add metadata directly to the record. You don’t have to have a program or a database to find out what the record is.
  • External Dependencies – How much hardware or software do you need to access the format? The less specialized hardware or software you need, the better.
  • Impact of Patents – Patents could make it harder to open or migrate formats. Less of a worry with formats that are widely adopted.
  • Technical Protection Mechanisms – Formats should not be tied to a particular vendor or program. The format should be accessible regardless of the system to which it was originally uploaded.

The main content types of file formats are images, video, audio, and text. ISO compliant formats for these types of materials include:

  • PDF/A
    • Plain Text
    • XML
    • TIFF
    • JPEG2000

If you have several different file formats and versions of those formats (numerous different versions of PDF, Word, and Image formats), your digital preservation strategy should alleviate the effects of obsolescence and propagation. Strategies include file migration, emulation, normalization, and developing an institutional policy of only using certain file formats.

2023 NAGARA Conference – Cincinnati, Ohio

Looking to get some additional in-depth information regarding electronic records? This year’s National Association of Government Archives and Records Administrators Annual Conference will feature several pre-conference digital records workshops and conference presentations. Several Ohio Electronic Records Committee members will be presenting.

Individuals can attend pre-conference workshops and do not have to attend the NAGARA conference.

For a full listing of Pre-Conference Workshops – https://nagara.org/AC2023/PreCon-Workshops.aspx

Pre-Conference Workshops that Ohio ERC members will be presenting:

DATE: Tuesday, July 18, 2023, from 8:00 AM – 5:00 PM – Disaster Preparedness & Response for Record-Keepers

PRESENTERS: Nick Crossley, Becca Halbmaier, Daria Labinsky, Pari Swift (Ohio ERC Member)

PRICE: $150.00

OVERVIEW: “If you don’t like the weather, wait 5 minutes” is a popular saying in Ohio. But don’t wait until disaster strikes to come up with a recovery plan! It is vital that governments be prepared for small-scale to large-scale disasters of various types (man-made, natural, internal, external, etc.) that could affect records. In fact, some records are essential (or vital) to the recovery efforts themselves and the resumption of business. This full-day interactive workshop will be conducted by persons with a passion for disaster recovery planning and experience in government and emergency management.

DATE: Tuesday, July 18, 2023, from 1:00 PM – 5:00 PM – Introduction to Digital Imaging & Digital Preservation Requirements

PRESENTERS: Jahzerah Brooks, Nathan Owens (Ohio ERC Member), Tina Ratcliff (Ohio ERC Member), John Runion (Ohio ERC Member)

PRICE: $75.00

OVERVIEW: Access to government records and archival collections has always been important, but is now even more imperative in today’s “remote work” setting. Digital imaging of hard-copy records may be the solution to increase availability and transparency, but once a record is scanned, records programs and archives will then have digital preservation requirements and other regulations to meet. Join the Ohio Electronic Records Committee in this introductory workshop covering the journey of a “paper” record becoming a digital image and the considerations we must have for assuring digital preservation.

There are several electronic records presentations throughout the conference. Please check out the list of presentations that will be given from Wednesday, July 18th thru Friday, July 20th. https://nagara.org/AC2023/program-schedule.aspx?78b74a36c4ea=3#78b74a36c4ea

We hope to see many of you in Cincinnati this year!

Elise Kelly
Ohio ERC Member

County Records Centers and Archives and the need for a Digital Preservation System

Many County Records Centers and Archives face the challenges of modest budgets and limited staff. Oftentimes, we have to prioritize projects and rely on volunteers for additional help. Planning is a must but frequently in a small office, plans change and staff need to be able to communicate with one another and adapt to changes in plans.

For a number of years, we at the Greene County Records Center and Archives were aware that we would need to acquire a digital preservation system for the growing number of permanent/long-term County digital records. Priorities and a limited budget prolonged tackling this issue. In 2021, we began speaking with and surveying many of the County Offices/Departments regarding the direction they were going with their records. We found that Offices/Departments were either completely going digital for their records or were doing a hybrid approach – paper and electronic. We also learned that the County’s Offices/Departments’ in-house data management systems, were no longer going to be supported by the County’s IT Department. We knew that we needed to act soon for providing an alternative for these Offices/Departments.

We researched the best digital preservation systems and found that Preservica met all of our needs. Below are the following points that sold us on Preservica.

  1. Digital preservation system that preserves long-term/permanent records
  2. Addresses hardware and software obsolescence through migrations
  3. Protects digital records from degradation/corruption of formats by monitoring the fixity of records
  4. Converts records to open formats based on archival standards
  5. Secure/Controlled digital environment – backed up on five different Cloud servers around the world multiple times
    1. 24/7 monitoring
    1. Encryption of all files
  6. Can add metadata to make records searchable
  7. User Security Settings
    1. Public User – public records
    1. Internal User – Public and Confidential records
    1. Internal Administrative User – Public and Confidential records
  8. Cannot accidentally or intentionally delete records without the action being reviewed by County Records Manager. County Records Manager can deny a deletion request if the Office /Department has not signed off on an RC-3 or if the records do not meet the retention schedule.

Purchasing Preservica was just the tip of the iceberg. There were many challenges and lessons learned along our year journey of implementing Preservica. These challenges and lessons learned will be documented in a tip sheet and will be available later this year.

Elise Kelly
Greene County Records Center and Archives