Digitisation Standard

  Continuum Logo
Issued under the Public Records Act 2005, Section 27
Scope: All Public Offices and Local Authorities
Status: Discretionary Standard
 

S-6 Archives New Zealand's Standard for Digitising Non-Electronic Records for Recordkeeping Purposes and Retention of Non-Electronic Records in Electronic Form Only.

This standard sets out requirements for digitisation processes for public offices and local authorities to support the requirements of the Electronic Transactions Act 2002 and Public Records Act 2005
 
Chief Executive and Chief Archivist
 
Diannce Macaskill Signature
Signed:
 
Date: 29 January 2007
 
Review Date: 2010 

Contents

Acknowledgements
Introduction
Glossary
Purpose
Applicability
Mandate
Legislative Framework
Scope

Requirements

Summary of Mandatory Requirements
Preliminary Considerations
1.0. All Digitisation and Digitisation Processes must be Planned, Scoped and Documented
2.0. Systems to Support Management of the Digital Output of Digitisation as Records must be in Place
3.0 Disposal of all Records must be Authorised and Documented
4.0 Long-Term Management Systems, where Required, for both Source and Digitised Records, Must be Documented and Implemented

Mandatory Appendices

Appendix 1: Checklist of Mandatory Requirements
Appendix 2: Compliance Certification

Guidance Appendices

Appendix 3: Checklist of best Practise Recommendations
Appendix 4: Issues to Consider when Assessing the Viability of Digitisation
Appendix 5: Recommended Technical Specifications
Appendix 6: File-naming Metadata Recommendations
Appendix 7: Quality Assurance Recommendations
Appendix 8: Recommended Staff Skill Sets
Appendix 9: Further Resources
 

Acknowledgements

This Standard was developed for the Government Recordkeeping Group, Archives New Zealand by Barbara Reed, Recordkeeping Innovation Pty Ltd., and Stephen Clarke, Archives Analyst, Archives New Zealand, with advice and guidance from an advisory group comprising:

The Chief Archivist gratefully acknowledges the contributions of the advisory group.
 
The development of this Standard was informed by international professional literature and draws upon various institutional standards and policies. In particular the ISO standards on digital imaging and information documentation1; and two publications issued by the Queensland State Archives: ‘Digitisation Disposal Authority’ (April 2006) and ‘Guidelines for the Digitisation of Paper Records’ (May 2005). Please refer to Appendix 9: Further Resources for further information.
 
Note: 1. ISO 12653 – 1: 2000, Electronic imaging – Test targets for the black and white scanning of office documents, Part 1 – characteristics;
ISO 12653-2: 2000, Electronic imaging – Test targets for black and white scanning of office documents, Part 2 – methods of use;
ISO/TR 15801: 2004, Electronic imaging – Information stored electronically – Recommendations for trustworthiness and reliability;
ISO 23081 – 1: 2006, Information and documentation – Records management processes – Metadata for records, Part 1 – Principles; and
ISO 23081 – 1: 2006, Information and documentation – Records management processes – Metadata for records, Part 2 – Implementation issues.

 

Introduction

This Standard is for use in the design and conduct of responsible digitisation by all organisations which are covered by the Public Records Act 2005.
 
Meeting the mandatory requirements of this Standard will enable public offices and local authorities to meet their legislative obligations and dispose of original source records after digitisation. This applies to records where the business action takes place on the digitised record, rather than on the non-digital original source record. Public offices may then implement the General Disposal Authority: Digitised Original Source Records. The Standard sets out the criteria that will allow local authorities to be confident that they have met the requirements of the Electronic Transactions Act 2002 and dispose of the original source protected records.
 
The Standard provides a framework for digitisation back-capture projects although not the disposal authority; these records must have a specific disposal authority and are subject to an appraisal process prior to any consideration of disposal.
 
With the shift to electronic recordkeeping systems, many public offices and local authority organisations in New Zealand are digitising original paper, or other non-digital, records. The official record of activity, regardless of the format, is the one on which the business action was taken; this is increasingly becoming the digitised copy of a non-digital original. Public offices and local authorities are legally responsible for ensuring that they create full and accurate records of their activities and that these records are maintained over time for subsequent reference. This responsibility applies regardless of the records' storage media.
 
Digitisation, also known as imaging or scanning, is the means of converting hard-copy, or non-digital, records into digital format. It may also involve taking digital photographs of the source records. Once converted to digital objects, they may be:

For the purposes of this Standard, digitisation can broadly be categorised into two types, further defined in the Glossary and Scope sections:

Digitisation offers the following potential benefits to organisations:

There are a number of risks associated with implementing a digitisation process:

Continuum logo Back to top

Glossary

Further explanation of technical terminology is included in the glossary in Appendix 5: Recommended Technical Specifications.
 

Obligation Terms:

In this Standard, obligation is indicated by the following terms, when they are capitalised within the text of the requirements:
 
Must
Means that the item is a mandatory requirement of the Standard.
 
Should
Means that valid reasons may exist in particular circumstances to ignore a particular item, but the full implications have to be understood and carefully weighed before choosing a different course.
 
May
Means that an item is truly optional. Some implementers may choose to include the item because it serves a particular local requirement or because it enhances the digitisation process, while others may omit the same item.
 

General Terms:

In this Standard, the following terms are used as defined below:
 
Born digital
Digital materials which are created and retained only in digital form, without ever having had a non-digital equivalent. This term is used to differentiate them from:

  1. digital materials which have been created as a result of converting non-digital originals; and
  2. digital materials which may have originated from a digital source but have been printed to paper.

[Derived from Digital Preservation Coalition, Handbook. http://www.dpconline.org/graphics/intro/definitions.html]
 
Business system
An organised collection of hardware, software, supplies, policies, procedures and people, which stores, processes and provides access to an organisation's business resources [Source: National Archives of Australia, Recordkeeping Glossary. http://www.naa.gov.au/recordkeeping/rkpubs/recordkeeping_glossary.html]. A business system will create records, but may or may not manage them according to records management requirements. A recordkeeping system is a specific type of business system with the dedicated functionality of managing an organisation's record resources.
 
Business-process digitisation
Digitisation of records, often on receipt, and incorporation of the digital record into a business system supporting ongoing organisational activities. Such digitisation may take place in conjunction with the operation of an electronic document and records management system. In business-process digitisation, the business action takes place on the digital record and, therefore, the official record of the action is the digital record.

Derivative
An image that has been created from another image, such as the master image, through some kind of image editing process to create a user or working copy. The process usually involves a loss of information to reduce the size by sampling it to a lower resolution, using lossy compression techniques, or altering an image using image processing techniques. Typically, such derivatives are made for purposes such as web access, including "thumbnail" images that might be only 100 pixels square, or as "reference" or "service" images that should fit completely within an average monitor. Images created for this purpose commonly have smaller file sizes and, therefore, do not require a fast network connection and are in web-viewable formats.

Digitisation
Also known as imaging or scanning, digitisation is the means of converting hard-copy or non-digital records into digital format. Hard-copy or non-digital records include audio, visual, image or text. Digitisation may also be undertaken by taking digital photographs of the source records, where appropriate.

Digitisation projects
Retrospective or project-based digitisation of existing sets of non-digital records to enhance accessibility, maximise re-use or for preservation purposes. In digitisation projects, the business action is already completed and was undertaken on the non-digital record, which remains the official record regardless of the existence of a digital copy.

Indexing
The process of establishing access points to facilitate retrieval of records and/or information. [Source: ISO 15489-2001]

Master
A faithful digital reproduction of a document, optimised for longevity and for production of a range of delivery versions (derivatives). Masters are captured at the highest practicable quality or resolution and stored for long-term usage. Typically, masters are stored in an off-line mode on tape or CD and are accessed only for the production of derivative images.

Metadata

Data describing context, content and structure of records and their management through time. [Source: ISO 15489-2001]

Protected record (local government)
A local authority record declared by the Chief Archivist to be a protected record by notice in the Gazette. This list of local authority protected records is known as the 'Local Government Schedule'. Local authority records are the classes of local government records that are created or received by a local authority in the conduct of its affairs.

Public record
Any record created or received by a public office in the conduct of its affairs. See section 4 of the Public Records Act 2005 for a full definition.

Re-use
A use of the record that is different from and separate to the original process that resulted in the creation of the record.

Recordkeeping System
"[I]nformation systems which capture, maintain and provide access to records over time." (AS 4390-1996, Part 1, 4.20). These systems must have records management functionality embedded; refer to Archives New Zealand's Continuum Resource Kit for further guidance.

Redaction
The process of removing or masking unwanted or sensitive areas of a document prior to showing it to others.

Source record
A document or record that has been copied, converted or migrated or will be the input for such a process. A source record may be an original record or it may be a reproduction that was generated by an earlier copying, conversion or migration process. [Source: National Archives of Australia,
Recordkeeping Glossary. http://www.naa.gov.au/recordkeeping/rkpubs/recordkeeping_glossary.html] 
Continuum logo Back to top

Purpose

This Standard establishes a framework for maintaining public records and local government protected records in digital format only, where the original paper, or other non-digital source record, has been copied electronically by digitising or other means. It sets out the mandatory requirements for digitisation processes and for the implementation of the accompanying Disposal Authority: Digitised Original Source Records, and guidance for local authorities to meet the requirements of the Electronic Transactions Act 2002. The Standard outlines best practice recommendations, particularly addressing the following broad areas:

Applicability

This Standard applies to public offices and local authorities. Where public offices and local authorities contract third parties to carry out (local) government functions, this Standard applies to the records documenting those (local) government functions.
 

Mandate

This is a discretionary standard issued under section 27 of the Public Records Act 2005.
 
However, where public offices wish to use the accompanying General Disposal Authority: Digitised Original Source Records to destroy original source records, the mandatory requirements of the Standard, as set out in Appendix 1, must be met, and certified as outlined in Appendix 2. Compliance with the Standard also means that the Chief Archivist and the public office can be confident that the requirements of s 25(1) of the Electronic Transactions Act 2002 are met by the digitised record.
 
Local authorities which meet the mandatory provisions of the Standard can be confident they meet the requirements of s 25(1) of the Electronic Transactions Act 2002. They can then dispose of the original source records. 
 
Continuum logo Back to top

Legislative Framework

Legislation that has an impact on digitisation of public and local authority protected records includes:

Public offices need to satisfy the mandatory requirements established in this Standard as a pre-condition to implementing the General Disposal Authority: Digitised Original Source Records, issued under the Public Records Act 2005.
 
By satisfying the requirements of this Standard, a local authority can be confident that it meets the requirements of s 25(1) of the Electronic Transactions Act 2002. It may then legally dispose of the non-digital form of the protected record and retain the record in electronic form only.
 
A checklist to assist agencies to assess whether they meet the requirements of this Standard is included as Appendix 1.
 
Evidence Act 2006 (as amended), governs the admissibility of documents as evidence in legal proceedings.
 
Section 138 of the Evidence Act 2006 provides that a copy which is certified to be a copy of the public document held by a local authority is admissible in court to prove the truth of its contents.
 
Being able to show that the organisation's digitisation processes met the Standard would be relevant in the event of a challenge to the authenticity of a public office or local authority's digitised form of the original source record. 
Continuum logo Back to top

Scope

This Standard applies to the processes of converting non-digital original records (including audio, visual, image, text, or microform) into digital images. This Standard also covers the subsequent management of both the original records and their digitised counterparts.
 
It applies to:

Once non-digital records are converted to digital images, many of the management and preservation issues associated with the digital image are the same as those for born-digital records.
 
Out of scope:
This Standard does not cover:

Note: 2. For guidance on these and other issues, please see Archives New Zealand's other Continuum publications: Fact Sheets, General Disposal Authorities, Guides and Standards. 
 
Continuum logo Back to top

Requirements

Summary of Mandatory Requirements

 
1.0. All Digitisation and Digitisation Processes must be Planned, Scoped and Documented:

2.0. Systems to support management of the Digital Output of Digitisation as Records must be in place:

3.0. Disposal of all Records must be authorised and documented:

4.0. Long-term mangement systems, where required, for both source and digitised records, must be documented and implemented:

(See Appendix 1 for a full checklist of mandatory requirements.)
 

Preliminary Considerations

The rationale for digitising SHOULD be carefully aligned to a costed business case geared at improving the organisation's ability to carry out its functions. The business case SHOULD clearly outline the benefits and anticipated business or cost efficiencies. The business case SHOULD involve appropriate project budgets, resource commitments and be realistically costed.
 
Digitisation can involve extensive document preparation and requirements for indexing, which combined can comprise the majority of a digitisation budget.
 
Agencies should not be misled by casual advice that digitisation is a cheap option. Digitisation undertaken as a space saving device is rarely justified, especially when the costs of future migration projects are factored in.
 
Appendix 4 provides a set of questions to assist in assessing the viability of digitisation for non-digital records.
 
Master Copies and Derivatives
 
Master copies of digitised records are those maintained as a separate and inviolable record in a safe storage environment, executed to the highest technical specifications available.
 
Derivative versions, where required, SHOULD be made during the digitisation process. Master copies SHOULD be made available for the creation of subsequent derivative images, where necessary.
 
The maintenance of a master record separated from the business process MAY not be necessary for business-process digitisation, where the digitised record:

Official Record
 
The official record of action is the record on which the business action takes place, identified as the record which is used in the transaction of business, regardless of whether this is the master or a derivative. Where a digitised copy of an original source record is the official record for business actions then the original is no longer the official record, it can then be assessed for disposal. If the original non-digital record is the record on which the business action still takes place, then it is still the official record and CANNOT be considered for disposal.
 
Mandatory Requirement
 
Preliminary Requirement: The official record of action MUST be identified as the one which is used in the transaction of business.
 
NB If the original source record is still the record of business action it cannot be disposed of. If the subsequent business action is taken on the digitised copy the original source record can be considered for disposal.
 
Continuum logo Back to top

1.0. All Digitisation and Digitisation Processes MUST be Planned, Scoped and Documented

The project documentation SHOULD include:

Mandatory Requirement
 
1.0 All digitisation and digitisation processes MUST be planned, scoped and documented.
 

1.1. An Appropriate Digitisation Approach MUST be Selected, Documented and Implemented

A number of approaches, or a combination of approaches, to digitisation can be adopted. The approach selected needs to be documented.
 
Decisions on each of the following areas will contribute to the overall approach employed:

The way these decisions are approached will be different for business-process digitisation and for digitisation projects and will vary across organisations.
 
Regardless of which approach to digitisation is adopted, the following will apply:

Mandatory Requirements
1.1 An appropriate digitisation approach MUST be selected, documented and implemented.
1.1.1 Quality control and assurance processes MUST be implemented regardless of the digitisation approach adopted.
 

1.2. Technical Specifications Aligned to the Digitisation Requirements MUST be Selected, Documented and Implemented

A large volume of technical standards associated with digitisation are available. Such standards include recommendations on:

Specific recommendations on technical specifications for digitisation are detailed in Appendix 5.
 
These standards are rapidly evolving, especially in the area of technical capacity of equipment to accommodate such standards. The primary consideration in adopting technical specifications is to ensure the legibility of the digitised image. The following basic criteria SHOULD be adhered to when selecting technical standards:

Mandatory Requirements
1.2 Technical specifications aligned to the digitisation requirements MUST be selected, documented and implemented.

 

1.3. Equipment and Software Aligned to the Digitisation Requirements MUST be Implemented

The quality of equipment and software used in digitising significantly affects the capability to support appropriate technical standards and, therefore, to ensure longevity of the digital image produced. Where destruction of the original source records is contemplated, agencies MUST be able to assert confidence in the long-term viability of those digitised images requiring on-going retention.
The following general rules apply:

Readers available to users SHOULD support the display of the digital image in a manner, and to a quality, acceptable for the business being conducted. This can involve reviewing equipment specifications. For example, if the quality of the colour on a map is critical, the quality of the equipment used to render the image needs to support the capacity to retrieve and analyse this quality. If, on the other hand, it is only essential to be able to read the contents to gain the sense of the text, the quality of display could be appropriately less critical.
 
Response times required by end users can affect the storage media used for images. For example, if DVDs are used as an offline storage media, the retrieval or response time for a user to receive a copy of the requested image will be at least partly dependent on the hardware loading the relevant DVD onto the system. This timeframe might be too long for the purpose of the query.
 
Mandatory Requirements
1.3 Equipment and software aligned to the digitisation requirements MUST be implemented.
1.3.1 Where destruction of the source record is contemplated, agencies MUST be able to assert confidence in the long-term viability of the images requiring on-going retention.
1.3.2 During the digitisation process, the use of techniques that enhance the digitised image to make the image have a more exact resemblance to the original, or reproduce the source records' appearance more faithfully, MUST be documented.
 
Continuum logo Back to top

2.0. Systems to Support Management of the Digital Output of Digitisation as Records MUST be in Place

The purpose of the digitisation will determine the functionality and system requirements necessary to manage the digital outputs.
 
Prior to digitisation, consideration of third party copyright or other constraints inherent in the record SHOULD be resolved.
 
Before the final version is reached, the digitising process could create a number of versions of a digitised record – for example, a raw file, one which has been enhanced to maximise resemblance to the original, or one to which quality assurance processes have been applied. Each digitisation process will specify the parameters of acceptable enhancements (see Requirement 2.1). The final output of the digitisation process MUST be regarded as the record for incorporation into the organisation's recordkeeping framework.
 
Many public offices and local authorities use electronic document and records management systems as their image control systems. Such systems SHOULD be assessed for functionality against Archives New Zealand's Electronic Recordkeeping Systems Standard (see the Continuum Resource Kit).
 
Where the digital image is to be used as a record in current or continuing business, the system governing the business process which will use the image SHOULD be integrated with records control systems. This ensures that the digital image will inherit the business controls and metadata associated with the business process, will be lodged within its business context and will have its authenticity enhanced by integration with the normal business systems.
 
Digitisation projects are often undertaken as a preservation measure to extend the life of fragile original records or to increase access by users to the informational content of records. Where the results of such projects are not immediately linked to a pre-existing business system, consideration of the software to be used to manage the images is necessary. Where the digital images represent a considerable investment of time and resources, serious consideration SHOULD be given to purchasing a management system to ensure appropriate control of processes such as identification, indexing, classification, security and access controls, rights management and preservation.
 
Mandatory Requirements
2.0 Systems to support management of the digital output of digitisation as records MUST be in place.
2.0.1 The final output of the digitisation process MUST be regarded as the record for incorporation into the organisation's recordkeeping framework.
 

2.1.  Source Record Preparation Guidelines MUST be Documented and Implemented

The aim of producing digital images is to reproduce the original as faithfully as possible so that the digital image can act in place of the original, where it is required to act as evidence of business activities.
 
Source record preparation guidelines are not restricted to, but SHOULD include:

Mandatory Requirement
2.1 Source record preparation guidelines MUST be documented and implemented.

 

2.2. All Digitised Images MUST be Assigned Metadata to Document Digitising Processes and to Support Ongoing Buisness Procedures3

Metadata attributed to, or associated with, images is an essential component in the management and retrieval of images. Wherever possible metadata SHOULD be inherited from existing systems.
 
Two types of metadata MUST be captured:

The majority of this metadata can be automatically sourced from the software and hardware used to manage the digitising process. The manual attribution or application of metadata SHOULD be minimised.
 
Metadata can be embedded with the resource in header information, or can be managed in a separate system, or both, but in either case there has to be a direct relationship or association between them. That is, while metadata may reside in a separate system, it has to link directly to the resources. Metadata can also be encapsulated within the image format. Choices for storage of metadata SHOULD be made according to the principles outlined in ISO 23081 – 1: 2006, Information and documentation – Records management processes – Metadata for records.
 
Image-level metadata
Image-level metadata SHOULD be generated automatically at the point of digital capture direct from the digitisation equipment and SHOULD avoid manually-assigned data entry wherever possible.
 
In addition to the metadata inherited from recordkeeping capture and processes, or from indexing and searching metadata, image-level metadata MUST include:

Additional image-level metadata MAY be assigned at the discretion of the public office or local authority.
 
Recommendations on naming protocols for digital images and directories are included in Appendix 6: File-naming Metadata Recommendations.
 
Business-process digitisation
Wherever possible the metadata controlling the business process and the recordkeeping functions associated with the business process SHOULD govern, and be inherited by, the specific digital image. This metadata MUST be incorporated into the organisation's electronic recordkeeping framework and be consistent with ISO 23081 – 1: 2006, Information and documentation – Records management processes – Metadata for records. Additional metadata describing the process of digitisation and specific characteristics of the digitised image MUST be included, as outlined above.
 
Digitisation projects
Where access to content is the primary driver, more intensive attention to providing additional indexing and search entry points is appropriate. The images can be managed as individual items, rather than as context-linked records, particularly if intended for web access by external users. Relevant elements from the New Zealand Government Locator Service (NZGLS) metadata SHOULD be associated with each image.
 
Mandatory Requirements
2.2 All digitised images MUST be assigned metadata to document digitising processes and to support ongoing business processes.
2.2.1 Two types of metadata MUST be captured:

2.2.2 In addition to the metadata inherited from recordkeeping capture and processes, or from indexing and searching metadata, image-level metadata MUST include:

2.2.3 In business-process digitisation, records metadata MUST be incorporated into the organisation's electronic recordkeeping framework and be consistent with ISO 23081 Metadata for records, Parts 1 and 2.
 

2.3. Quality Assurance Procedures MUST be Defined, Documented and Implemented

Quality assurance to ensure the digital copy of the source record is a true and accurate copy is critical to being able to assert that the records possess integrity and are authentic.
 
Such quality assurance procedures SHOULD be documented and built into the ongoing operation of the digitising process, not considered simply a check on output only.
 
Quality assurance procedures SHOULD, at minimum, address the following issues:

Quality checking MUST be completed before the digitised images are accepted into a business process, or as a master copy in the case of digitisation projects. Quality checking MUST be complete before the destruction of the source records is considered.
 
The results of quality assurance processes and quality checks MUST be documented.
 
A review of quality procedures for digitising SHOULD be undertaken regularly to ensure that the procedures continue to meet the business requirements.
 
Appropriate training SHOULD be provided to all staff who create, manage or work with digitised records. Documentation on the level and the frequency of training provided to those staff involved with digitisation SHOULD be created and maintained.
 
For further information on quality assurance, see Appendix 7: Quality Assurance Recommendations, and Appendix 8: Recommended Staff Skill Sets.
 
Mandatory Requirements
2.3 Quality assurance procedures MUST be defined, documented and implemented.
2.3.1 Quality checking MUST be completed before the digitised images are accepted into a business process, or as a master copy in the case of digitisation projects.
2.3.2 Quality checking MUST be complete before the destruction of the source records is considered.
2.3.3 The results of quality assurance processes and quality checks MUST be documented.

2.4. Storage Media and Back-Up Procedures MUST be Defined, Documented and Implemented.

Maintaining technology-dependent records in networked storage is currently the storage strategy most likely to ensure the continuing accessibility of such records over time. However, digitised records can occupy significant storage space depending on the quality, resolution and compression ratios employed. Strategies for storage MAY include:

The requirements for storage include:

All digitised records, and their associated metadata, MUST be included in the organisation's back-up regime. Back-up procedures are designed to provide sufficient up-to-date copies of business records to be used in the event of loss or corruption of all or part of the data.
 
Back-up regimes SHOULD be documented and back-up copies maintained to a level of security that will ensure the authenticity of the records used in recovery situations.
 
All system failures MUST be documented, and use of the back-up copies for restoration purposes MUST be accompanied by verification testing to ensure the integrity of the restored records.
 
Information technology professionals often use the term 'archiving' to describe back-up regimes. For recordkeeping purposes conducting back-ups does not constitute an archiving or preservation strategy, it is a business continuity or disaster recovery precaution.
 
Mandatory Requirements
2.4 Storage media and back-up procedures MUST be defined, documented and implemented.
2.4.1 Digitised records MUST be unalterable in all storage media.
2.4.2 Security and access controls for controlling storage media MUST be capable of detecting and logging unauthorised attempts at access.
2.4.3 All digitised records, and their associated metadata MUST be included in the organisation’s back-up regime.
2.4.4 All system failures MUST be documented.
2.4.5 The use of the back-up copies for restoration purposes MUST be accompanied by verification testing to ensure the integrity of the restored records.
 
3. Please note: This section has been designed not to be prescriptive, but to identify areas of critical importance for maintaining the authenticity and integrity of the digitised record. This enables organisations to adapt specific elements as required and to maximise the inheritance of data values from existing systems and equipment. It is designed to maximise automatic capture of metadata, not manual attribution.
 
Continuum logo Back to top

3.0. Disposal of All Records MUSTbe Authorised and Documented

3.1. Disposal of Source Records MUST be Auhtorised in Accordance with Relevant Legislation and Documented

Original source records cannot be disposed of without first meeting the requirements of any relevant statutes. This Standard sets out mandatory requirements that will satisfy the Chief Archivist that the requirements of the Public Records Act 2005 and Electronic Transactions Act 2002 have been met.
 
The full list of mandatory requirements that will ensure that a digitised record provides a reliable means of assuring the maintenance of the integrity of the source record are set out in Appendix 1.
 
Once the mandatory requirements are met and compliance has been signed-off by the Chief Executive using the certification form in Appendix 2, then public offices can implement disposal actions under the General Disposal Authority: Digitised Original Source Records. By meeting the mandatory requirements established in this standard, a local authority organisation can be confident that it meets the requirements of s 25(1) of the Electronic Transactions Act 2002, and may then legally dispose of the non-digital form of the protected record and retain the record in electronic form only.
 
All decisions to dispose of source records and disposal actions MUST be documented and this information MUST be accessible and be produced on request. Authorisation for destruction and the instance of destruction of the original source record SHOULD be documented in the metadata associated with the digitised record. Disposal actions SHOULD be documented and authorised by the relevant authority in an organisation.
 
Digitised records MUST be (re)producible in their original format on request. Any person who requests information from an organisation is not obliged to accept it in electronic form without their consent, in accordance with section 16 of the Electronic Transactions Act 2002.
 
General Disposal Authority: Digitised Original Source Records
Public offices should note that the General Disposal Authority: Digitised Original Source Records includes exclusions and appraisal considerations before records can be routinely destroyed. (See the accompanying General Disposal Authority: Digitised Original Source Records document for full details of relevant records classes).
 
Summary of Exclusions
After being digitised, the following categories of records will NOT be authorised for disposal under the General Disposal Authority: Digitised Original Source Records (see accompanying document for full details):

Statement of criteria for disposing of digitised original source records
The accompanying General Disposal Authority: Digitised Original Source Records outlines the following criteria that public offices MUST consider when disposing of any digitised original source records:

 
Mandatory Requirements
3.0 Disposal of all records MUST be authorised and documented.
3.1 Disposal of source records MUST be authorised in accordance with relevant legislation, and documented (this information MUST be accessible and be produced on request).
3.1.1 Certification of compliance against the requirements of this Standard MUST be authorised by the Chief Executive, before the disposal of original source records can be considered.
3.1.2 Digitised records MUST be (re)producible in their original format on request.
 

3.2. Disposal of the Digitised Records Incorporated into Buisness Systems MUST be Authorised and Documented

Where digitised records are incorporated into business systems as the official business record, they are regarded as the authentic 'official record' for evidential purposes. Digitised business records are subject to normal processes for disposal authorisations, established by the Chief Archivist under the Public Records Act 2005, in the same way as the original source records. A digitised record acting in place of an original source record MUST be retained for the authorised minimum period as established in the relevant disposal authority for the non-digital original source record. Once disposal is authorised, all extant copies of the digitised record SHOULD be destroyed (e.g. back-up copies). Authorisation of disposal of the digitised copy MUST be documented.
 
Disposal actions MUST be documented in the metadata associated with the record.
 
Mandatory Requirements
3.2 Disposal of the digitised record incorporated into the business system MUST be authorised and documented.
3.2.1 Digitised records acting in place of an original source record MUST be retained for the authorised minimum period as established in the relevant disposal authority for the original source record.
3.2.2
Disposal actions MUST be documented in the metadata associated with the record.
 
Continuum logo Back to top

4.0. Long-Term Management Systems, where Required, for both Source and Digitised Records, MUST be Documented and Implemented

4.1. Sound Management Systems Must be in Place for Originalk Source Records Until their Authorised Disposal within a Recordkeeping Framework

Source records MUST be managed appropriately until their authorised disposal within a recordkeeping framework. Where source records are retained for whatever reason other than quality control, or are not authorised for destruction, systematic controls MUST be applied. The digital image and the source record SHOULD be linked.
 
Where digitised records are incorporated into business systems and the source records are retained for reasons other than quality control checks, controls MUST be applied to their maintenance. The source records SHOULD be organised to maximise retrieval and to enable the efficient application of management and disposal processes. Day boxing, the process of accumulating source records in chronological order, or in their digitisation sequence, is not recommended. Day boxing is rarely suited to efficient management and disposal processes as it eliminates contextual linkages and mixes records intended for short-term retention with material subject to longer retention periods.
 
Digitisation projects rarely have the principle aim of enabling the destruction of the source record. After the digitisation process, source records SHOULD be returned to their original context and order, reflecting the processes of their creation and management in their original format. This enables the existing finding aids to continue to function as a retrieval tool for the records.
 
Mandatory Requirements
4.0 Long-term management systems, where required, for both source and digitised records, MUST be documented and implemented.
4.1 Sound management systems MUST be in place for original source records until their authorised disposal within a recordkeeping framework.
4.1.1 Where digitised records are incorporated into business systems and the source record retained for reasons other than quality control checks, systematic controls MUST be applied to their maintenance.
 

4.2. Migration and/or Preservation Strategies and Processes MUST be Defined, Documented and Implemented

Technology-dependent electronic records are inherently vulnerable to hardware, software and media obsolescence.
 
Digitised records SHOULD be included in the recordkeeping framework adopted by the organisation to support the continuing existence of records for as long as they are required.
 
A migration strategy aims to transfer the record (and its associated metadata) into subsequent generations of software, hardware and media, in ways that preserve the authenticity and integrity of the record, and enables it to continue to be used as an authorised record of business.
 
All migration strategies MUST identify which record objects and associated metadata are required to enable agencies to continue to access the authentic digitised record. The original metadata MUST be migrated into any system replacing the one in which the digital image was originally managed.
 
Any decision not to migrate a digitised record into subsequent generations of software and hardware is a disposal action, and as such MUST be supported by either:

A dedicated preservation environment, either within an organisation or within Archives New Zealand, is one which identifies and documents the content and context of the record, including its linkages to other records and an event history relevant to processes applied to and uses made of the record. Typically, such an environment applies strict procedural controls on the format of both the record and its associated metadata. The preservation environment aims to extend the existence of the record once it is no longer required for active conduct of the business which created or managed it. The preservation environment MUST support the retrieval of the record for as long as required.
 
Mandatory Requirements
4.2 Migration and/or preservation strategies and processes MUST be defined, documented and implemented.
4.2.1 All migration strategies MUST identify which record objects and associated metadata are required to enable organisations to continue to access the authentic digitised record.
4.2.2 The original metadata MUST be migrated into any system replacing the one in which the digital image was originally managed.
4.2.3 Any decision not to migrate a digitised record into subsequent generations of software/hardware is a disposal action and MUST be supported by either:

4.2.4 The preservation environment MUST support the retrieval of the record for as long as required.
 
Continuum logo Back to top

Appendix 1: Checklist of Mandatory Requirements

This checklist is a statement of mandatory requirements to meet the Archives New Zealand Digitisation Standard. Compliance with these requirements will satisfy the Chief Archivist that the requirements of the Public Records Act and Electronic Transactions Act have been met by public offices and local authorities. Organisations MUST also certify that these criteria have been met (see Appendix 2) before disposing of original source records. In order for public offices to use the General Disposal Authority: Digitised Original Source Records meeting and certifying that these requirements have been met is a mandatory pre-condition.
 

Standard Section Reference  Requirement No Yes
Preliminary Requirement The official record of action MUST be identified as the one which is used in the transaction of business.
NB: If the original source record is still the record of business action it cannot be disposed of. If the subsequent business action is taken on the digitised copy of the original source record it can be considered for disposal.
   
1.0 All digitisation and digitisation processes MUST be planned, scoped and documented.    
1.1 An appropriate digitisation approach MUST be selected, documented and implemented.    
1.1.1 Quality control and assurance processes MUST be implemented regardless of the digitisation approach adopted.    
1.2 Technical specifications aligned to the digitisation requirements MUST be selected, documented and implemented.    
1.3 Equipment and software aligned to the digitisation requirements MUST be implemented.    
1.3.1 Where destruction of the source record is contemplated, agencies MUST be able to assert confidence in the long-term viability of the images requiring on-going retention.    
1.3.2 During the digitisation process, the use of techniques that enhance the digitised image to make the image have a more exact resemblance to the original, or reproduce the source records' appearance more faithfully, MUST be documented.    
2.0 Systems to support management of the digital output of digitisation as records MUST be in place.    
2.0.1 The final output of the digitisation process MUST be regarded as the record for incorporation into the organisation's recordkeeping framework.    
2.1 Source record preparation guidelines MUST be documented and implemented.    
2.2 All digitised images MUST be assigned metadata to document digitising processes and to support ongoing business processes.    
2.2.1 Two types of metadata MUST be captured:
- Metadata specific to the particular image and the imaging process; and
- Metadata about the record, the business being transacted and the agents associated with the business.
   
2.2.2 In addition to the metadata inherited from recordkeeping capture and processes, or from indexing and searching metadata, image-level metadata MUST include:
- Unique digital image identifier;
- Date and time of digitisation;
- The name of the agent associated with the digitisation process;
- Capture device;
- Calibration settings; and
- Date of last calibration.
   
2.2.3 In business-process digitisation, records metadata MUST be incorporated into the organisation's electronic recordkeeping framework and be consistent with ISO 23081 Metadata for records, Parts 1 and 2.    
2.3 Quality assurance procedures MUST be defined, documented and implemented.    
2.3.1 Quality checking MUST be completed before the digitised images are accepted into a business process, or as a master copy in the case of digitisation projects.    
2.3.2 Quality checking MUST be complete before the destruction of the source records is considered.    
2.3.3 The results of quality assurance processes and quality checks MUST be documented.    
2.4 Storage media and back-up procedures MUST be defined, documented and implemented.    
2.4.1 Digitised records MUST be unalterable in all storage media.    
2.4.2 Security and access controls for controlling storage media MUST be capable of detecting and logging unauthorised attempts at access.    
2.4.3 All digitised records, and their associated metadata MUST be included in the organisation's back-up regime.    
2.4.4 All system failures MUST be documented.    
2.4.5 The use of the back-up copies for restoration purposes MUST be accompanied by verification testing to ensure the integrity of the restored records.    
3.0 Disposal of all records MUST be authorised and documented.    
3.1 Disposal of source records MUST be authorised in accordance with the relevant legislation, and documented (this information must be accessible and be produced on request).    
3.1.1 Certification of compliance against the requirements of this Standard MUST be authorised (See Appendix 2 for Compliance Certification). Public offices may then apply the General Disposal Authority: Digitised Original Source Records.

Local authorities may dispose of original source records having demonstrated compliance with the Electronic Transactions Act, 2002.

NB If all the mandatory requirements of this standard are not met an ad hoc disposal authority for public records MUST be sought from the Chief Archivist before any disposal action.

   
3.1.2 Digitised records MUST be (re)producible in their original format on request.    
3.2 Disposal of the digitised record incorporated into the business system MUST be authorised and documented.    
3.2.1 Digitised records acting in place of an original source record MUST be retained for the authorised minimum period as established in the relevant disposal authority for the original source record.    
3.2.2 Disposal actions MUST be documented in the metadata associated with the record.    
4.0 Long-term management systems, where required, for both source and digitised records, MUST be documented and implemented.    
4.1 Sound management systems MUST be in place for original source records until their authorised disposal within a recordkeeping framework.    
4.1.1 Where digitised records are incorporated into business systems and the source record retained for reasons other than quality control checks, systematic controls MUST be applied to their maintenance.    
4.2 Migration and/or preservation strategies and processes MUST be defined, documented and implemented.    
4.2.1 All migration strategies MUST identify which record objects and associated metadata are required to enable organisations to continue to access the authentic digitised record.    
4.2.2 The original metadata MUST be migrated into any system replacing the one in which the digital image was originally managed.    
4.2.3 Any decision not to migrate a digitised record into subsequent generations of software/hardware is a disposal action and MUST be supported by either:
- Authorisation of disposal of the record and its associated metadata in accordance with the Public Records Act 2005; or
- Transfer of the record and its associated metadata into a dedicated preservation environment.
   
4.2.4 The preservation environment MUST support the retrieval of the record for as long as required.    

 
Continuum logo Back to top

Appendix 2: Compliance Certification

This certificate of compliance MUST be signed by the Chief Executive of a public office prior to the disposal of original source records under the General Disposal Authority: Digitised Original Source Records issued by the Chief Archivist. The purpose of this certification is to enable public offices to qualify to implement the General Disposal Authority: Digitised Original Source Records.


For local authorities it enables them to show they have met the requirements of s 25(1) of the Electronic Transactions Act 2002. Local authorities may then dispose of their original source records.


It will also act as clear evidence of senior management responsibility for digitised records, to ensure that there is organisational confidence in the business systems incorporating digitised images. It is an acknowledgment of the complex management requirements necessary to ensure that public records or local authority protected records in digitised forms are maintained with integrity and authenticity for as long as required.
 

Certificate of Compliance with Archives New Zealand's Digitisation Standard
 
 
I, ........................................ , Chief Executive of
               (name)
 
.........................................
         (organisation)
confirm that the organisation has demonstrated to my satisfaction that the mandatory requirements of Archives New Zealand's Digitisation Standard as set out in Appendix 1 (summarised below) have been and can continue to be met.
 
Public office Chief Executives - I understand that meeting this Standard is a pre-condition for the organisation implementing the General Disposal Authority: Digitised Original Source Records issued by the Chief Archivist under s 20(1) of the Public Records Act 2005.
 
Local authority Chief Executives - I understand that meeting this Standard provides assurance that the organisation's digitisation procedures meet the requirements of s 25(1) of the Electronic Transactions Act 2002 and that the original paper or non-electronic format records may now be destroyed.

Summary of disposal requirements

(The full list of mandatory requirements to ensure that a digitised record provides a reliable means of assuring the maintenance of the integrity of the source record is set out in Archives New Zealand's Digitisation Standard: Appendix 1.)
 
Continuum logo Back to top

Appendix 3: Checklist of Best Practise Recommendations

This checklist is a statement of the best practice recommendations outlined in the Standard.
 

Standard Section Reference Best Practice Guidance No Partly Yes
Preliminary Requirement The rationale for digitising SHOULD be carefully aligned to a costed business case geared at improving the organisation's ability to carry out its functions.      
The business case SHOULD clearly outline the benefits and anticipated business or cost efficiencies.      
The business case SHOULD involve appropriate project budgets, resource commitments and be realistically costed.      
Derivative versions, where required, SHOULD be made during the digitisation process.      
Derivative versions, where required, SHOULD be made during the digitisation process.      
Master copies SHOULD be made available for the creation of subsequent derivative images, where necessary.      

 
 

Standard Section Reference Best Practice Guidance No Partly Yes
1. Planning and Processes The project documentation SHOULD include:
- Scope definition: with clear identification of business drivers, objectives, scale, size and constraints of the project;
     
- Statement of the purpose and expected uses of the digitised records, illustrated if necessary with examples;      
- Statement of benefits: clear identification of the benefits anticipated from the digitisation;      
- Statement of user needs and impacts;      
- Statement of technical standards adopted: including format, compression and metadata;      
- Equipment and resources to support the digitisation;      
- Processes for the planning, control and execution of the digitisation, including those undertaken prior to, during and after digitisation;      
- Quality assurance processes;      
- Strategies for integrating the digitised image into work processes to support the business action taking place; and      
- Strategies for the ongoing management of the digitised records for as long as they are required to be maintained.      
The digitisation approach SHOULD be regularly reviewed for continuing relevance and cost effectiveness.      
- The highest technical specifications that can be realistically supported SHOULD be incorporated into the digitisation process.      
- Formats SHOULD be open source, have published technical specifications available in the public domain, or be widely deployed within the relevant sector.      
- Formats SHOULD not contain embedded objects, or link out to external objects beyond the specific version of the format.      
- Formats SHOULD be supported by many software applications and operating systems.      
- Formats SHOULD be able to be read by utilising a readily-available viewing plug-in if the specific production software is not available to all users.      
- A body of accessible and product-independent technical expertise SHOULD be available to support the decision.      
- Adequate technical support SHOULD exist to enable ongoing maintenance and assurance of migration capability when necessary.      
- Master copies SHOULD be created to the highest technical standards achievable.      
- Master copies SHOULD be retained inviolable in secure storage.      
Where software that is in place to manage digital images after capture enables additions of annotations to images, these annotations SHOULD be managed as overlays that do not change the actual image.      
Printing of the image SHOULD be possible with or without the annotations.      
Readers available to users SHOULD support the display of the digital image in a manner, and to a quality, acceptable for the business being conducted.      
2. Management Systems Prior to digitisation, consideration of third party copyright or other constraints inherent in the record SHOULD be resolved.      
Where an electronic document and records management system is used as the image control system, it SHOULD be assessed for functionality against Archives New Zealand's Electronic Recordkeeping Standard.      
Where the digital image is to be used as a record in current or continuing business, the system governing the business process which will use the image SHOULD be integrated with records control systems.      
Where digitisation is undertaken for preservation purposes or primarily to enhance external access to information, care is exercised to ensure that the appropriate systematic controls are in place to manage the digital image. Consideration SHOULD be given to purchasing a management system to ensure appropriate control of processes such as identification, indexing, classification, security and access controls, rights management and preservation.      
Source record preparation guidelines SHOULD include:      
- An assessment of the source records’ capability to sustain a digitisation process;      
- Methods of digitising non-standard source records;      
- Quality checks to ensure against data loss in digitisation;      
- Methods for dealing with source records containing handwritten annotations, marginalia, white opaque paint, or highlighted areas;      
- Methods of distinguishing between original source records and photocopies;      
- Guidance on what types of material need not be digitised as they are only of ephemeral or short-term value;      
- Physical preparation for digitisation;      
- Processes for assigning links between associated documents to be regarded as a single item, so that the digitised image can faithfully represent the source record;      
- Processes for assigning links between the source record and the digitised copy;      
- Procedures to enable checking and verification that all target source records have been included in the digitisation process; an authorisation for destruction and the instance of destruction of the original source record SHOULD be documented in the metadata associated with the digitised record.      
Where possible, metadata attributed to, or associated with, images SHOULD be inherited from existing systems.      
The manual attribution or application of metadata SHOULD be minimised.      
Choices for storage of metadata SHOULD be made according to the principles outlined in ISO 23081 Metadata for records.      
Image-level metadata SHOULD be generated automatically at the point of digital capture direct from the digitisation equipment and SHOULD avoid manually-assigned data entry wherever possible.      
In business-process digitisation, wherever possible the metadata controlling the business process and the recordkeeping functions associated with the business process SHOULD govern, and be inherited by, the specific digital image.      
Particularly if intended for web access by external users, relevant elements from NZGLS metadata SHOULD be associated with each image.      
Quality assurance procedures SHOULD be documented and built into the ongoing operation of the digitisation process.      
Quality assurance procedures SHOULD, at minimum, address the following issues:      
- Any acceptable variations from normal procedures;      
- Scanner operation quality control;      
- Verification that digital output matches the quantity of original record input;      
- Extent and frequency of sampling of digitised images;      
- Criteria for checking image quality;      
- Frequency and criteria for checks on metadata;      
- Processes for re-digitising; and      
- Operator training.      
A review of quality procedures for digitising SHOULD be undertaken regularly to ensure that the procedures continue to meet business requirements.      
Appropriate training SHOULD be provided to all staff who create, manage or work with digitised records.      
Documentation on the level and the frequency of training provided to all levels of staff involved with digitisation SHOULD be created and maintained.      
Retrieval times implicit in offline storage SHOULD be acceptable for the business being conducted.      
Wherever possible digitised records sharing similar retention periods SHOULD be co-located to enable execution of destruction processes as required.      
Back-up regimes SHOULD be documented.      
Back-up copies SHOULD be maintained to a level of security that will ensure the authenticity of the records used in recovery situations.      
3. Disposal Processes Authorisation for destruction and the instance of destruction of the original source record SHOULD be documented in the metadata associated with the digitised record.      
Disposal actions SHOULD be documented and authorised by the relevant authority in an organisation.      
Once authorised for destruction, all extant copies of the digitised record SHOULD be destroyed (e.g. back-ups).      
4. Long-term Management The digital image and the source record SHOULD be linked.      
Where digitised records are incorporated into business systems and the source records are retained for reasons other than quality control checks, the source records SHOULD be organised to maximise retrieval and to enable efficient management of retention and disposal processes.      
In digitisation projects, source records SHOULD be returned to their original context and order after the digitisation process.      
Digitised records SHOULD be included in the framework adopted by the organisation to support the continuing existence of records for as long as they are required.      

 
 
See the following appendices for specific guidance on:

Continuum logo Back to top

Appendix 4: Issues to Consider when Assessing the Viability of Digitisation

This appendix is for guidance. It provides a set of questions to assist organisations in assessing the viability of digitisation for original source records. It relates to the 'Preliminary Considerations' section of the 'Requirements' element of the Standard.
 
Business-process digitisation

Digitisation projects

 
Continuum logo Back to top

Appendix 5: Recommended Technical Specifications

This appendix contains recommended technical specifications for digitisation. The recommendations are the least prescriptive commonly-used industry specifications – that is, public offices and local authorities can exceed these recommendations. These recommendations are not mandatory. That is, where justifiable, public offices and local authorities can adopt other technical standards. The primary consideration in adopting technical standards is to ensure the legibility of the digitised image. This appendix relates to Requirement 1.2: Technical specifications aligned to the digitisation requirements MUST be selected, documented and implemented.
 

Document Type Resolution Bit Depth File Format Compression
Text only, black and white Minimum 300ppi 1 bit (bi-tonal) TIFF
PDF/A4 containing TIFF or JPEG 20005
Lossless compression
Documents with watermarks, grey shading, grey graphics Minimum 600 ppi 8 bit greyscale TIFF
JPEG 2000
PDF/A containing TIFF or JPEG 2000
Lossless compression
Documents with discrete colour used in text or diagrams Minimum 600 ppi Minimum: 8 bit colour TIFF
JPEG 2000
PDF/A containing TIFF or JPEG 2000
Lossless compression
Black and white photographs Sufficient to provide >3000 pixels across long dimensions 8 bit greyscale TIFF
JPEG 2000
PDF/A containing TIFF or JPEG 2000
Lossless compression
Colour photographs Sufficient to provide >3000 pixels across long dimensions 24 bit colour TIFF
JPEG 2000
PDF/A containing TIFF or JPEG 2000
Lossless compression
Black and white negatives Sufficient to provide >3000 pixels across long dimensions 8 bit greyscale or 24 bit colour TIFF
JPEG 2000
PDF/A containing TIFF or JPEG 2000
Lossless compression
Colour negatives and transparencies Sufficient to provide >3000 pixels across long dimensions 24 bit colour TIFF
JPEG 2000
PDF/A containing TIFF or JPEG 2000
Lossless compression
Microforms See note below

 
Note: When scanning microforms, the approach should be to emulate the methods detailed above consistent with the source document on the (typically greyscale) microform – that is, to produce a minimum resolution of 600 ppi (in relation to the original document). However, this may vary for textual records to focus more on creating digital images with reasonable or good legibility. JPEG 2000 and PDF/A are recommended formats.
Recommended Technical Specifications Glossary
 

Area Issues
File formats Categories of file formats are:
 
Raster: also known as bit-mapped formats, where images take the form of a grid or matrix with each picture element (pixel) having a unique location and independent colour value. Examples are TIFF, JPG/JPEG, GIF, PNG;
 
Vector: also known as object oriented, based on a set of mathematical instructions typically used by drawing programs to construct an image – not of relevance to digitisation which will use raster formats; and
 
Encoding: or metafiles which may contain either vector or raster images. Such formats enable the contents to be consistently displayed and used across different computer programs and operating systems. Typically, they support internal metadata, support multi-page images and enable security management. Examples include Adobe PDF and TIFF.
Resolution A measure of the ability to capture detail in the original work, often quantified in pixels per inch (ppi). The optimum resolution depends on the nature of the documents being scanned. Photographs, for example, require much greater resolution than text-based documents.
 
ppi: (pixels per inch) is a measurement of resolution for computer display.
 
dpi: (dots per inch) is often used interchangeably with ppi, but actually refers specifically to measurement of the resolution for computer printers.
Colour resolution or bit depth A measure of the number of colours (or degree of brightness, in greyscale images) available to represent the colours (or shades of grey) in the original document. For example:
 
1 Bit, Black and White or line art: only black and white pixels;
 
Greyscale: black and white in addition to a range of intermediate greys, requiring 8 bits to describe each pixel;
 
8 Bit Colour: uses a palette of 256 colours;
 
24 Bit Colour: a resolution that enables storage of 8 bits of information describing the red, green and blue components of every pixel, thus enabling a much greater palette of colours; and
 
36-48 Bit RBG Colour: uses an extended colour space, creating a much larger file, and requiring storage in formats that explicitly support this colour depth (TIFF or PNG).
Compression Algorithms designed to reduce the size of the image for storage or transmission. Multiple options exist but decisions should be made on the characteristics of the document to be imaged. The two major categories of compression are:
 
Lossy: where information is removed from the stored information during the compression process; and
 
Lossless: where no information is irretrievably lost and where the decompressed object will always appear exactly the same as the original. Examples include LZW or ZIP lossless compression with TIFF files.
 
Newer forms of compression emerging are fractals and wavelets.
Colour management Means of attempting to ensure that the image looks the same across a range of different output devices. Monitors and printers typically use different colour spectrum. The standard for colour representation is the ICC colour management system, which uses a standardised and known 'colour space' based on the human eye and then compares all devices to the known standard.

 
Note:
4. PDF/A is a constrained version of PDF version 1.4 with various proprietary fonts and formats removed, issued as ISO 19005-1:2004.
5. JPEG 2000 is defined in ISO 15444-1:2000.
 
Continuum logo Back to top

Appendix 6: File-Naming Metadata Recommendations6

This appendix is for guidance. It outlines a number of best practices in relation to determining a file naming protocol, particularly for digital images. It relates to Requirement 2.2: All digitised images MUST be assigned metadata to document digitising processes and to support ongoing business processes.
 
A file naming scheme SHOULD be established prior to capture. The development of a file naming scheme should take into account whether the identifier requires machine or human-indexing (or both — in which case, the image may have multiple identifiers). File names can either be meaningful (such as the adoption of an existing identification scheme which correlates the digital file with the source material), or non-descriptive (such as a sequential numerical string). Meaningful file names contain metadata that is self-referencing; non-descriptive file names are associated with metadata stored elsewhere that serves to identify the file. In general, smaller-scale projects may design descriptive file names that facilitate browsing and retrieval; large-scale projects may use machine-generated names and rely on a database for sophisticated searching and retrieval of associated metadata.
 
In general, file names SHOULD:

Directory structure
Regardless of file name, files will likely be organised in some kind of file directory system that will link to metadata stored elsewhere in a database. Production master files might be stored separately from derivative files, or directories MAY have their own organisation independent of the image files, such as folders arranged by date or classification structure, or they MAY replicate the physical or logical organisation of the originals being scanned.
 
The files themselves can also be organised solely by directory structure and folders rather than embedding meaning in the file name. This approach generally works well for multi-page items. Images are uniquely identified and aggregated at the level of the logical object (i.e. a document, a record, a file/folder, etc.), which requires that the folders or directories be named descriptively. The file names of the individual images themselves are unique only within each directory, but not across directories. For example, book 0001 contains image files 001.tif, 002.tif, and 003.tif. Book 0002 contains image files 001.tif, 002.tif, and 003.tif. The danger with this approach is that if individual images are separated from their parent directory, they will be indistinguishable from images in a different directory.
 
Versions
For various reasons, a single scanned object may have multiple but differing versions associated with it (for example, the same image prepared for different output intents; versions with additional edits; layers, or alpha channels that are worth saving; versions scanned on different scanners, scanned from different original media, or scanned at different times by different scanner operators). Ideally, the description and intent of different versions should be reflected in the metadata; but, if the naming convention is consistent, distinguishing versions in the file name will allow for quick identification of a particular image. Like derivative files, this often implies the application of a qualifier to part of the file name. The reason to use qualifiers rather than entirely new names is to keep all versions associated with a logical object under the same identifier. An approach to naming versions should be well thought out; adding 001, 002 to the base file name to indicate different versions is an option; however, if 001 and 002 already denote page numbers, a different approach will be required.
 
Naming derivative files
The file naming scheme should also take into account the creation of derivative image files made from the production master files. In general, derivative file names are inherited from the production masters, usually with a qualifier added on to distinguish the role of the derivative from other files (i.e. "p" for published version, "t" for thumbnail). Derived files usually imply a change in image dimensions, image resolution, and/or file format from the production master. Derivative file names do not have to be descriptive as long as they can be linked back to the production master file.
 
For derivative files intended primarily for web display, one consideration for naming is that images may need to be cited by users in order to retrieve other higher-quality versions. If so, the derivative file name should contain enough descriptive or numerical meaning to allow for easy retrieval of the original or other digital versions.
 
Note:
6. This section is sourced from: US National Archives and Records Administration, 'Technical Guidelines for Digitizing Archival Materials for Electronic Access: Creation of Production Master Files - Raster Images', June 2004.
 
Continuum logo Back to top

Appendix 7: Quality Assurance Recommendations

7
This appendix is for guidance. It covers a number of areas relevant to quality assurance, including guidance on tests, standards and frequency of checking. It is intended as an aid to assist public offices and local authorities in devising their own quality assurance processes. It relates to Requirement 2.3: Quality assurance procedures MUST be defined, documented and implemented.
 
Scanner Operation Quality Control
Scanners SHOULD be tested periodically to monitor their operational performance and check that operating performances are within agreed tolerances as set out by existing standards8 and benchmarks. Results of previous tests SHOULD be used as benchmarks for system performance over time.
 
Simple preventive measures such as ensuring the cleanliness and routine servicing of equipment SHOULD be implemented.
 
Validation of Output
The equipment SHOULD routinely record the number of discrete documents and the number of documents comprising a record (more than one page bundled) that were scanned during a session.
 
Sampling
The frequency of sampling should be determined according to the system usage and expected or anticipated deterioration periods. Advice from the system vendor may assist in determining the frequency period. Initially, it MAY be appropriate to scan a test target every few thousand pages. However, once benchmarks have been established and equipment and processes stabilised, this MAY be reduced to a random sampling of between 5 and 10%.
 
Sample sets
Assemble a sample set of source documents for the purposes of evaluating scanner results against agreed quality criteria. These documents SHOULD be representative of the records to be scanned and SHOULD include examples of source documents whose quality is poor relative to the majority of the sample documents.
 
Test target sheets and detailed tests included in ISO 12653, Parts 1 and 2, MAY be used for black and white scanning.
 
Where digitisation involves colour as an intrinsic part of the record, implementers should consider including a standard colour sheet with the image (such target sheets are commercially available from manufacturers such as Fuji or Kodak, and comply with the ISO 12641 standard). Colour calibration, matching and profiling SHOULD be monitored where colour scanning is used.
 
Quality Criteria for images
Quality criteria for images SHOULD include consideration of overall legibility:

Metadata
Procedures SHOULD specify the checks to be implemented to assess metadata quality assigned to images9.
 
Issues which may be considered in the quality checking of metadata are:

Of particular importance are:

Documentation
Quality control data (such as logs, reports, decisions) SHOULD be captured in a formal system and SHOULD become an integral part of the image metadata at the file or the project level. This data may have long-term value that could have an impact on future preservation decisions.
 
Processes for Re-digitisation
If more than 1% of the total number of images and associated metadata examined in a randomly selected sampling are found to be defective for any of the reasons listed above, the entire output since the last quality check SHOULD be re-inspected. Any specific errors found in the random sampling and any additional errors found in the re-inspection SHOULD be corrected. If less than 1% of the batch is found to be defective, then only the specific defective images and metadata that are found should be redone.
 
Common Faults
Broadly speaking, quality faults can be categorised as 'implementation faults', 'process faults' or 'operator faults'. Implementation faults are those that can be avoided, providing appropriate procedural controls are in place to guide the digitisation. Process faults are normally out of the control of the operator and need to be addressed by a supervisor to the process. Operator faults are the day-to-day faults that are made by the operator as they work.
 
Implementation Faults
There are a number of faults that can be avoided with appropriate specification of procedures to guide the implementation. These include:

Process Faults
There are a wide variety of process faults that can be caused by many problems within the workflow. These problems can include:

Operator Faults
These faults are caused by some form of operator error within the workflow and can include:

7. Much of the material in this section derives from ISO/TR 15801: 2004, Electronic imaging – Information stored electronically – Recommendations for trustworthiness and reliability, and Technical Advisory Service for Images (TASI), In-depth Report: 'Quality Assurance'. http://www.tasi.ac.uk/advice/creating/qassurance.html
8. ISO 12653 – 1: 2000, Electronic imaging – Test targets for the black and white scanning of office documents, Part 1 – characteristics;
ISO 12653-2: 2000, Electronic imaging - Test targets for black and white scanning of office documents, Part 2 – methods of use; and
ISO/TR 15801: 2004, Electronic imaging – Information stored electronically – Recommendations for trustworthiness and reliability.
9. ISO/TS 23081-2:2007 Information and documentation — Records management processes — Metadata for records — Part 2: Conceptual and implementation issues
 
Continuum logo Back to top

Appendix 8: Recommended Staff Skill Sets

This appendix is for guidance. It outlines the types of skill areas and tasks that staff engaged in digitisation should be expected to possess. It is included as an aid to public offices and local authorities in meeting Requirement 2.3: Quality assurance procedures MUST be defined, documented and implemented.
 
Staff training to support digitisation SHOULD include the following areas, and be provided to all levels of skill expected within the organisation:
 

Skill area Tasks
Management - Assessing the business case for digitisation
- Negotiating purchase, ongoing service and maintenance of equipment and supplies
Business Analysis - Defining the workflow for the digitisation process
- Defining the integration of digitised records into existing business systems/workflows
- Selecting image format
- Determining image enhancement requirements
- Identifying information architecture for business-process support
Systems Analysis - Selection of scanner hardware
- Defining storage requirements
- Integration of computer hardware, imaging equipment and software
- Integration of digitisation requirements into existing organisational IT infrastructure
- Compliance with national and organisational IT standards
- Testing of configurations
- Ongoing support of digitisation equipment (where necessary)
Recordkeeping - Definition of policies and procedures to ensure authenticity and integrity of digital images
- Ensuring legislative compliance
- Integration with organisational records and business systems
- Integration with existing classification and disposal regimes
- Defining file naming conventions
- Defining and implementing disposal process
- Defining metadata
- Monitoring of quality of metadata
- Managing the source records after digitisation
Equipment Operations - Operating scanners
- Applying any defined selection criteria
- Carrying out quality checking on digital records
- Adding metadata to digital records

 
Continuum logo Back to top

Appendix 9: Further Resources

This appendix is included for guidance. A plethora of advice and standards exist for further reference; however, much of the information is specific to particular types of digitisation – either project-based digitisation for archival quality, or vendor-driven commercial advice. The context of the advice should always be indicated within the project methodology documentation.
 
The resources listed here represent those most helpful in the development of this Standard, or assessed as useful for public offices' and local authorities' reference.
 
ISO 12641: 1997, Graphic technology – Prepress digital data exchange. Colour targets for input scanner calibration.
 
ISO 12653 – 1: 2000, Electronic imaging – Test targets for the black and white scanning of office documents, Part 1 – characteristics.
 
ISO 12653-2: 2000, Electronic imaging - Test targets for black and white scanning of office documents, Part 2 – methods of use.
 
ISO/IEC 15444-1: 2000, Information technology -- JPEG 2000 image coding system -- Part 1: Core coding system [using lossy compression].
 
ISO 15489-1: 2001, Information and documentation -- Records management -- Part 1: General.
 
ISO/TR 15489-2: 2001, Information and documentation -- Records management -- Part 2: Guidelines.
 
ISO/TR 15801: 2004, Electronic Imaging – Information stored electronically – Recommendations for trustworthiness and reliability.
 
ISO/DIS 19005-1: 2004, Document management – Electronic document file format for long-term preservation – Part 1: Use of PDF 1.4 (PDF/A-1).
 
ISO/TS 23081-2:2007 Information and documentation — Records management processes — Metadata for records — Part 2: Conceptual and implementation issues.
 
PDF Tools AG, PDF/A – 'The Basics' – Understanding PDF White Papers, v. 1. February 7, 2006. http://www.pdf-tools.com/public/downloads/whitepapers/whitepaper-pdfa.pdf
 
Queensland State Archives. 'Guidelines for the Digitisation of Paper Records', v. 1, May 2005. http://www.archives.qld.gov.au/publications/digitisation/digit1.pdf
 
Queensland State Archives, 'Digitisation Disposal Authority' April 2006. http://www.archives.qld.gov.au/publications/digitisation/QSA6814%20Dispo...
 
New Zealand State Services Commission, 'Trusted Computing and Digital Rights Management Principles and Policies', September 2006. http://www.e.govt.nz/policy/tc-and-drm/principles-policies-06/tc-drm-090...
 
Technical Advisory Service for Images (TASI), In-depth Report: 'Quality Assurance'. http://www.tasi.ac.uk/advice/creating/qassurance.html
 
Technical Advisory Service for Images (TASI). http://www.tasi.ac.uk
 
United States National Archives and Records Administration, 'Technical Guidelines for Digitizing Archival Materials for Electronic Access: Creation of Production Master Files – Raster Images', June 2004. http://www.archives.gov/research/arc/digitizing-archival-materials.pdf
 
 
Issued January 2007
 
 
Continuum logo Back to top