41
Sensitive:NSW-Government-DLM-ONLY © State of New South Wales 2016 Page 1 of 37 Request for Quote PART B: STATEMENT OF REQUIREMENTS Supply Panel for the Digitisation of Heritage Materials Category 1 Printed Materials RFP Number: Location: Annexures A - Project Status Report B - Quality Assurance Certificate C - Acceptance Criteria/Quality Assurance Report D - Collection Metadata Spreadsheets E Sharpness Comparison Text file .pdf .pdf .pdf .csv .jpg

Request for Quote PART B: STATEMENT OF REQUIREMENTS · 2020. 8. 26. · Materials collection, including books, serials, journals, magazines, pamphlets, catalogues and ephemera. Inventory

  • Upload
    others

  • View
    0

  • Download
    0

Embed Size (px)

Citation preview

  • Sensitive:NSW-Government-DLM-ONLY

    © State of New South Wales – 2016 Page 1 of 37

    Request for Quote

    PART B:

    STATEMENT OF REQUIREMENTS

    Supply Panel for the Digitisation of Heritage Materials

    Category 1 – Printed Materials

    RFP Number:

    Location:

    Annexures

    A - Project Status Report B - Quality Assurance Certificate C - Acceptance Criteria/Quality Assurance Report D - Collection Metadata Spreadsheets E – Sharpness Comparison Text file

    .pdf

    .pdf

    .pdf

    .csv

    .jpg

  • Sensitive:NSW-Government-DLM-ONLY

    © State of New South Wales – 2016 Page 2 of 37

    1. DOCUMENT CONTROL

    Document Version:

    TRIM File No:

    Date:

    Status:

    Confidentiality:

    Name Department & Job Title Date Approved Approval Details

  • Sensitive:NSW-Government-DLM-ONLY

    © State of New South Wales – 2016 Page 3 of 37

    Table of Contents

    1. TECHNICAL EVALUATION.................................................................................................................... 4 Overview ............................................................................................................................................ 4 Objectives .......................................................................................................................................... 4 Purpose ............................................................................................................................................. 4 Project Scope ................................................................................................................................... 4 Collection Material types ................................................................................................................. 4 Inventory ............................................................................................................................................ 4 Location and Environment .............................................................................................................. 5 Review of Premises ......................................................................................................................... 5 Data Security and Management .................................................................................................... 5

    Work Health and Safety .................................................................................................................. 5 Environmental Management .......................................................................................................... 5 Hardware and Software .................................................................................................................. 5 Project Management ....................................................................................................................... 5

    2. CARE AND HANDLING OF ORIGINAL MATERIALS ......................................................................... 6 Delivery of the Collection Materials ............................................................................................... 6 Inventory Requirements .................................................................................................................. 6 Collection Handling Requirements ................................................................................................ 6 Handling of USB/Flash Hard drives............................................................................................... 6 Storage .............................................................................................................................................. 6 Bound and Fragile Material ............................................................................................................ 7 Return of Collection Materials and Deliverables ......................................................................... 7 Boxing and Transportation.............................................................................................................. 7 Condition Reports on Return .......................................................................................................... 7

    3. SPECIFICATIONS ................................................................................................................................... 8 Extent of Capture ............................................................................................................................. 8 Protective Covers ............................................................................................................................. 8 Image Resolution ............................................................................................................................. 8 Image Sharpness ............................................................................................................................. 9 Colour Management ...................................................................................................................... 10 Archival Master file specification .................................................................................................. 11 Cropping and De–skewing of Archival Masters ......................................................................... 11 Title and Resolution Scans ........................................................................................................... 14 Checksum Generation ................................................................................................................... 18

    4. METADATA ............................................................................................................................................ 20 External Metadata (CSV) .............................................................................................................. 20 Embedded Metadata ..................................................................................................................... 22 TIFF Tags ........................................................................................................................................ 26

    5. QUALITY ASSURANCE ....................................................................................................................... 29 6. FILE NAMING AND FOLDER STRUCTURES ................................................................................... 30

    File and Folder Naming Schema ................................................................................................. 30 File Directory Structure – Archival Master Deliverables ........................................................... 30 File Naming and Digital ID Schema ............................................................................................ 32

    7. SERVICE LEVELS ................................................................................................................................. 33 Services, responsibility and measure ......................................................................................... 33

    8. DELIVERABLES .................................................................................................................................... 35 Matrix of deliverables mapped against each service: ............................................................... 35

    9. QUALITY ASSURANCE AND ACCEPTANCE OF DELIVERABLES ............................................. 36 Quality Assurance Process .......................................................................................................... 36 Technical Evaluation Assessment Criteria ................................................................................. 36

    10. DEFINITIONS AND ACRONYMS ........................................................................................................ 38 11. STANDARDS REFERENCES & VERSION REQUIREMENTS ...................................................... 40 12. LIST OF ANNEXURES .......................................................................................................................... 41

    Annexure A – Project Status Report for completion by the Contractor .............................................. 41 Annexure B – Quality Assurance Report for completion by the Contractor ...................................... 41 Annexure C – Deliverables Acceptance Criteria for completion by the State Library of NSW ....... 41 Annexure D – Collection Metadata Spreadsheets for completion by the Contractor ...................... 41 Annexure E – Sharpness Comparison Text File.jpg ............................................................................. 41

  • © State Library of New South Wales – 2016 Page 4 of 36

    2. TECHNICAL EVALUATION

    Overview

    This document sets out the minimum requirements of the State Library of New South Wales (SLNSW) in relation to the Technical Evaluation. The Respondent must indicate in Returnable Schedule 1 of their response the extent to which it complies with the requirements set out in this document and Annexures.

    Objectives

    The SLNSW is seeking to identify suitably qualified and experienced companies to provide digitisation services (including equipment, personnel and expertise) to digitise a selection of the SLNSW Printed Materials Collection.

    Contractors short-listed from the desktop evaluation will be given a test batch of items from the Printed Materials collection.

    The Contractor will be required to digitise all titles in the test batch, deliver Archival Master files as well as any administrative and technical metadata requested.

    Purpose

    The purpose of digitising a test batch as part of the evaluation process is to mitigate the risk to the State Library of a supplier being accepted onto the digitisation panel based purely on a well written Proposal response as opposed to actual demonstrated capability in being able to perform the required services to the standards required by the SLNSW.

    Project Scope

    The Services sought from the Contractor include:

    Digitisation of the Test Batch of collection materials to create Archival Masters. (as detailed in Section 3.6)

    Addition of specific metadata to Archival Masters. (Section 4.2)

    Generation of checksum data for each digital file created. (Section 3.9)

    Supply of specific administrative metadata. (Section 4.1.2)

    Quality assurance of the Deliverables produced from the services described in sections. (Section 5)

    File transfer of the Deliverables and packing/return of Collection Materials (originals and digitised files). (Section 2)

    Retention of a copy of the digitised files until the integrity and quality acceptance of the transferred files has been confirmed by State Library. (Section 7.1.6)

    Collection Material types

    The Collections Materials identified for this Technical Evaluation consist of items from the Printed Materials collection, including books, serials, journals, magazines, pamphlets, catalogues and ephemera.

    Inventory

    A detailed inventory will be provided to the Contractor with the shipment of Collection Materials.

  • © State Library of New South Wales – 2016 Page 5 of 36

    Location and Environment

    The Contractor’s premises where the digitisation work is to be undertaken must, as a minimum, be in a building that is secure, well maintained, regularly cleaned and fitted with a fire protection system.

    No smoking, food or drink should be allowed in the work or storage area. The work and storage areas must be lockable, dust free, cool and dry with no extreme fluctuations of temperature or humidity. The Collections Materials must be protected from direct sunlight.

    The Contractor must take all reasonable care and prudence in all matters which may touch upon the good care of State Library’s Collection Materials while in the Contractor’s possession.

    Review of Premises

    Short-listed Contractors who are successful in progressing to the Technical Evaluation stage will have an inspection of their production facilities carried out by SLNSW technical staff before the Test Batch is delivered for the Technical Evaluation.

    Data Security and Management

    The State Library prefers that the Contractor has current certification for:

    ISO 9001 Quality Management Systems and

    ISO 27001 Information Security Management.

    Work Health and Safety

    The successful Respondent must conduct its own assessments & investigations regarding the work Health and Safety of all persons affected by the services delivered under this project. All equipment used must be in safe and reliable condition and meet all relevant safety requirements, regulations and standards.

    The successful Respondent will be required to comply with Work Health and Safety Act 2011, any related regulations and codes of practice and any directions, policies and guidelines given by the Library.

    The Libraries Work Health and Safety Policy can be downloaded from:

    www.sl.nsw.gov.au/about/policies/docs/work_health_safety_policy_2012.pdf

    Environmental Management

    The Library is committed to procuring sustainable products, works and services where possible. The successful Respondent will be required to comply with any directions, policies and guidelines given by the Library. If requested by the Library, the successful Respondent must submit evidence of its environmental management system.

    Hardware and Software

    The Contractor must maintain its hardware and be able to show a record of this. The Contractor may be required to report in writing to the Library any intended hardware and software upgrades prior to its implementation.

    Project Management

    The Contractor must provide a single point of contact for managing all operational aspects during the Technical Evaluation phase. This person will act as the Contractor’s Project Manager, and must have appropriate authority to make day-to-day project related decisions.

    http://www.sl.nsw.gov.au/about/policies/docs/work_health_safety_policy_2012.pdf

  • © State Library of New South Wales – 2016 Page 6 of 36

    3. CARE AND HANDLING OF ORIGINAL MATERIALS

    Delivery of the Collection Materials

    Test batch materials will be hand delivered to the short-listed Contractors following the site visit by

    State Library technical staff. If required, materials may be couriered to the candidate by the SLNSW’s

    preferred courier.

    Inventory Requirements

    Upon receipt of the collection materials, the contractor must:

    Check that no damage has occurred to the packaging or the items during transport. Any damage must be reported immediately to the SLNSW Project Manager.

    Confirm to the SLNSW Project Manager that the material received matches the inventory provided by the SLNSW.

    Collection Handling Requirements

    The contractor must handle the collection material with care at all times. If any issues arise during the handling, for example, pages become loose in a volume, please contact the SLNSW’s Project Manager for advice.

    The following guidelines apply:

    Handle with clean hands at all times.

    Keep food and drink away from collection material.

    Maintain clean scanning equipment and work surfaces.

    Items should not be defaced or damaged by the use of pens, stamps, liquid paper, sticky tape, and cleaning products nor torn/cut out of volumes.

    Avoid using metal paper clips, adhesives notes, rubber bands, Blu-Tac or acidic papers.

    Use acid free paper for flagging material if needed.

    Protect items from direct sunlight and extremes of temperature and humidity.

    Handling of USB/Flash Hard drives

    USB/flash hard drives are typically damaged because of rough handling, shock and vibration, or electrostatic discharge (ESD). To avoid any damage during shipment, it is important that USB/flash hard drives are handled and packaged carefully.

    While working with the USB/flash hard drives:

    Avoid extreme temperatures and humidity

    Do not stack USB/flash hard drives

    Do not place anything on top of a USB/flash hard drive

    Do not drop, bump or shake USB/flash hard drives.

    Storage

    Collection items must be stored securely while on the contractor’s premises. Secure storage includes:

    While not in use, material must be stored in a room where access is limited to contractor staff only.

    Storage facilities must have appropriate fire suppression systems.

    The contractor must have a disaster management strategy in place.

    A stable temperature and humidity range that is appropriate for the collection material type. For paper materials this is 23°C ±2°C and 50% ±10% relative humidity. Please note that materials stored within packaging material will maintain stable conditions more easily.

  • © State Library of New South Wales – 2016 Page 7 of 36

    Bound and Fragile Material

    If the collection material is bound or fragile and difficulties arise with scanning, due to issues such as foldouts, uncut pages or the binding being too tight to allow the appropriate capture of information, please cease work on the specific item and consult with the SLNSW Project Manager for further instruction.

    Return of Collection Materials and Deliverables

    The Contractor must advise the State Library when it has finished digitising the Collection Materials. The SLNSW will provide instructions for packing and transportation of the Collections Materials. The Contractor must ensure that the Collections Materials are repacked for transport according to the instructions provided by the SLNSW and the delivery report is signed and included.

    Packages/boxes must be clearly labelled with the SLNSW name and delivery address:

    State Library of New South Wales Loading Dock, Hospital Road Sydney NSW 2000

    The SLNSW will coordinate the return of the Collection Materials via its preferred courier.

    The Deliverables must be delivered using a USB/Flash Hard Drive/s, The SLNSW will provide a suitable USB/Flash Hard drive/s for this purpose. The USB/Flash Hard drive/s must be packaged for return, according to SLNSW instructions, to ensure that they are not damaged in transit.

    Proposals by Contractors to use other delivery methods e.g. FTP will be assessed and considered.

    Boxing and Transportation

    The Library will cover all costs associated with the shipping of Collection or project item/s including any crating, packing, transportation.

    Appropriate transport arrangements must be agreed between parties for all collections.

    The collection material should be transported via courier or hand delivered.

    Retain all packing material and pack for return according to packing instructions in same

    packing box. Ensure material is packaged securely in the boxes and is surrounded by

    appropriate packing material to ensure there is no movement during transit.

    Condition Reports on Return

    Upon return of the Collection Materials, the SLNSW will check:

    that no damage has occurred to the Collection Materials,

    that no damage has occurred to the packing cases during transport,

    the inventory of the Collection Materials to ensure that all items are accounted for, including all required documentation,

    the operation and contents of the USB/Flash Hard drive/s, and

    the integrity of the digitised files on the USB/Flash Hard drive/s.

  • © State Library of New South Wales – 2016 Page 8 of 36

    4. SPECIFICATIONS

    Extent of Capture

    The SLNSW has set the following guidelines and requirements for image capture:

    Volumes (i.e. books, serials, magazines, journals, catalogues, pamphlets etc.) are to be digitised as single individual pages unless the opposing pages are designed to be viewed as a whole i.e. double pages spreads or centrefolds

    Images are to be captured to show the edges of the page

    A single Title Scan is required at the beginning of all sequences of digitised images (See Section 3.8.1)

    Title and Resolution scan/s (See Section 3.8) are to be captured at the start of scanning of any volume

    Each page will be saved as an individual Archival Master file (in TIFF 6.0 format)

    Covers, endpapers and blank/spacer pages are also to be digitised as Archival Masters

    The order of pages delivered will always follow the original sequence of pages in the volume

    Foldouts, inserts, etc. that need to be digitised on other devices must be reinserted in final sequence of Archival Master files set, prior to any further workflow, creation of derivatives or subsequent processing. All subsequent action e.g. OCR, must only occur after the volume has been fully digitised with all individual components assembled in the correct sequence

    Images must represent page orientation as present in original volume, i.e. vertical orientation of images and diagrams in original volumes must be maintained

    Blank pages are those which have no printed or handwritten text, stamps or watermarks

    Blank pages are to be captured where required, to ensure that page order (i.e. left hand, right hand pages) sequences are maintained

    Multiple or long runs of blank pages are not to be captured

    File naming and folder structure to follow SLNSW schema (See Section 6)

    Devices used for digitisation should be colour profiled and synchronised to record accurate date and time.

    Important Notes:

    Contractors should note that a substantial proportion of the Library’s printed material collection is unsuitable to be digitised on flatbed scanner or on devices that press the volume flat under glass. In many cases digitisation will require devices with V-shaped cradles

    It is the SLNSW’s overall preference to show full edges when capturing pages. Scanning devices which use clips/clamps etc. to hold pages flat may obscure the page edge and detract from the authentic representation of the physical object.

    The decision whether to allow the use of devices with clips (and subsequent cropping of page edges) will be determined and specified for each collection type that is offered for digitisation.

    Protective Covers

    Protective covers (i.e. Mylar book covers), where present, are not to be removed and are to be captured as part of the digitising process. When capturing images to show the edges of pages the mylar is not to be considered as the edge. Only the actual book covers or pages should to be used as a guide for this requirement i.e. cropped to show a 3-5 mm border around the edge of the page or cover

    Image Resolution

    Pages are to be scanned at 400 ppi as a minimum (optical resolution). Up-sampling or interpolation is unacceptable.

    Oversize volumes, foldout maps etc. must be captured to clearly resolve the smallest meaningful element on the page. The Library’s preference is that they should also be captured at 400ppi minimum. SLNSW will consider variation from this on a case by case basis.

  • © State Library of New South Wales – 2016 Page 9 of 36

    The Contractor will need to contact the Library for instructions when encountering unusual or non- standard page layouts or oversized volumes or foldouts.

    Image Sharpness

    It is expected that all digitised material will be in focus and fit for purpose. It is expected that the digitised files will be used as a basis for the future production of derivatives such as searchable text created by automated Optical Character Recognition (OCR) technology, text and images delivered by web based technologies and potently print based products.

    To achieve the SLNSW expectation for focus - Annexure E, a sharpness file called Sharpness Comparison Text File.jpg should be referenced.

  • © State Library of New South Wales – 2016 Page 10 of 36

    Colour Management

    The SLNSW uses the QP 101 card to enable colour management of digital files created when copying physical collection materials. The card also provides scale and size information.

    The QP101 card is to be used at the beginning of digitising a volume/sequence of pages. It is to be placed on the focus chart when capturing the Resolution Scan (Section 3.8.2)

    See figure below:

    Image 9: Example of QP101 Card

    The RGB values are as follows:

    dark grey patch is 68 (+/- 10),

    mid grey patch is 115 ( +/- 10) and the

    white patch is 225 (+/- 10)

    The values must be equal (neutral) for each RGB colour channel i.e. 115/115/115

  • © State Library of New South Wales – 2016 Page 11 of 36

    Archival Master file specification

    Each cover, end paper, title page, text page and foldout scanned will be saved as an individual Archival Master file.

    Technical specifications for Archival Master files are:

    24 bit full colour, uncompressed TIFF 6.0 files

    Colour space: Adobe RGB (1998)

    Fidelity to colour and tonality of original item is essential

    Volumes are to be digitised on a pale neutral background. This should be visible around page edges

    All images must show edges of pages (this is also required for covers, foldouts etc. that are digitised on different devices), de-skewed

    Metadata must be embedded in TIFF file headers, as specified in Sections 4.2 and 4.3

    Cropping and De–skewing of Archival Masters

    The cropping of the Archival Masters must create an image that is:

    Cropped to show a 3-5 mm border around the edge of the page or cover

    Care is to be taken so that cropping and de-skewing produce an image with an evenly spaced border

    The edge of page adjacent to gutter/spine is to be closely cropped but still showing line of the spine

    See Image Figures 1 and 2 for visual examples of cropping

    Where required the Archival Master is to be de-skewed so that lines of text are within +/- 10 of horizontal

    If the text is not printed square to the page edge then priority should be given to aligning the page edge rather than the text within the page.

    .

    Image Figure 1: Example of the cropped Archival Master file from a book, showing page edge, width of border, spine and neutral background

  • © State Library of New South Wales – 2016 Page 12 of 36

    Image Figure 2: Example of the cropped Archival Master file from a Pamphlet The Master file is to be cropped to just outside the page edge.

    Image Figure 3: Example of the cropped Archival Master file from a Pamphlet

  • © State Library of New South Wales – 2016 Page 13 of 36

    Image Figure 4: Page corner detail of cropped Archive Master showing page edge, border proportions and background sheet colour

  • © State Library of New South Wales – 2016 Page 14 of 36

    Title, Resolution & Foldout Scans (Reference Scans)

    The first digital image in every set of scanned page images, whether a complete physical volume or a subdivision of a physical volume folder, will be a Title Scan.

    The next two images will be Resolution Scans, one for the Left side then one for the Right side. These scanned images allow the Library to identify the volume and evaluate scanner performance, focus, and exposure and image clarity for that scanning session.

    When a Foldout is scanned there will be two tifs generated:

    1. Foldout + QP card and

    2. Cropped Foldout

    These files will also be known as Reference Files and should be named with an ‘r’ in the file name to separate them from the Master Files.

    Title Scan

    A single Title Scan is required at the beginning of all sequences of images followed by resolution scans from both cameras.

    The Title Scan must include the following elements:

    A clearly typed text block showing:

    Call Number

    Match Point

    Digital ID as per SLNSW schema

    Title of volume/book

    Date of digitisation

    Service Provider Name

    SLNSW Notes (if any)

    SLNSW & Digital Excellence Program Logo (will be supplied)

    A Title Scan will always be the first image in a set. It should be named as in the following: Digital ID_4 Digit Sequence Number_Type (r= reference).File Extension

    Example: b08851_0001_r.tif

    Please see 6.3 File Naming and Digital ID Schema for more detail.

  • © State Library of New South Wales – 2016 Page 15 of 36

    Image Figure 5: Example of Title Scan

    Resolution Scans

    The next two images captured when digitising a volume will be Resolution Scans, one for the Left side then one for the Right side. These scanned images allow the Library to evaluate scanner performance, focus, and exposure and image clarity for that scanning session.

    The Resolution Scans must be carried out for each camera/scanning device used and needs to

  • © State Library of New South Wales – 2016 Page 16 of 36

    include the following elements:

    An indication of which camera is used (i.e. a tag indicating RHS or LHS, as appropriate, must be included in the scan)

    A scan of the Association of Information and Image Management (AIIM) Scanner Test Chart #2, AIIM X441 (or equivalent approved by SLNSW)

    A QP101 grey scale card (or equivalent approved by SLNSW)

    A ruler or scale (if not included in the QP101 card) The Resolution Scan will always be the second and third images in a set. They should be named as in the following examples:

    1. b08851_0002_r.tif 2. b08851_0003_r.tif

    Image Figure 6. Resolution Scan LHS Camera Image Figure 7. Resolution Scan RHS Camera

    Foldout Scans

    Any page or foldout page that is digitised on a device different to the regular page images will be a Foldout Scan.

    A Foldout Scan must be digitised in a colour managed workflow such that it maintains fidelity to the colour and tonality of original item.

    When a Foldout is identified during scanning a place-holder should be inserted in the page sequence to indicate the location of the foldout.

    Once the Foldout is scanned two files need to be created and merged back into the sequence to replace the place-holder scan. These files are:

    1. The Foldout with a QP card (a Reference file) 2. A cropped version of the Foldout (a Master file)

  • © State Library of New South Wales – 2016 Page 17 of 36

    These files need to be named using the same digital ID and sequence number but with a unique letter

    as outlined in 6.3 File Naming and Digital ID Schema. For example if page 21 in a book sequence was

    a Foldout then the two Foldout scans created would be named:

    1. b08851_0021_r.tif (a Reference file) 2. b08851_0021_m.tif (a Master file)

    Example of a Foldout with QP card Named with _r: b08851_0021_r.tif

    Reference File

    Example of a cropped Foldout named with _m: b08851_0021_m.tif Master File

  • © State Library of New South Wales – 2016 Page 18 of 36

    Checksum Generation

    The Library requires a Checksum Manifest to be provided in the root directory of each delivery returned to the Library on the external hard disk drive. The Checksum Manifest is a structured listing of checksum information for all the files in the delivery.

    The Checksum Manifest must:

    be a CSV

    list the full path and file name for each file in the batch,

    list the checksum type (SHA-1),

    list the checksum value for each file

    list the name and version of the software used to create the checksum.

    Checksum Manifest file naming

    It is expected that the checksum manifest will be generated to match the delivery of files and will be given an appropriate file name:

    Components of a Checksum Manifest filename: Example:

    Project Name 3 letter code _PMA or DSM or PAM

    Vendor Name 4 letter code _DCIT

    Reverse Date YYYYMMDD 20160610 ( at 10/06/2016 for example)

    Sequential Delivery

    Number

    3 digit number _001 (then 002 etc to match the delivery number)

    Content Type Checksum _checksum

    File Extension .CSV, .TXT .csv

    Example of a Checksum Manifest file name: DSM_ DCIT_20160610_001_checksum.csv

    Instructions for specific file naming of the checksum manifest will be provided with documentation that accompanies each batch of Collection Materials.

    Manifest file naming requirements:

    The file name must use “_” (underscore) to separate component parts of file name

    The file name must not include spaces or other diacritics

    Checksum Manifests should be stored in the Level 1 (root) Project ID directory for each batch.

    N.B. While checksum manifests are typically text or CSV files, some checksum generation software will apply the hash algorithm used, as the file extension to a checksum manifest text file. These files are still considered suitable text files. For example: checksum_manifest_filename.md5 instead of checksum_manifest_filename.text

    In addition to the checksum manifest some checksum software e.g. Shotput Pro will also output an Offload Log. If possible this should also be supplied as a .csv or.txt file

    Tools for Generating Checksums

    Depending on the Operating System used by the vendor, the SLNSW suggests the following checksum generation tools.

    For PC:

    ● Shotput Pro ● ExactFile ● Hash My Files ● TeraCopy 3.0 beta

    For Mac:

    ● Shotput Pro

  • © State Library of New South Wales – 2016 Page 19 of 36

    Checksum Manifest File Formats

    Depending on which checksum software is selected, the following file formats for the checksum manifests are expected to be generated from each of these tools:

    ● CSV - Hash My Files, Shotput Pro ● TXT - ExactFile ● XML - Shotput Pro ● XSLX - Hash My Files

    The specific file format will be negotiated with the SLNSW once the software tool for generating checksums has been nominated by the Contractor.

  • © State Library of New South Wales – 2016 Page 20 of 36

    5. METADATA

    The SLNSW requires a variety of metadata relating to the creation of digitised images to allow for the long term management and sustainability.

    The Library will provide a CSV file containing metadata for each batch of collection items to be processed. The detail of the supplied metadata is outlined in Section 4.1.1 Metadata provided by the Library.

    The CSV file is to be used by the Contractor to:

    ● incorporate specific metadata into new fields in the provided CSV for each item digitised as outlined in Section 4.1.2 Metadata required from Contractor and

    ● embed metadata into the XMP metadata (Section 4.2.1) and TIFF tag metadata (Section 4.3) into the captured image TIFF header as outlined in Section 4.2 Embedded Metadata

    External Metadata (CSV)

    This section covers what is supplied in the CSV file and how this data is to be supplemented and used by the Vendor.

    The CSV will be supplied for each batch of 25 books and will be named according to the Batch Number or Work Order Number for that Batch e.g. D47519.csv

    The supplemented CSV should be supplied using the same name and should be placed at the 3rd Level

    of the Delivery folder as outlined in 6.2 File Directory Structure – Archival Master Deliverables

    Metadata Provided by the Library

    The Library will provide the following metadata in a CSV file for each batch of collection items. Each Volume will be represented on a single row of the CSV file.

    Field Label Definition

    BATCH Work Order Number supplied by the Library

    DIGITAL ID Digital ID supplied by the Library

    MATCH POINT 019 field subfield a from the bibliographic record – Libraries Australia number used as match point for overlay

    CALL NO. Call Number from the Millennium item record

    VOLUME Volume field from the Millennium item record if present

    TITLE 245 field from the bibliographic record

    AUTHOR 100 field from the bibliographic record if present

    EDITION 250 field from the bibliographic record if present

    PLACE 260 field subfield a from bibliographic record

    PUBLISHER 260 field subfield b from bibliographic record

    DATE 260 field subfield c from bibliographic record. May also include copyright information or multiple dates.

    MAT TYPE Code from MATTYPE fixed field in Millennium

  • © State Library of New South Wales – 2016 Page 21 of 36

    - - no information

    “a” – printed material

    “b” – archival material

    “c” – music, printed

    “d” – music MSS

    “e” – map, printed

    “f” – map, MSS

    “g” – film, video

    “j” – sound, music

    “i” – sound, word

    “k” – picture

    “m” – computer file

    “o” – kit

    “p” – mixed material

    “r” – realia (object)

    “t” – manuscript

    “y” – digitised

    “z” – online

    LANG ISO 639 code for the language of the book, e.g.

    “eng” – English

    “fre” - French

    “spa” - Spanish

    “dut” - Dutch

    “dan” – Danish

    “ger” – German

    PAGINATION 300 field subfield a – number of pages in the book, as printed in their respective sequences. “Leaves” differ from “pages” in that leaves are printed on one side not both sides.

    ILLUSTRATIONS 300 field subfield b – an indication about whether the book is illustrated, has maps, whether the illustrations are in colour, and the presence of other graphical content

    DIMENSIONS 300 field subfield c – usually an indication in cm of the height of the book

    BARCODE Barcode field from the Millennium item record

    RIGHTS Copyright statement supplied by the Library

    CONTRIBUTOR Contractor acknowledgment statement

    SL NOTES [as required]

    Metadata required from Contractor

    The Contractor is required to incorporate selected metadata into the provided CSV for each item digitised.

    Field Label Definition

    CAPTURE DATE Date of capture in ISO 8601 format

    CAPTURE DEVICE

    OPERATOR ID Identifier for the Operator

  • © State Library of New South Wales – 2016 Page 22 of 36

    EQUIPMENT ID Serial number

    DIGITAL ID SEQUENCE Example of this

    EXCEPTION NOTE Exception reports for missing or out of sequence pages

    CONDITION NOTE Damage and condition notes

    PAGE COUNT Actual page count based on pages digitised (excluding Resolution Scans)

    CONTRACTOR NOTES [as required]

    QA CHECKS General note after digitisation

    Embedded Metadata

    The Library requires that a small amount of metadata is embedded into Archival Master TIFF files, along with metadata that is automatically produced during digitisation. This is to assist with the long term management of the images and is required within the XMP and TIFF tag metadata.

    The metadata to be embedded is supplied in the CSV in the following fields.

    MATCH POINT (Unique ID)

    RIGHTS

    CONTRIBUTOR (Contractor)

    XMP Metadata

    The Extensible Metadata Platform (XMP) is an ISO standard, originally created by Adobe, to create, process and exchange standardised and custom metadata for digital documents and data sets.

    A typical process to update XMP data is to create an XMP file containing the required data from the supplied CSV and then apply that to a collection of images using an Adobe product such as Photoshop or Bridge or any of a number of open source or proprietary metadata editing tools.

    A list of metadata editing tools can be found at: https://en.wikipedia.org/wiki/Extensible_Metadata_Platform#Support_and_acceptance.

    Alternatively a command line tool such as Exiftool can be used to populate the required metadata e.g.

    exiftool -identifier="Match point from CSV" -contributor="Contributor from CSV followed by

    Contractor name" -rights="Rights from CSV" -documentname="Match point from CSV" -

    artist="Contributor from CSV followed by Contractor name" -copyright="Rights from CSV" -

    overwrite_original -P filepath\*.tif

    Metadata to be embedded in XMP format should use the following namespaces:

    EXIF (namespace: http://ns.adobe.com/exif/1.0/)

    TIFF (namespace: http://ns.adobe.com/tiff/1.0/)

    DC (namespace: http://purl.org/dc/elements/1.1/)

    http://ns.adobe.com/exif/1.0/http://ns.adobe.com/tiff/1.0/

  • © State Library of New South Wales – 2016 Page 23 of 36

    CSV fields DC elements Name

    Type Description Value

    Contributor dc:contributor Unordered array of ProperName

    DCMI definition: An entity responsible for making contributions to the resource. DCMI comment: Examples include a person, an organization, or a service. Typically, the name of a contributor should be used to indicate the entity.

    XMP addition: XMP usage is a list of contributors. These contributors should not include those listed in dc:creator.

    CONTRIBUTOR (supplied by Library in CSV)

    followed by Contractor’s name

    Match point

    dc:identifier Text DCMI definition: An unambiguous reference to the resource within a given context.

    DCMI comment: Recommended best practice is to identify the resource by means of a string conforming to a formal identification system.

    MATCH POINT

    (supplied by Library in CSV)

    Rights dc:rights Language

    Alternative

    DCMI definition: Information about rights held in and over the resource.

    DCMI comment: Typically, rights information includes a statement about various property rights associated with the resource, including intellectual property rights. XMP addition: XMP usage is a list of informal rights statements, given in various languages.

    RIGHTS

    (supplied by Library in CSV)

    XMP example

    The following is an example of well-formed XMP. Content in the example is supplied for demonstration purposes only. The sections highlighted in bold green text show the correct encoding for the values that will require embedding as outlined in 4.2.1. and in 4.3 TIFF tags.

  • © State Library of New South Wales – 2016 Page 24 of 36

    IQ180, User Firmware: 2.01, Main=2.35, Boot=1.00, FPGA=1.12,

    DPLD=1.01, Util=1.28, CIF=4.23.1, TGEN=2.00.32768, Fonts=0.05,

    Images=0.05.18

    Mamiya MACRO 120mm f/4.0 D

    FP001138

    Contributor from CSV followed by Contractor name

    Ellis, Alfred William Leslie 1914-19-44 image/tiff Match point value from CSV eng

    Rights value from CSV Alfred William Leslie Ellis, Group Captain,

    papers, 1914-1944, mainly concerning the 4th Squadron, Australian Flying

    Corp textual record 13301/2002 2013-06-28T14:41:00 0220 1/2 10/1 3 120/1 50

    http://www.w3.org/1999/02/22-rdf-syntax-nshttp://ns.adobe.com/exif/1.0/aux/http://purl.org/dc/elements/1.1/http://ns.adobe.com/exif/1.0/

  • © State Library of New South Wales – 2016 Page 25 of 36

    255 5555 6105 1 1/1 FP001138 3 2013-06-28 Adobe RGB (1998)

    Contributor from CSV followed by Contractor name Match point value from CSV 8 8 8 1

    Rights value from CSV 6105 5555 iCam Archive GUARDIAN 1 2 1 2 3 CCS docWORKS v6.6-1.19 400/1 400/1

    2013-07-02T20:24:12 CCS docWorks v6.6.1.19 2013-07-02T20:24:12 2013-07-02T20:24:12

    xmp.iid:79543E44-01E3-E211-BA37-0022195FC2E8

    saved

    xmp.iid:79543E44-01E3-E211-BA37-

    0022195FC2E8

    CCS docWorks v6.6.1.19

    http://cipa.jp/exif/1.0/http://ns.adobe.com/photoshop/1.0/http://ns.adobe.com/xap/1.0/http://ns.adobe.com/xap/1.0/sType/ResourceEventhttp://ns.adobe.com/xap/1.0/mm/

  • © State Library of New South Wales – 2016 Page 26 of 36

    2013-07-02T20:24:12 xmp.iid:79543E44-01E3-E211-BA37-0022195FC2E8

    TIFF Tags

    The SLNSW requires the use of some of the data elements in ANSI/NISO Z39.87-2006 (R2011) Data Dictionary—Technical Metadata for Digital Still Images to record technical metadata in the TIFF header of each image. In addition to the required TIFF tags, some additional tags are utilised to document the provenance of each image.

    The following table shows the tags to be included in the Archival Master header:

    Tag ID Name Description Sample Values

    256 ImageWidth The number of columns in the

    image, i.e., the number of pixels

    per row.

    Positive integer count of pixels

    257 ImageLength The number of rows of pixels in

    the image

    Positive integer count of pixels

    258 BitsPerSample Number of bits per component

    for each pixel

    1 = 1-bit(bitonal)

    8 = 8-bit Grayscale

    8,8,8 = RGB (24-bit)

    16 = 16-bit Grayscale

    16,16,16 = RGB (48-bit)

    8,8,8,8 = CMYK

    259 Compression Compression scheme used on

    the image data.

    1=Uncompressed

    2=CCCIT 1D

    3=CCCIT Group 3

    4=CCCIT Group 4

    5=LZW

    6=JPEG

    262 PhotometricInterpretation The colour space of the image

    data

    0=WhiteIsZero (bitonal &

    Grayscale)

    1=BlackIsZero (bitonal &

    Grayscale)

    269 DocumentName Name of Document from which

    scanned image was made

    Match Point from CSV

    271 Make Camera/Scanner Make e.g. Sinar AG

    272 Model Camera/Scanner Model e.g. Sinarback Evolution 75,

    Sinar P3

    273 StripOffsets For each strip, the byte offset of

    that strip

    Positive Integer

  • © State Library of New South Wales – 2016 Page 27 of 36

    Tag ID Name Description Sample Values

    274 Orientation The orientation of the image with

    respect to the rows and column

    1=normal

    277 SamplesPerPixel The number of colour

    components per pixel

    1 = When

    PhotometricInterpretation = 0 or

    1 (bitonal or Grayscale)

    3 = When

    PhotometricInterpretation =2

    (RGB)

    4 = When

    PhotometricInterpretation = 5

    (CMYK)

    278 RowsPerStrip The number of rows per strip Positive integer

    279 StripByteCounts For each strip, the number of

    bytes in the strip after

    compression

    Positive integer

    282 XResolution The number of pixels per

    ResolutionUnit in the

    ImageWidth direction

    400

    283 YResolution The number of pixels per

    ResolutionUnit in the

    ImageLength direction

    400

    284 PlanarConfiguration How the components of each

    pixel are stored

    1 = Chunky format. The

    component values for each

    pixel are stored contiguously.

    For example, for RGB data, the

    data is stored as RGBRGBRGB

    2 = Planar format. The

    components are stored in

    separate component planes.

    For example, RGB data is

    stored with the Red

    components in one component

    plane, the Green in another,

    and the Blue in another

    296 ResolutionUnit The unit of measurement for

    XResolution and YResolution

    inch

    305 Software Name and version number of the

    software package(s) used to

    create the image

    e.g. CaptureShop V6.5

    306 DateTime Date and time of image creation YYYY:MM:DD hh:mm:ss

    315 Artist Used for imageProducer CONTRIBUTOR (supplied by Library in CSV)

    followed by Contractor’s name

  • © State Library of New South Wales – 2016 Page 28 of 36

    Tag ID Name Description Sample Values

    Name of the Contractor or

    production unit responsible for

    creation of the file

    700 XMP Extensible Metadata Platform for

    technical and descriptive

    metadata

    See 4.2.1 XMP above

    34675 ICCProfile ICC Colour profile name e.g. sRGB or,

    AdobeRGB(1998)

    33432 Copyright Copyright notice as supplied by

    Library

    RIGHTS from CSV (supplied by

    Library in CSV)

  • © State Library of New South Wales – 2016 Page 29 of 36

    6. QUALITY ASSURANCE

    Quality Assurance (QA) encompasses all procedures and techniques to verify the quality, accuracy and consistency of the deliverables.

    The Contractor’s Quality Plan must include a detailed overview of all Quality Assurance processes that will be conducted in providing the digitisation services and the measures that will be applied.

    The SLNSW will engage in a formal evaluation both to verify that the digital product of the participating Contractors meets the specifications detailed in this document and to evaluate the quality and acceptability of the digital imaging for further post-digitisation needs (e.g. OCR).

    The SLNSW reserves the right to reject any digital file or files which fail to meet the specifications and requirements detailed in this document, as determined by both automated and manual evaluation methods.

    The SLNSW also reserves the right to require the Contractor to rescan any page images which do not meet the specifications and requirements of this document or to refuse payment, up to and including the whole digital product produced for a shipment of materials. If the digital product of any Contractor fails to meet these specifications and requirements after rescan, the SLNSW reserves the right to require the return of materials for submission to another Contractor.

    The Library’s quality acceptance criteria are:

    Criteria Description Acceptance Level

    File name accuracy File names and directory names are accurate 100%

    File format accuracy File formats are per the Library’s specification 100%

    Image quality Images meet image quality specification 100%

    Completeness Required pages are captured and represented

    as a digital image (i.e. no page images are

    missing)

    100%

    Metadata accuracy Metadata and TIFF tags are present and

    accurate for auto-generated fields

    100%

    Delivery media Delivery media is readable and images can be

    ingested by the Library

    100%

  • © State Library of New South Wales – 2016 Page 30 of 36

    7. FILE NAMING AND FOLDER STRUCTURES

    File and Folder Naming Schema

    Instructions for specific file naming schema will be provided with documentation that accompanies each batch of Collection Materials.

    File Directory Structure – Archival Master Deliverables

    The following file directory structure is to be used:

    Level Item Name

    1st level Delivery Folder named: 3 Letter Project Code_4 Letter Vendor Code_Reverse Date_Delivery Number

    2nd level Administration file folder named: Admin Files containing Checksums, QA Certificates, Project Status Reports etc

    2nd level Batch Number = Workorder Number (e.g. D47549)

    3rd level The updated CSV for this workorder (e.g. D47549.csv)

    3rd level Intellectual Entity (IE) folder named using the stem of the Digital ID_Match Point: e.g. b08851_12345678

    4th level Master file folder named: Master_ Digital ID (e.g. Master_b08851)

    5th level Archival Master files sequence: contains all Master files for this ID, including Resolution Scans etc (e.g. b08851_0001_m.tif to…b08851_9999_m.tif

  • © State Library of New South Wales – 2016 Page 31 of 36

    Level Item Name

    Example screen shot

  • © State Library of New South Wales – 2016 Page 32 of 36

    File Naming and Digital ID Schema

    Each file will have a unique identifier. The naming pattern will be:

    Item Description

    Master file b08851_0004_m.tif (to …. b08851_9999_m.tif) or b08851_0001_r.tif

    Where: b08851 is the Digital ID

    0001 is a 4 digit sequence counter

    ‘m’ is the signifier for an Archival Master file

    ‘r’ is the signifier for an Archival Reference file such as a Title

    Scan, Resolution Scan or any Fold-out scan with a QP Card

  • © State Library of New South Wales – 2016 Page 33 of 36

    8. SERVICE LEVELS

    Services, responsibility and measure

    Service Contractor’s Responsibility

    Principal’s responsibility

    Measure Level required

    7.1.1 Digitisation of Collection Materials to create Archival Master files

    Meets the requirements set out in this SOR

    100%

    7.1.2 Supply of specific Administrative metadata

    Meets the requirements set out in this SOR

    100%

    7.1.3 Creation of Checksum data

    Meets the requirements set out in this SOR

    100%

    7.1.4 Quality assurance of the Deliverables produced from the services described in sections 7.1.1 and 7.1.2 (above)

    Meets the requirements set out in this SOR

    100%

    7.1.5 File transfer of the digitised files and return of Collection Materials (originals and digitised files)

    Meets the requirements set out in this SOR

    100%

    7.1.6 Retention of the digitised files until the integrity and Quality acceptance of the transferred files has been confirmed by SLNSW

    Meets the requirements set out in this SOR

    100%

    7.1.7 Project Management

    Meets the requirements set out in this SOR

    100%

    7.1.8 Change in Technical Evaluation Schedule

    Inform SLNSW’s Project Manager with supporting evidence in writing immediately as per clause 7.6 “Delays” of the Master Digitisation Services Agreement. (Part C)

    100%

    7.1.9 Response from SLNSW on change to Project Schedule

    Upon receiving the written advice about changes in project schedule, SLNSW’s Project Manager will respond back in 3 Business Days with the Principal’s decision on the change in project schedule

    100%

  • © State Library of New South Wales – 2016 Page 34 of 36

    Service Contractor’s Responsibility

    Principal’s responsibility

    Measure Level required

    7.1.10 Delivery of Digitised files (Deliverables)

    All deliveries to SLNSW must have a supporting QA Report

    100%

    7.1.11 Quality Inspection

    The quality of Deliverables will be assessed against requirements set out in the Statement of Requirements and the Delivery Report will be completed within one business day of receiving the Collection Materials

    100%

    7.1.12 Technical Evaluation

    SLNSW must complete inspections and certify acceptance within 30 days of receiving the Deliverables QA certified from the Contractor

    100%

  • © State Library of New South Wales – 2016 Page 35 of 36

    9. DELIVERABLES

    Matrix of deliverables mapped against each service:

    Service Deliverable

    8.1.1 Archival Master • Archival Masters must be created as per Section 3.6

    • Metadata must be embedded as specified in Section 4

    • Archival Master is to be saved as per the file naming convention in Section 6

    8.1.2 Addition of specific administrative information in the provided spreadsheets (CSV’s)

    Completed Summary spreadsheet (CSV)

    8.1.3 Addition of specific technical and administrative information in the provided spreadsheets

    Completed Itemised Collection Metadata spreadsheet

    8.1.4 Creation of Checksum data for each file and batch

    Checksum Manifest generated and supplied in specified file format

    8.1.5 Quality assurance of digitisation deliverables

    Files digitised as per Quality Acceptance Criteria (Section 9)

    8.1.6 File transfer and return of Collection Materials and digitised files

    Files transferred onto the supplied USB/Flash Hard drive/s,

    Items packed, Notification to the Library to arrange transport.

    8.1.7 Retention of digitised files Access to Contractor version of digitised files

    8.1.8 Project Management Compliance with SLNSW supplied templates for status reporting

  • © State Library of New South Wales – 2016 Page 36 of 36

    10. QUALITY ASSURANCE AND ACCEPTANCE OF DELIVERABLES

    The Contractor will on reasonable notice from the Library allow the Library to inspect materials and services associated with digitisation of the collection items as per this SOR.

    The Contractor’s services and the Deliverables must be of the professional standard and reflect expertise commensurate with standard commercial or industrial practice for activities of those required under this Agreement and shall be suitable for the Objectives and Purpose described in sections 1.2 and 1.3 of this document.

    Acceptance of deliverables will be measured against fulfilment of the requirements in this Statement of Requirements (including Annexures A to E).

    Quality Assurance Process

    The following acceptance process for Deliverables will apply:

    1. Delivery of Digitised files (Deliverables) - Contractor to the SLNSW

    All project Deliverables must have a supporting QA Certificate completed by the Contractor

    2. SLNSW QA Inspection The quality of Deliverables will be assessed against requirements set out in the Statement of Requirements

    3. SLNSW’s QA Inspection - Acceptance

    The Library must complete inspections and certify acceptance within 30 days of receiving the Deliverables QA certified from the Contractor

    4. SLNSW Quality Rejections

    The Contractor must correct any deficiencies identified by the SLNSW within its 30 days’ quality inspection review period at the Contractor’s own expense and resubmit the deliverable(s) within 10 Business days from the date of notification.

    5. Review by the SLNSW of rejected Deliverables

    The SLNSW must complete inspections and certify acceptance within 30 days of receiving the rejected Deliverables QA certified from the Contractor

    6. Resubmissions

    Rework should be submitted under the same workorder number as the original rejected work with a numerical counter on the workorder number to indicate that it is a resubmission. The names for the Masters should be the same as the files they are to replace. For example if there are files rejected under a workorder D12345 the resubmitted folders would be named: D12345 (the original submission) D12345_1 (the first resubmission) D12345_2 (the second resubmission)

    The Library will acknowledge the acceptance of deliverables by supplying an Acceptance Certificate.

    Technical Evaluation Assessment Criteria

    The Library’s assessment criteria are:

  • © State Library of New South Wales – 2016 Page 37 of 36

    Criteria Description Acceptance Level

    Archive Master -

    Compliance

    The extent of capture is to follow the SLNSW’s

    specification 100%

    Archive Master -

    Completeness

    Archive Master files exists for each page of volumes

    supplied as per the State Library NSW’s specification 100%

    Archive Master -

    Sharpness

    Image quality is to meet the SLNSW’s image quality

    specification 100%

    Archive Master - File

    format accuracy

    Archive Master file format have been created as per the

    SLNSW’s specification 100%

    Archive Master - File

    name accuracy

    Archive Master file names and directory structure are

    accurate and follow SLNSW’s specification 100%

    Archive Master -

    Metadata

    Metadata has been written to the Archive Master files

    as per the SLNSW’s specification 100%

    Archive Master files -

    Cropping

    Archive Master files have been cropped as per the

    SLNSW’s Specification 100%

    Volumes –

    Resolution Scans

    Each digitised file set from each volume is to contain

    Resolution scans as per the SLNSW’s Specification 100%

    Generate Checksum

    for each file

    MD-5, SHA-1 or SHA-256 checksums 100%

    Creation of

    Checksum Manifest

    Checksum manifest provided for each Collection as the

    file type CSV, TXT or XML

    100%

    File Delivery and

    Media - Operating

    System & File

    System

    Windows OS 7.0 compatible, NTFS and free of

    malware 100%

    Reporting -

    Spreadsheets

    Spreadsheets completed:

    Summary spreadsheet, Itemised Collection Metadata

    spreadsheet

    100%

  • © State Library of New South Wales – 2016 Page 38 of 36

    11. DEFINITIONS AND ACRONYMS

    Term Definition

    Accepted (work) Packages of work returned to the Library by the successful Contractors that meet all quality acceptance checks conducted by the Library

    Archival Master A high Resolution reproduction of the Original source material that contains content that the Library intends to maintain for the long term without loss of essential features.

    ALTO ALTO (Analysed Layout and Text Object) an XML Schema that details technical metadata for describing the layout and content of physical text resources

    Collection Item / Material

    Any item that forms part of the collection held by the Library

    Checksum Checksums are used to ensure the integrity of a file after it has been transmitted from one storage device to another. The checksum is calculated using an algorithm and is normally posted along with the download. To verify the integrity of the file, a user calculates the checksum using a checksum calculator program and then compares the two to make sure they match

    Checksum Manifest A log file containing a checksum plus its associated file path and file name metadata. Checksum manifests may also include other technical metadata, such as date-time stamps of files and information about the software and version that created the checksum manifest file

    Cropping/Cropped The process of reducing the size of images by removing parts that are near the edges

    Deliverable(s) Digital files or services that the successful Contractors will provide

    Descriptive Metadata Metadata describing and identifying information resources

    Deskewing The correcting of distortion caused by image capture from a viewpoint other than on the perpendicular

    Digital Imaging A term used generally to describe the process of creating and manipulating digital images

    Digitisation The process of converting analogue materials into digital objects. This involves not only digital imaging but also the production of Metadata and OCR-ing

    Foldouts Foldouts can be attached or contained as a loose item within the volume. A foldout page that is imaged on a device different to regular page images will be a Foldout Scan

    Image Manipulation The process of editing images e.g. de-skewing, cropping, rotating, de-speckling, etc.

    Image Optimisation for OCR

    The process of improving the digital image for improved OCR accuracy as well as display e.g. smoothing of characters, sharpening, increasing contrast, removing dirt and noise.

    Job A defined package of work identified within an Official Order (consists of multiple Work packages)

    Machine-readable Text

    A text file derived from a Source image through an OCR process

    Match Point Unique bibliographic identifier from the Libraries Australia Database

    Metadata Metadata is structured data that provides intelligence in support of more efficient operations on resources, such as preservation, reformatting, analysis, discovery and use. It operates at its best in a networked environment, but is still a necessity in any digital storage and preservation environment. Metadata instructs end-users (people and computerised programmes) about how the data is to be interpreted. Metadata is vital to the understanding, coherence and successful functioning of each and every encounter

  • © State Library of New South Wales – 2016 Page 39 of 36

    with the archived object at any point in its lifecycle and with any objects associated with or derived from it.

    There are three main types of metadata

    1: Descriptive metadata describes a resource for purposes such as discovery and identification. It can include elements such as title, abstract, author, and keywords. This is critical for Discovery.

    2: Structural metadata indicates how compound objects are put together, for example, how pages are ordered to form chapters. This is critical for Delivery

    3: Administrative metadata provides information to help manage a resource, such as when and how it was created, file type and other technical information, and who can access it. There are several subsets of administrative data; two that are sometimes listed as separate metadata types are:

    Rights management metadata, which deals with intellectual property rights,

    Preservation metadata, which contains information needed to archive and preserve a resource.

    Optical Character Recognition (OCR)

    The conversion of text images into machine-readable text

    Original Source Material

    The hard copy or other text-based material used to create Source images

    Page(s) A single page printed in any text-based item

    Printed Materials SLNSW Collection Category consisting of Books, Pamphlets, Catalogues, Serials

    Quality Assurance (QA)

    Quality assurance (QA) encompasses all procedures and techniques to verify the quality, accuracy, and consistency of the deliverables

    Reference File A high resolution file of content that the Library intends to maintain for the long term without loss of essential features. It includes Title Scans, Resolutions Scans and Foldouts with QP cards. These files will not be displayed through the catalogue.

    Resolution Maximum spatial Resolution without recourse to interpolation. Resolution imaged by optical system in a capture device – without subsequent software interpolated pixels

    Rotating Process of turning (rotating) the image file so that the top of the page appears at the top of the image file

    Sharpening A software method of exaggerating ‘edges’ in an image to give an enhanced definition

    Source Image, Page Image

    An image representing a scanned book/serial/catalogue page

    State Library State Library of New South Wales (SLNSW)

    SOR Statement of Requirements

    Technical Metadata Metadata that describes the technical attributes of the digital image

    TIFF Tagged Image File Format, a file format for storing images

    Volume A single book/serial/journal/catalogue belonging to the SLNSW’s printed materials collection

    Work Package A package of work that constitutes a batch of volumes to be digitised

  • © State Library of New South Wales – 2016 Page 40 of 36

    Work Report A customised report outlining detail about completed work packages

    Work Schedule The schedule in which Work packages are to be sent by the Library and delivered by the successful Contractors.

    12. STANDARDS REFERENCES & VERSION REQUIREMENTS

    The Library’s standards are based on internationally applied standards as set out in the document:

    TIFF – TIFF Revision 6.0 & Supplements.

    o http://partners.adobe.com/public/developer/en/tiff/TIFF6.pdf

    o http://partners.adobe.com/public/developer/tiff/index.html

    ANSI/NISO Z39.87-2006 (R2011) – Data Dictionary—Technical Metadata for Digital Still Images

    o http://www.niso.org/apps/group_public/download.php/14698/z39_87_2006_r2011.pdf

    XMP – “ISO 16684-1:2012 Graphic technology -- Extensible metadata platform (XMP)

    specification” and “Adobe XMP Specification”.

    o http://www.iso.org/iso/iso_catalogue/catalogue_tc/catalogue_detail.htm?csnumber=57421

    o http://www.adobe.com/devnet/xmp.html

    Adobe RGB (1998) ICC profile

    o https://www.adobe.com/digitalimag/pdfs/AdobeRGB1998.pdf

    ANSI

    http://partners.adobe.com/public/developer/en/tiff/TIFF6.pdfhttp://partners.adobe.com/public/developer/tiff/index.htmlhttp://www.niso.org/apps/group_public/download.php/14698/z39_87_2006_r2011.pdfhttp://www.iso.org/iso/iso_catalogue/catalogue_tc/catalogue_detail.htm?csnumber=57421http://www.adobe.com/devnet/xmp.htmlhttps://www.adobe.com/digitalimag/pdfs/AdobeRGB1998.pdf

  • © State Library of New South Wales – 2016 Page 41 of 36

    13. LIST OF ANNEXURES

    Annexure A – Project Management Status Report for completion by the Contractor

    The Project Management Status Report is to be used to track progress against the agreed baseline of the project. It is a requirement this is updated and sent to the Library on a regular basis (agreed by both parties).

    Annexure B – Quality Assurance Report for completion by the Contractor

    The Quality Assurance Report is to be completed by the Contractor at the end of digitisation for the Technical Evaluation. It must be included with the Archive Master files when being sent back to the Library.

    Annexure C – Deliverables Acceptance Criteria for completion by the State Library of NSW

    The Acceptance Criteria/Quality Assurance Report will be completed by the Library at the completion of their QA activities for the Technical Evaluation.

    Annexure D – Collection Metadata Spreadsheets for completion by the Contractor

    Itemised Collection Metadata Spreadsheet – this spreadsheet is used to capture and add various descriptive, technical and administrative metadata for each item and/or files in the collection. The Contractor is required to follow the instructions in the spreadsheet and add required metadata.

    Summary for reporting spreadsheet – this spreadsheet is used to provide summary details regarding what collections are to be digitised, including information such as number of collection items/volumes, estimated page numbers etc.

    Annexure E – Sharpness Comparison Text File.jpg

    Sharpness reference file to visually determine acceptance levels for in and out of focus text.