Upload
smita-chandra
View
976
Download
1
Embed Size (px)
DESCRIPTION
Citation preview
Digital Preservation Digital Preservation of of
Geoscience InformationGeoscience Information
Smita ChandraSmita Chandra
LibrarianLibrarian
22
33
Importance of Digital Information Importance of Digital Information PreservationPreservation
1975 – Two Viking space probes sent to Mars by USA.1975 – Two Viking space probes sent to Mars by USA.
Data generated by unrepeatable mission cost $1 billion.Data generated by unrepeatable mission cost $1 billion.
Recorded data on magnetic tapes was corrupted / Recorded data on magnetic tapes was corrupted / unidentifiable after 2 decades despite being kept in unidentifiable after 2 decades despite being kept in climate controlled environment. climate controlled environment.
Scientists could not access data, unable to decode the Scientists could not access data, unable to decode the formats used. formats used.
44
Importance of Digital Information Importance of Digital Information PreservationPreservation
Original format developers not alive.Original format developers not alive.
Finally old printouts tracked and retyped.Finally old printouts tracked and retyped.
NASA therefore is the biggest supporter of Digital NASA therefore is the biggest supporter of Digital Preservation Projects. Preservation Projects.
This illustrates wide gap in information generation and its This illustrates wide gap in information generation and its management. management.
55
Outline of PresentationOutline of Presentation
Digital information: forms and typesDigital information: forms and typesGeoscience informationGeoscience information Institutional Repositories (IR)Institutional Repositories (IR)Digital Preservation (DP); strategies for Digital Preservation (DP); strategies for
DPDPOAIS model & its implementationOAIS model & its implementation Indian scenario Indian scenario Research proposal & expected resultsResearch proposal & expected results
66
Digital InformationDigital Information
Information in digital formInformation in digital form Born DigitalBorn Digital Converted from AnalogConverted from Analog
Types of Digital InformationTypes of Digital Information Electronic PublicationsElectronic Publications Organizational and Personal RecordsOrganizational and Personal Records DataData Learning Objects like articles, booksLearning Objects like articles, books Software ToolsSoftware Tools Unpublished MaterialsUnpublished Materials Electronic ManuscriptsElectronic Manuscripts Entertainment ProductsEntertainment Products Images (Digitally designed or digitized)Images (Digitally designed or digitized) WebsitesWebsites
77
Threats Threats Media decay and failureMedia decay and failure
Massive storage failures,Massive storage failures, outdated mediaoutdated mediaAccess Component Access Component Obsolescence Obsolescence
Outdated formats, applications & systemsOutdated formats, applications & systemsHuman and Software errors Human and Software errors && External EventsExternal Events
88
Information DelugeInformation DelugePresent & Future ProjectionsPresent & Future Projections
Yawning gap betweenYawning gap between
Our ability to create digital informationOur ability to create digital information Our infrastructure and capacity to manage and Our infrastructure and capacity to manage and
preserve it over timepreserve it over time Cumulative effect foreseen as future “digital dark Cumulative effect foreseen as future “digital dark
ages”ages”
99
Need for Digital PreservationNeed for Digital Preservation
preserving natural/cultural heritagespreserving natural/cultural heritages
for promoting academic researchfor promoting academic research
enabling public access to legacy enabling public access to legacy collectionscollections
1010
Geoscience InformationGeoscience Information
Encompasses complex human-natural systemEncompasses complex human-natural system
Storehouse of massive heterogeneous data sets, and a Storehouse of massive heterogeneous data sets, and a wide variety of wide variety of content and data types which reflect the features of various research content and data types which reflect the features of various research fields of study fields of study
Every content holder aim at the needs of their particular community Every content holder aim at the needs of their particular community and work independently with a loose collaboration and integrationand work independently with a loose collaboration and integration
Every content holder has their respective digital archive system with Every content holder has their respective digital archive system with individual data structure, management policy and search interface, individual data structure, management policy and search interface, however, there is an inability to transform and integrate data with each however, there is an inability to transform and integrate data with each other transparentlyother transparently
Enabling and improving the interoperability for heterogeneous Enabling and improving the interoperability for heterogeneous collections is importantcollections is important
Source : Loudon, T.V. Geoscience after IT : Part A & Part B. Computers & Geosciences, 2000, Source : Loudon, T.V. Geoscience after IT : Part A & Part B. Computers & Geosciences, 2000, 2626(3A), (3A), A1-13.A1-13.
1111
Institutional Repositories (1)Institutional Repositories (1)
DefinitionDefinition : :An institute-based repository is a set of An institute-based repository is a set of
services that an academic institution services that an academic institution offers to the members of its community offers to the members of its community for the management and dissemination for the management and dissemination of digital materials created by the of digital materials created by the institution and its community members.institution and its community members.
Source: Clifford A. Lynch (February 2003), “Institutional Repositories: Essential Infrastructure for Scholarship in the Digital Age” ARL Bimonthly Report 226: 1-7. http://www.arl.org/newsltr/226/ir.html
1212
Institutional Repositories (2)Institutional Repositories (2)
Main ObjectivesMain Objectives to create global visibility for an institution's to create global visibility for an institution's
scholarly research; scholarly research; to collect content at a single location; to collect content at a single location; to provide to provide open accessopen access to institutional research to institutional research
output by output by self-archivingself-archiving it; it; to store and to store and preservepreserve other institutional digital other institutional digital
assets, including unpublished or otherwise easily assets, including unpublished or otherwise easily lost ("grey") literature (e.g., theses or technical lost ("grey") literature (e.g., theses or technical reports). reports).
1313
Institutional Repositories (3)Institutional Repositories (3)
IR SoftwaresIR Softwares DSpace (dspace.mit.edu)DSpace (dspace.mit.edu) Eprints.orgEprints.org
Subject Specific IRsSubject Specific IRs arXiv (arXiv (www.arXiv.orgwww.arXiv.org)) RePEc (Research Papers in Economics) (RePEc (Research Papers in Economics) (
www.repec.orgwww.repec.org)) CogPrints (CogPrints (www.cogprints.orgwww.cogprints.org)) NASA Technical Report Server (NASA Technical Report Server (ntrs.nasa.govntrs.nasa.gov)) Networked Computer Science Technical Reference Networked Computer Science Technical Reference
Library (Library (www.ncstrl.orgwww.ncstrl.org))
1414
Institutional Repositories (4)Institutional Repositories (4)
An IR is a model for a preservation system An IR is a model for a preservation system
It requires “most essentially an organizational commitment to the It requires “most essentially an organizational commitment to the stewardship of … digital materials, stewardship of … digital materials, including long-term including long-term preservationpreservation where appropriate, as well as organization and where appropriate, as well as organization and access or distribution”access or distribution”
Attributes of a “Trusted Digital Repository” Attributes of a “Trusted Digital Repository”
“… “…an organisation that has responsibility for the long-an organisation that has responsibility for the long-term maintenance of digital resources, as well as term maintenance of digital resources, as well as making them available [through time and across making them available [through time and across changing technologies] to communities agreed on by changing technologies] to communities agreed on by the depositor and the repositorythe depositor and the repository.” .”
Research Libraries Research Libraries Group Group
http://www.rlg.org/longterm/attributes01.pdfhttp://www.rlg.org/longterm/attributes01.pdf
1515
DefinitionDefinition: : Digital PreservationDigital Preservation
The maintenance of digital materials over the long-termThe maintenance of digital materials over the long-termwith a view to ensuring its continued accessibility. Itwith a view to ensuring its continued accessibility. Itensures that the digital resources are stored correctlyensures that the digital resources are stored correctlyand maintained adequately in the online world, suchand maintained adequately in the online world, suchthat they are available consistently for use over time.that they are available consistently for use over time.
““Long-termLong-term” includes timescales of decades or even centuries” includes timescales of decades or even centuries
1616
Preservation StrategiesPreservation Strategies
Technology preservationTechnology preservation Keep the hardware alive Keep the hardware alive
Technology emulationTechnology emulation Create an environment to be able to run the Create an environment to be able to run the
existing software existing software
Data migrationData migration Convert data to new formats to run in new Convert data to new formats to run in new
applications applications
1717
Open Archival Information Open Archival Information System (OAIS)System (OAIS)
Published by Consultative Committee for Space Data System Published by Consultative Committee for Space Data System (CCSDS) 2002, ISO 14721 : 2003 standard(CCSDS) 2002, ISO 14721 : 2003 standard
An archive consists of an organization of people and systems An archive consists of an organization of people and systems with responsibility to preserve information and make it available with responsibility to preserve information and make it available to users. to users.
SIP = Submission Information PackageAIP = Archive In formation PackageDIP = Dissemination Information Package
1818
OAIS: DefinitionsOAIS: Definitions
To define an Open Archival Information SystemTo define an Open Archival Information System The term 'open' means that the document was developed in The term 'open' means that the document was developed in
an open way, and does not imply that access to any OAIS an open way, and does not imply that access to any OAIS should be unrestrictedshould be unrestricted
An archive is defined as an "organization that intends to An archive is defined as an "organization that intends to preserve information for access and use by a designated preserve information for access and use by a designated community." (p. 1-8)community." (p. 1-8)
While an OAIS itself need not be permanent, the information While an OAIS itself need not be permanent, the information being maintained has been deemed to need "Long Term being maintained has been deemed to need "Long Term Preservation"Preservation"
Long term = long enough for there to be a concern about the Long term = long enough for there to be a concern about the impact of changing technologiesimpact of changing technologies
1919
OAIS: Purpose and Scope OAIS: Purpose and Scope
Primary focus on digital informationPrimary focus on digital information Specific aims include:Specific aims include:
A framework for the understanding and awareness of the A framework for the understanding and awareness of the archival concepts needed for long term preservation (access)archival concepts needed for long term preservation (access)
Terminology and concepts for Terminology and concepts for describing and comparingdescribing and comparing:: Architectures and operationsArchitectures and operations Preservation strategies and techniquesPreservation strategies and techniques Data modelsData models
Consensus on elements and processes for long term Consensus on elements and processes for long term preservationpreservation
A foundation for other standardsA foundation for other standards
2020
OAIS: ApplicabilityOAIS: Applicability
ApplicabilityApplicability::Applicable to any archive, but mainly focused on Applicable to any archive, but mainly focused on
organisations with responsibility for making organisations with responsibility for making information available for the long terminformation available for the long term
Of interest to those who create informationOf interest to those who create information
ConformanceConformanceAn OAIS must support the information model - but An OAIS must support the information model - but
does not specify any particular method of does not specify any particular method of implementationimplementation
Mandatory responsibilities (section 3.1)Mandatory responsibilities (section 3.1)
2121
Implementing OAIS (1)Implementing OAIS (1) Summing up the fundamentals :Summing up the fundamentals :
OAIS is a reference model (conceptual framework), NOT a OAIS is a reference model (conceptual framework), NOT a blueprint for system designblueprint for system design
It informs the design of system architectures, the development It informs the design of system architectures, the development of systems and componentsof systems and components
It provides common definitions of terms, a common language It provides common definitions of terms, a common language and means of making comparisonand means of making comparison
But it does NOT ensure consistency or interoperability between But it does NOT ensure consistency or interoperability between implementationsimplementations
2222
Implementing OAIS (2)Implementing OAIS (2)
2323
Implementing OAIS (3)Implementing OAIS (3)
2424
Implementing OAIS (4)Implementing OAIS (4)
2525
Summing Up : OAISSumming Up : OAIS
The OAIS model is a foundation stone for The OAIS model is a foundation stone for current and future digital preservation effortscurrent and future digital preservation efforts
It is already widely used to inform the It is already widely used to inform the development of preservation tools and development of preservation tools and repositoriesrepositories
It could be used in the future as a basis for It could be used in the future as a basis for conformanceconformance
2626
Indian Scenario (1)Indian Scenario (1)
Open Digital RepositoryOpen Digital Repository Indian Institute of ScienceIndian Institute of Science ((http://etd.ncsi.ernet.inhttp://etd.ncsi.ernet.in))
National Chemical LaboratoryNational Chemical Laboratory ((http://dspace.ncl.res.in/dspace/index.jsphttp://dspace.ncl.res.in/dspace/index.jsp) )
Indian Statistical InstituteIndian Statistical Institute ((http://library.isibang.ac.in:8080/dspace/index/jsphttp://library.isibang.ac.in:8080/dspace/index/jsp))
Social Science DataSocial Science Data The Census of IndiaThe Census of India M.S.Swaminathan Research FoundationM.S.Swaminathan Research Foundation
Museums and Art GalleriesMuseums and Art Galleries Ministry of Culture, GOIMinistry of Culture, GOI The National ArchivesThe National Archives
2727
Indian Scenario (2)Indian Scenario (2)
Institute Institute ResourceResourceCentral Water CommissionCentral Water Commission Command area mapsCommand area maps
National Bureau of Soil Survey and National Bureau of Soil Survey and
Soil MapsSoil Maps Soil maps and land use dataSoil maps and land use data
Survey of India (SOI)Survey of India (SOI) Topographical maps, geodetic trigonometric Topographical maps, geodetic trigonometric and levelling data, gravity & geomagnetic data, and levelling data, gravity & geomagnetic data, GPS data, tidal data, repetitive geodetic & GPS data, tidal data, repetitive geodetic & geophysical data geophysical data
Geological Survey of India (GSI) Geological Survey of India (GSI) Geological maps on various scales, geological Geological maps on various scales, geological and seismic dataand seismic data
National Remote Sensing AgencyNational Remote Sensing Agency
(NRSA)(NRSA)
Satellite imageries, land use and wasteland Satellite imageries, land use and wasteland maps on different scalesmaps on different scales
Indian Meteorological Department Indian Meteorological Department (IMD)(IMD)
Meteorological and seismic dataMeteorological and seismic data
Ministry of Ocean Development Ministry of Ocean Development (MOD)(MOD)
Oceanic dataOceanic data
2828
Proposal for IRs in IndiaProposal for IRs in India
1.1. Providing adequate financial and technical resources for ensuring “digital Providing adequate financial and technical resources for ensuring “digital preservation” in IRs preservation” in IRs
2.2. National Informatics Center (NIC) entrusted with framing guidelines and National Informatics Center (NIC) entrusted with framing guidelines and policypolicy
or establishing a new agencyor establishing a new agency
For handling digital preservation, for collaboration, sharing and avoiding For handling digital preservation, for collaboration, sharing and avoiding duplicationduplication
3.3. Trusted Digital Repository for accurate and reliable informationTrusted Digital Repository for accurate and reliable information
4.4. Legally sustainable digital preservation policyLegally sustainable digital preservation policy
5.5. Joining the Digital Preservation ConsortiumJoining the Digital Preservation Consortium
6.6. Attention to collection management of digital material in librariesAttention to collection management of digital material in libraries
7.7. Amendment of the Delivery of Books Act and Press and Registration Act Amendment of the Delivery of Books Act and Press and Registration Act to cover the digital materialto cover the digital material
8.8. Training of manpower for the management and preservation of electronic Training of manpower for the management and preservation of electronic recordsrecords
9.9. Research in the area of digital preservationResearch in the area of digital preservation
2929
Research ObjectivesResearch Objectives
Testing a pilot IR in a stand alone modeTesting a pilot IR in a stand alone mode Implement an OAIS-compliant layer to the IR Implement an OAIS-compliant layer to the IR
drawing upon best practicesdrawing upon best practices To develop a preservation strategy and a To develop a preservation strategy and a
custom made model addressing issues like custom made model addressing issues like planning and policy for preservation, the role of planning and policy for preservation, the role of different players in the process, IPR and different players in the process, IPR and copyright, etccopyright, etc
3030
Research MethodologyResearch Methodology
Analog Materials
Digital Preservation
Converted
Born
Institutional Repository
Digitization Process Digital Materials
Material Selection Process
Short TermLong Term
3131
Expected ResultsExpected Results
This research would identify all the This research would identify all the components necessary for the components necessary for the implementation of the OAIS model for a implementation of the OAIS model for a geoscience domain specific institutional geoscience domain specific institutional repositoryrepository
3232
3333
Annexure 1Annexure 1
Preservation Description Information
Provenence
Context
Reference
Fixity
Content Data Object Representation
Information
Physical Object Digital Object
3434
Annexure 2Annexure 2
OAIS Mandatory Responsibilities:OAIS Mandatory Responsibilities:Negotiating and accepting informationNegotiating and accepting informationObtaining sufficient control of the information Obtaining sufficient control of the information
to ensure long-term preservationto ensure long-term preservationDetermining the "designated community" Determining the "designated community" Ensuring that information is "independently Ensuring that information is "independently
understandable"understandable"Following documented policies and Following documented policies and
procedures procedures Making the preserved information availableMaking the preserved information available
3535
Annexure 3Annexure 3