Upload
lisa-cooper
View
212
Download
0
Embed Size (px)
Citation preview
CASE STUDY:STATISTICS NORWAY (SSB)
Jenny Linnerud and Anne Gro Hustoft
Joint UNECE/Eurostat/OECD work session on statistical metadata (METIS)
Luxembourg 9-11 April 2008
Organisation structure
Introduction
Overall aim of SSB’s metadata strategy 2005
• To create a comprehensive metadata systemthat will contribute to an efficient production and dissemination of statistics, and at the same timeimprove the quality of the statistics.
• Our metadata should be updated in one placeand accessible everywhere.
Metadata systems completed before 2005
• Datadok – archive file descriptions
• Stabas - standard classifications database
• Metadb - metadatabase for event history data
• StatBank - dissemination database
Metadata projects completed after 2005
• Documentation of key metadata concepts
• Vardok - variables documentation system
• About the statistics as a content management system instead of just a document.
• Metadata portal on the Internet
Current metadata projects1. Improved editing tool for our classification database
2. Analysis of the end-to-end creation and re-use of metadata in one production cycle for one statistic.
3. Master metadatabase for questionnaires.
4. Service library for master metadata systems.
5. Metadata intranet portal.
6. Improved plan system for projects, products and processes.
7. Improved access to micro-data for researchers
Plan and design
2
Develop
3
Collect
4
Process
5
Analyse
6
Disseminate
7
Methodology and infrastructure
Quality measurement and control 8
Need
1
Statistical business process model
Metadata used/created at each phase
• Need - Plan system• Plan and Design - Check what is already available in all
metadatasystems. Ideally update all these but in practise this happens as the final step in the process if time permits.
• Develop - Update the metadatabase for questionnaires and rules. Service library for metadata systems.
• Collect - Metadatabase for questionnaires and rules. Metadb.
• Process - Service library for metadata systems. Metadatabase for rules. Stabas.
• Analyse - We use commercial software such as SAS. Metadata support could be better.
• Disseminate - Metadata portal, About the statistics, About the data collections, StatBank. Service library for metadata systems (Stabas, Vardok, Datadok, Metadb). Archival of flat files in Datadok.
Benefits
Statistics Norway’s strategy documents emphasise:• Metadata systems contribute to simplifying,
improving and re-use of work processes• Data that are disseminated and exchanged must
in addition to an agreed structure have sufficient metadata to give them meaning
• Use of metadata systems are a pre-condition for the development of efficient data capture solutions.
Costs
1. Variables documentation system Vardok• A total of 12690 man-hours have been used in
development with ca. 70% of resources from IT.• 2006 was the last year in the development
phase for the Vardok-project • A total of 476 man-hours from standards were
used in 2007 for continued harmonisation of names and definitions, and training of personnel in the subject matter divisions.
• 294 IT man-hours were used in 2007 for maintenance and minor changes to the system
Costs
2. Metadata portal (man-hours)
Service library for metadata systems
(under development)
The purpose of this project is to • Create a library of services for the master systems
Vardok, Datadok, Metadb and Stabas.• Define a framework for the description and formulation of
SSB's metadata based on international metadata models (e.g. Neuchâtel) and standards (e.g. ISO/IEC 11179).
• Investigate how RDF (Resource Description Framework) can be integrated into SSB's data communication.
The project began in 2005 and will end in 2008
IT architectureIT strategy 2007
Statistics Norway's technical solutions shall bebuilt mainly upon the principles of service-orientedarchitecture. All solutions for external users and most solutionsfor internal users shall:• Have support for open standards.• Be platform independent.• Be component based.• Have support for the packing in of data andfunctions in the form of services (web services).
System architects
• System architects are introduced for each
of the following areas in the top-level
information architecture: data collection,
metadata and dissemination.
• System architects have a responsibility to
ensure that IT development projects are in
line with the IT strategy.
Standards and formats
• Our classifications system is an implementation of Neuchâtel Terminology Model Part 1 Classifications v2.0.
• Our variables system is a partial implementation of Neuchâtel Terminology Model Part 2 Variables.
• We are considering using DDI in connection with micro-data for researchers.
• We have used definitions of key metadata concepts from SDMX MCV where possible.
Data Element Value Domain
0..N
1..1
1..1
0..N
1..1
0..N
0..N
1..1
+Having
+Specifying
+Represented by
+Representing
+Expressed by
+Expressing
+Representing
+Represented by
Data Element Concept Conceptual Domain
Vardok Stabas
Datadok & Metadb
ISO/IEC 11179: Metadata registries & SSBs master systems
Roles
• Subject matter statistician
• Survey manager
• Senior advisor in standards
• IT-developer
• System architect
• Web designer
Metadata management team
Development of new metadata systems
• 1 senior advisor in standards
• 1 system architect for metadata systems
• 2 IT-developers
Management of metadata systems
• IT-services and IT-infrastucture
Collaboration
• Scandanavia – dissemination database
• Neuchâtel – classifications
• Statistical Open Source – process model
• Metadata portal– contact us!
Training
• 2 hours of an eight day course for all new employees 3-6 months after employment
• Metadata forum ca. twice a year – open for all employees
• Approval process for system development documents – IT-developers and project leaders
• Invited presentations to all three levels of the organisation
• Regular meetings with middle management
Lessons learned
1. Top management support is essential– metadata strategy, IT strategy, key concepts
2. Step-wise development of metadata systems– user involvement, regular deliveries
3. Continous follow-up of progress and quality– Necessary and unnecessary differences– Internet as motivational factor
4. Include metadata systems in the production cycle
– capture as early as possible and re-use
The End