Metadata management and statistical business process at Statistics Estonia

Work Session on Statistical Metadata

(Geneva, Switzerland, 8-10 May 2013)

Kaja Sõstra, Eda Froš

Outline

Overview of integrated metadata management system
Statistical information system and business process model
Current metadata projects
Future plans
Conclusions

Integrated metadata management system iMETA

Outsourced project, August 2010 – July 2011
Integration of iMETA with other subsystems of our statistical information system (SIS)
Metadata-driven SIS
Bilingual application; metadata will be stored in Estonian and in English

Purposes of the new metadata management system

To support the whole statistical business process;
To enable the storage of metadata once for all usages;
To act as an instrument for harmonising and standardising metadata;
To act as a central repository for all subsystems and for various outputs regardless of purpose, media, format and time.

Source documents

Neuchâtel Terminology Model – classification database object types and their attributes
http://www3.ssb.no/stabas/DOCS/Neuchatelversion2.1.pdf

Neuchâtel Terminology Model – variables and related concepts
http://www.ssb.no/english/metadata/metadatadocuments/varneuchatelnodel.pdf

MMX Metadata Framework – implementation of MOF (built on relational database) technology
http://www.mmxframework.org/

Content of iMETA

Metadata navigator
Statistical activities
Classifications
Concepts
Statistical unit types and statistical characteristics
Measurement units
Information about questionnaires
Legal acts
Databases

General architecture of metadata system

User interface

Metadata navigator

The metadata repository contains metadata managed by the iMETA application as well as metadata managed by other applications.

The metadata navigator gives an overview of all metadata stored in the metadata repository (terminology objects, SQL objects, etc.).
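As a rough illustration of the navigator idea, the sketch below groups repository objects by type, whichever application manages them. The `MetadataObject` record, the `overview` function and the sample entries are hypothetical and are not taken from iMETA.

```python
from collections import defaultdict
from typing import NamedTuple


class MetadataObject(NamedTuple):
    name: str
    object_type: str  # e.g. "terminology object" or "SQL object"
    managed_by: str   # application that maintains the object


# Hypothetical repository content, for illustration only.
REPOSITORY = [
    MetadataObject("Example classification", "terminology object", "iMETA"),
    MetadataObject("Example concept", "terminology object", "iMETA"),
    MetadataObject("example_table", "SQL object", "other application"),
]


def overview(repository):
    """Group object names by object type, navigator-style."""
    grouped = defaultdict(list)
    for obj in repository:
        grouped[obj.object_type].append(obj.name)
    return dict(grouped)


summary = overview(REPOSITORY)
```

Here `summary` lists every object once under its type, including the object that is not managed by iMETA itself.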

Classifications

Classification – kind of umbrella that comprises one or several classification versions

Classification version – structured list of mutually exclusive categories

Classification variant – the original categories of classification version are split or regrouped to provide context-specific additions to the standard structure

Correspondence tables
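The relationships between these object types can be sketched as a small data model. This is a minimal illustration, assuming unique category codes as a stand-in for mutual exclusivity. All class names, fields and sample codes are hypothetical and do not reproduce the iMETA schema or the full Neuchâtel model.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Category:
    code: str
    name: str


@dataclass
class ClassificationVersion:
    """Structured list of mutually exclusive categories."""
    name: str
    categories: List[Category] = field(default_factory=list)

    def add(self, code: str, name: str) -> None:
        # Mutual exclusivity is approximated by requiring unique codes.
        if any(c.code == code for c in self.categories):
            raise ValueError(f"duplicate category code: {code}")
        self.categories.append(Category(code, name))

    def codes(self) -> List[str]:
        return [c.code for c in self.categories]


@dataclass
class Classification:
    """Umbrella comprising one or several classification versions."""
    name: str
    versions: List[ClassificationVersion] = field(default_factory=list)


@dataclass
class ClassificationVariant:
    """Context-specific regrouping of the categories of one version."""
    based_on: ClassificationVersion
    groups: Dict[str, List[str]]  # group name -> original category codes


@dataclass
class CorrespondenceTable:
    """Links category codes of a source version to a target version."""
    source: ClassificationVersion
    target: ClassificationVersion
    links: Dict[str, List[str]] = field(default_factory=dict)

    def link(self, source_code: str, target_code: str) -> None:
        if source_code not in self.source.codes():
            raise KeyError(f"unknown source code: {source_code}")
        if target_code not in self.target.codes():
            raise KeyError(f"unknown target code: {target_code}")
        self.links.setdefault(source_code, []).append(target_code)

    def translate(self, source_code: str) -> List[str]:
        return self.links.get(source_code, [])


# Hypothetical example: category A of the old version is split in the new one.
old = ClassificationVersion("Example version 1")
old.add("A", "Category A")
old.add("B", "Category B")

new = ClassificationVersion("Example version 2")
new.add("A1", "Category A, part 1")
new.add("A2", "Category A, part 2")
new.add("B", "Category B")

example = Classification("Example classification", [old, new])
variant = ClassificationVariant(new, {"A (regrouped)": ["A1", "A2"]})

table = CorrespondenceTable(old, new)
table.link("A", "A1")
table.link("A", "A2")
table.link("B", "B")
```

With this data, `table.translate("A")` returns both new codes, which is the one-to-many case a correspondence table has to handle when a category is split between versions.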

Classification screen

Statistical activities (I)

Statistical activity – the collection, storage, transformation and distribution of statistical information

At Statistics Estonia the concept of statistical activity includes not only the statistical surveys conducted, but also the management of statistical registers, the compilation of yearbooks and analytical publications, and other work related to the production of statistics.

Every year a new version (instance) of each statistical activity is described.

Statistical activities (II)

The description of statistical activity is based on ESMS concepts, supplemented by special attributes needed for Statistics Estonia.

All attributes are grouped as:
General information
Methodology
Quality
Dissemination
VVIS (electronic data collection system)
E-respondent

New attributes and groups can be added

Statistical activity screen

Statistical activities (III)

The description of statistical activities makes it possible:
To present descriptions of surveys according to the Euro-SDMX structure (ESMS) on the web;
To create the Statistical Programme document for 5 years;
To present a list of conducted statistical activities, with a short description, by year on the web;
To create an XML file according to Euro-SDMX MSD.
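The idea of describing each yearly version of an activity with grouped attributes and then exporting it as XML can be sketched as follows. The group names come from the slides. The attribute names, the activity code and the XML element names are hypothetical, and the output is a simplified layout that does not follow the actual Euro-SDMX MSD schema.

```python
import xml.etree.ElementTree as ET

# Attribute groups listed on the slide. The attributes stored inside them
# below are hypothetical examples, not the real ESMS concept list.
GROUPS = [
    "General information", "Methodology", "Quality",
    "Dissemination", "VVIS", "E-respondent",
]


def new_version(code, year, groups=GROUPS):
    """Create one yearly version (instance) of a statistical activity."""
    return {"code": code, "year": year, "groups": {g: {} for g in groups}}


def set_attribute(activity, group, name, value):
    """Store an attribute; a group that does not exist yet is created."""
    activity["groups"].setdefault(group, {})[name] = value


def to_xml(activity):
    """Serialise the description to a simplified, made-up XML layout."""
    root = ET.Element(
        "StatisticalActivity",
        {"code": activity["code"], "year": str(activity["year"])},
    )
    for group, attrs in activity["groups"].items():
        if not attrs:
            continue  # skip groups that have no attributes yet
        group_el = ET.SubElement(root, "AttributeGroup", {"name": group})
        for name, value in attrs.items():
            ET.SubElement(group_el, "Attribute", {"name": name}).text = value
    return ET.tostring(root, encoding="unicode")


# Hypothetical activity code and attribute values.
activity = new_version("EXAMPLE-01", 2013)
set_attribute(activity, "Methodology", "Data collection", "Web questionnaire")
set_attribute(activity, "Quality", "Timeliness", "Hypothetical value")
xml_text = to_xml(activity)
```

Storing the description once and generating each output from it (web page, programme document, XML file) reflects the stated purpose of storing metadata once for all usages.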

Defining variables

Simplified variables model

Architecture of the information system

[Figure: architecture of the statistical information system. Labels: Data collection, Processing, Statistical analysis, Dissemination; Statistical registers, Administrative registers, Data Warehouse, Metadata system; Persons, Economic entities, Users. Systems shown: iMETA, VVIS, ADAM, eGeostat, SRS, VAIS, eSTAT, PX-Web, Census-HUB, KUNDE, Analyse.]

[Figure: components of the information system with years. Labels: Statistical Business Register (1994), metadata management (2001), Statistical Farm Register (2002), iMeta (2011), CRM, Px-web, eStat, VVIS, system for statistical registers, economic entities, persons, Generic Statistical Business Process Model. Further annotations: 2006, 2011, 2012, "project started", "planned".]

Links between metadata repository and other systems

[Figure: links between the metadata repository (META) and other systems: VAIS, iMETA, eSTAT, ADF, URMA, VVIS, KUNDE, SRS, SDMX repository.]

Current metadata projects

Implementation of ESS metadata standards for describing statistical activities and disseminating data

Describing reference metadata (in the metadata system) for all statistical activities according to ESMS

Modernisation of the processes for describing and updating reference metadata

Preparation for dissemination of reference metadata on the website

Technical development of iMeta

Future plans

Dissemination of ESMS-based reference metadata on the web is planned for 1 July 2013

Release of concepts and definitions on the website of Statistics Estonia. Replacement of the current HTML version of concepts and methodology in the output database

Further development of iMeta in line with developments of other components of the statistical information system

Conclusions

Several parallel developments cause problems in specifying common requirements for all systems. Unfortunately, all new developments bring along some changes in the metadata system.

A responsible unit and responsible persons should be appointed for the management of metadata.

Creating new metadata, filling in the gaps and harmonisation are very labour-intensive. Support from management and other people in the office is essential for success.

Special guidelines and rules should be created. All potential internal users of the metadata system should be informed about the development process and involved in it.

Thank you for your attention!

kaja.sostra@stat.ee
