15
www.destatis.de Innovative data collection using big data sources for official statistics Natalie Rosenski, Maren Köhlmann (Destatis) UNECE Workshop on Statistical Data Collection 14 October 2019, Geneva © Federal Statistical Office of Germany (Destatis) | Institute for Research and Development in Federal Statistics

Innovative data collection using statistics

  • Upload
    others

  • View
    4

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Innovative data collection using  statistics

www.destatis.de

Innovative data collection using big data sources for official statistics

Natalie Rosenski, Maren Köhlmann (Destatis)

UNECE Workshop on Statistical Data Collection

14 October 2019, Geneva

© Federal Statistical Office of Germany (Destatis) | Institute for Research and Development in Federal Statistics

Page 2: Innovative data collection using  statistics

www.destatis.de

Content

2

Background

Topic 1: ESSnet Big Data II – WP L

Topic 2: Remote Sensing Data

Topic 3: Mobile Network Data

Outlook

© Federal Statistical Office of Germany (Destatis) | Institute for Research and Development in Federal Statistics

Page 3: Innovative data collection using  statistics

www.destatis.de

Background

3

Improvement of timeliness, accuracy and relevance

Reduction of response burden

Use of new digital data for official statistics

Diverse data sources

» Mobile Network Data

» Remote Sensing Data

» Data from the Internet of Things (IoT)

© Federal Statistical Office of Germany (Destatis) | Institute for Research and Development in Federal Statistics

Background ESSnet Big Data II – WP L Remote Sensing Data Mobile Network Data Outlook

© Gerd Altmann, Pixabay

Page 4: Innovative data collection using  statistics

www.destatis.de

ESSnet Big Data II – WP L

4

12 partner from 11 countries: AT, BG, FI, FR, IT, NL, NO, PL, PT, UK & DE

Germany (Destatis) in lead for WP L “Preparing Smart Statistics”

Duration: November 2018 – October 2019

Goals:

» Exploration of the IoT in order to produce trusted smart statistics

» Overview of relevant topics for official statistics

» Recommendations for possible follow-up studies

© Federal Statistical Office of Germany (Destatis) | Institute for Research and Development in Federal Statistics

Background ESSnet Big Data II – WP L Remote Sensing Data Mobile Network Data Outlook

Page 5: Innovative data collection using  statistics

www.destatis.de

ESSnet Big Data II – WP L

5

4 Tasks

» Smart Farming

» Smart Cities

» Smart Devices

» Smart Traffic

© Federal Statistical Office of Germany (Destatis) | Institute for Research and Development in Federal Statistics

Background ESSnet Big Data II – WP L Remote Sensing Data Mobile Network Data Outlook

Page 6: Innovative data collection using  statistics

www.destatis.de

Remote Sensing Data

6© Federal Statistical Office of Germany (Destatis) | Institute for Research and Development in Federal Statistics

Background ESSnet Big Data II – WP L Remote Sensing Data Mobile Network Data Outlook

Smart Business Cycle Statistics» Flash estimates of economic indicators

Makswell » MAKing Sustainable development and WELL-being frameworks work for policy analysis

Deep Solaris» (Semi-)automatic analysis of satellite and aerial images for energy system transformation

and sustainability indicators

ESSnet Big Data II – WP H “Earth Observation”» Linking remote sensing data to survey data

Page 7: Innovative data collection using  statistics

www.destatis.de

Remote Sensing Data - Challenges

7

» Frequency of images and weather-conditions

» Radar images vs. optical satellite images

» High resolution data only commercially available and costly

» Low coverage of high resolution images

© Federal Statistical Office of Germany (Destatis) | Institute for Research and Development in Federal Statistics

Background ESSnet Big Data II – WP L Remote Sensing Data Mobile Network Data Outlook

Page 8: Innovative data collection using  statistics

www.destatis.de

Remote Sensing Data – Example: ESSnet Big Data II

8

ESSnet Big Data II - WP H “Earth observation”

Analysis of quality of life indicators

» Indicators with spatial dimension

» Urban heat islands, urban greens, noise pollution, air quality

» Linking neighbourhood characteristics derived from EO data to socio-demographic characteristics from the (micro) census

» Inhabitants, age structure, civil status, income, education

© Federal Statistical Office of Germany (Destatis) | Institute for Research and Development in Federal Statistics

Background ESSnet Big Data II – WP L Remote Sensing Data Mobile Network Data Outlook

Page 9: Innovative data collection using  statistics

www.destatis.de

Remote Sensing Data – Example:ESSnet Big Data II

9© Federal Statistical Office of Germany (Destatis) | Institute for Research and Development in Federal Statistics

Background ESSnet Big Data II – WP L Remote Sensing Data Mobile Network Data Outlook

© Calculations by BKG using Landsat 8 data

Surface temperature – heat islands

© Google Maps, Rhine-Main region

Page 10: Innovative data collection using  statistics

www.destatis.de

Mobile Network Data

10

ESSnet Big Data II: WP I “Mobile Network Data”

» Dynamic representation of the population via mobile network data

Comparison of static mobile network data with census data

» To represent day and resident population

Exploration of dynamic mobile network data for commuter statistics

» To illustrate commuter flows

» Compare data with census 2011 and existing commuter atlas

© Federal Statistical Office of Germany (Destatis) | Institute for Research and Development in Federal Statistics

Background ESSnet Big Data II – WP L Remote Sensing Data Mobile Network Data Outlook

Page 11: Innovative data collection using  statistics

www.destatis.de

Mobile Network Data

11

Anonymised and aggregated mobile network data

» Minimum activity per grid ≥ 30 (lately ≥ 5)

» Test data of one German federal state for a statistical week

» Dwell time > 2 hours

» Specific geometry

» Demographic characteristics: age group, gender, mobile country code

© Federal Statistical Office of Germany (Destatis) | Institute for Research and Development in Federal Statistics

Background ESSnet Big Data II – WP L Remote Sensing Data Mobile Network Data Outlook

32%

35%

33% Telekom

Vodafone

Telefónica

Market share of mobile network operators (Germany, 2nd quarter 2019)

Data source: Federal Network Agency

Page 12: Innovative data collection using  statistics

www.destatis.de

Mobile Network Data - Challenges

12

» No legally regulated permanent data access

» No distinction between personal and device-related SIM cards

» Socio-demographic characteristics only for contractual customers

» Bias due to double counting of persons possessing several mobile phones and family tariffs

» Insufficient disclosure of the extrapolation methodology used by providers

» Available periods of mobile network data and census data do not always match well enough

» etc.

© Federal Statistical Office of Germany (Destatis) | Institute for Research and Development in Federal Statistics

Background ESSnet Big Data II – WP L Remote Sensing Data Mobile Network Data Outlook

Page 13: Innovative data collection using  statistics

www.destatis.de

Mobile Network Data – Example

13

Illustration of the population via mobile network data

Graph: Pearson correlation coefficients for census values and mobile network data by statistical days and time periods

© Federal Statistical Office of Germany (Destatis) | Institute for Research and Development in Federal Statistics

Background ESSnet Big Data II – WP L Remote Sensing Data Mobile Network Data Outlook

Time of day

Pear

son

corr

elat

ion

coef

fici

ent

0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 230.62

0.67

0.72

0.77

0.82

0.87

Monday

Tuesday-Thursday

Friday

Saturday

Sunday

Page 14: Innovative data collection using  statistics

www.destatis.de

Outlook

14

Combination of survey, administrative and new digital data

Very promising to enrich the available official data

» Improve quality of data

» Accuracy: Provide more detailed data for policy decisions

» Timeliness: Improve punctuality of official statistics

» Relevance: Develop new research areas and questions

» Reduce response burden reduction of costs ?

© Federal Statistical Office of Germany (Destatis) | Institute for Research and Development in Federal Statistics

Background ESSnet Big Data II – WP L Remote Sensing Data Mobile Network Data Outlook

Page 15: Innovative data collection using  statistics

www.destatis.de

15

Thank You for your attention!

Institute for Research and Development in Federal Statistics

[email protected]

Maren Köhlmann

+49 (0) 611 / 75 42 95

[email protected]

© Federal Statistical Office of Germany (Destatis) | Institute for Research and Development in Federal Statistics