14
Big Data Research and Innovation A Gap Analysis from the Perspective of the European Big Data Value PPP Andreas Metzger (paluno @ U Duisburg-Essen, BDVA) PICASSO 2017, Minneapolis

Big Data Research and Innovation A Gap Analysis from the ...generation Big Data systems •…continuous development and operations (DevOps) approaches and techniques for Big Data

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Big Data Research and Innovation A Gap Analysis from the ...generation Big Data systems •…continuous development and operations (DevOps) approaches and techniques for Big Data

Big Data Research and InnovationA Gap Analysis from the Perspective of the European Big Data Value PPP

Andreas Metzger (paluno @ U Duisburg-Essen, BDVA)

PICASSO 2017, Minneapolis

Page 2: Big Data Research and Innovation A Gap Analysis from the ...generation Big Data systems •…continuous development and operations (DevOps) approaches and techniques for Big Data

Background: Big Data Value PPP and BDVA

BDV PPP = Public Private Partnership• Public side = European Commission – EC• Private side = BDVA• Develop new Big Data technologies, products and

services to give European industry leading position• Leverage public funding with additional investments• Based on roadmaps for research and innovation• Implemented through calls for actions under H2020

BDVA = Big Data Value Association

• 169 members from 28 different countries• 55% industry/other

• 45% academia/research

2Source: http://www.kowi.de/kowi/verbundforschung/Partnerschaften/cppps/cppps.aspx

Page 3: Big Data Research and Innovation A Gap Analysis from the ...generation Big Data systems •…continuous development and operations (DevOps) approaches and techniques for Big Data

Agenda• BDVA Research Priorities

• Gap Analysis of ongoing R&I actions

• Conclusion

PICASSO 2017, Minneapolis

Page 4: Big Data Research and Innovation A Gap Analysis from the ...generation Big Data systems •…continuous development and operations (DevOps) approaches and techniques for Big Data

BDV PPP Roadmap for research and innovationBDVA Strategic Research and Innovation Agenda (SRIA)

Available fromhttp://www.bdva.eu/

Technical Priorities:• Data Management• Data Processing Architectures• Data Analytics• Data Visualisation and User Interaction• Data Protection

Non-technical Priorities:• Skills development • Ecosystems and Business Models• Policy, Regulation and Standardisation• Social perceptions and societal implication

Page 5: Big Data Research and Innovation A Gap Analysis from the ...generation Big Data systems •…continuous development and operations (DevOps) approaches and techniques for Big Data

BDVReferenceModel

Note:Horizontals donot implylayered architecture

Data Protection

Engineering & DevOps Standards

Data Processing Architectures

Batch InteractiveStreaming/Real-time

Other

Data Analytics

Descriptive Predictive Prescriptive

Data Management

Collection / Ingestion

Preparation/Curation

AccessLinking/

Integration

Data Visualisation and User Interaction1D 2D 3D

VR/AR

1+1D 2+1D 3+1D

Aim: “Structuring” the Priorities

Page 6: Big Data Research and Innovation A Gap Analysis from the ...generation Big Data systems •…continuous development and operations (DevOps) approaches and techniques for Big Data

Agenda• BDVA Research Priorities

• Gap Analysis of ongoing R&I actions

• Conclusion

PICASSO 2017, Minneapolis

Page 7: Big Data Research and Innovation A Gap Analysis from the ...generation Big Data systems •…continuous development and operations (DevOps) approaches and techniques for Big Data

Gap!AEGIS AEGISSODA BDOcean

DataBio

TT TT

TT TT TT

TTAEGIS

AEGISAEGIS

AEGIS AEGIS

AEGISAEGIS

AEGIS

AEGIS

euBizGr

euBizGr euBizGr

euBizGreuBizGr

euBizGreuBizGr

EWSEWS EWS

EWS EWS

FashBrn

FashBrn

FashBrn

FashBrn FashBrn FashBrn

FashBrn FashBrn

SODA

SODA

SODAQROWD

QROWD

QROWD

BDOcean

BDOcean

BDOcean

BDOcean

BDOcean

BDOcean

BDOcean

Gap!Technology Map ofongoing BDV PPP actions Gap!

Gap!

Technicalprojects of ongoing PPP actionsanalysed

= project developsnew technology

Page 8: Big Data Research and Innovation A Gap Analysis from the ...generation Big Data systems •…continuous development and operations (DevOps) approaches and techniques for Big Data

Drilldown: „Engineering & DevOps“

• Many Big Data technologies (“building blocks”) and “platforms” available (open source and commercial)

• But: Lack of…• …proven and empirically sound development and

engineering paradigms and methodologies for building next generation Big Data systems

• …continuous development and operations (DevOps) approaches and techniques for Big Data Value systems (considering different life-cycles of data and software)

• …end-to-end development, deployment and operations tool chains

8PICASSO 2017, Minneapolis

Page 9: Big Data Research and Innovation A Gap Analysis from the ...generation Big Data systems •…continuous development and operations (DevOps) approaches and techniques for Big Data

Drilldown: „Engineering & DevOps“

(1) Engineering methodologies and tooling

• Systematic integration of diverse, multi-disciplinary aspects of data analytics, system development, quality assurance and operations

• i.e., beyond CRISP-DM, OLAP, …• E.g., support for business analyst to express analytics

questions and then seamlessly move to implementation

• Helps addressing skills / experience differences

(Some) Challenges:

• How to accommodate the diverse “data professional” roles (e.g., data scientist, business/data analyst, …) as new stakeholders during systems engineering processes?

• How to integrate different concepts, techniques and tools along software/system life-cycle?

• How to reconceptualise role of requirements engineering if big data insights not known in advance?

9PICASSO 2017, Minneapolis

ENGINEER

Page 10: Big Data Research and Innovation A Gap Analysis from the ...generation Big Data systems •…continuous development and operations (DevOps) approaches and techniques for Big Data

Drilldown: „Engineering & DevOps“

(2) Quality Assurance

• Techniques and tools for delivering trustworthy and reliable Big Data systems

• Testing, verification, monitoring, self-diagnosis, …

(Some) Challenges:• How to test data-intensive systems?

• Volume: how to generate sufficient / representative test cases (if at all)?

• Velocity: how to monitor / self-diagnose at run time?

• Veracity: how to assure system quality if “uncertain” about data quality?

• How does testing data-intensive systems relate to (cross-)validation done in data analytics?

• How to provide quality guarantees, e.g., via formal verification, to give confidence in the systems’ quality?

10PICASSO 2017, Minneapolis

Page 11: Big Data Research and Innovation A Gap Analysis from the ...generation Big Data systems •…continuous development and operations (DevOps) approaches and techniques for Big Data

Drilldown: „Engineering & DevOps“

(3) DevOps++• Facilitate continuous release and

improvement of big data systems• Tight integration of system

development, data analytics and cloud/fog deployment/operations

(Some) Challenges:

• How to reconcile the different cycle times of data life-cycle vs. system life-cycle?

• How to incorporate and align dynamic self-adaptation (e.g. dynamic cloud resource management vs. business process adaptation based on analytics)?

• How to ensure data protection along various life-cycles and dynamic adaptations?

11PICASSO 2017, Minneapolis

DATA

CODE SYSTEM

Page 12: Big Data Research and Innovation A Gap Analysis from the ...generation Big Data systems •…continuous development and operations (DevOps) approaches and techniques for Big Data

„Engineering & DevOps“ Gap also in the US?

12PICASSO 2017, Minneapolis

[PICASSO Opportunity Report, 03/2017]

Gap? Joint opportunity?

Page 13: Big Data Research and Innovation A Gap Analysis from the ...generation Big Data systems •…continuous development and operations (DevOps) approaches and techniques for Big Data

Agenda• BDVA Research Priorities

• Gap Analysis of ongoing R&I actions

• Conclusion

PICASSO 2017, Minneapolis

Page 14: Big Data Research and Innovation A Gap Analysis from the ...generation Big Data systems •…continuous development and operations (DevOps) approaches and techniques for Big Data

Conclusion

Gap!

Gap!

Gap!

Gap!

Another Opportunity

R&I Opportunities: