55
PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig Mannaert, University of Antwerp, Belgium. Alexander Mirnig, University of Salzburg, Austria. Charles Perez, Paris School of Business, France. Hans-Werner Sehring, Namics, Germany.

Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

PANEL ON PATTERNS/CONTENT

Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017

Herwig Mannaert, University of Antwerp, Belgium. Alexander Mirnig, University of Salzburg, Austria. Charles Perez, Paris School of Business, France. Hans-Werner Sehring, Namics, Germany.

Page 2: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

Agenda.

FIRST Namics in a Nutshell. SECOND Proposition.

Page 3: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

FIRST

Namics in a Nutshell.

Page 4: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

Namics. 4

Jürg Stuker CEO NAMICS

“We are pioneers and experts in the field of digital transformation – and we have been since 1995. As an independent, interdisciplinary full-service partner, we work with you to digitize your business models and critical processes. Your long-term success is the focus of everything we do.”

Page 5: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

Namics. NAMICS IN A NUTSHELL

Our services. 5

E-Commerce

Digital Communications

Website & Portals

Mobile Business Apps

Digital Workplace

Managed Services

DIGITAL ENTERPRISE

& BRAND

DIGITAL BUSINESS

DIGITAL EXPERIENCE

DIGITAL ENTERPRISE

& BRAND

DIGITAL BUSINESS

DIGITAL EXPERIENCE

Page 6: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

Namics. NAMICS IN A NUTSHELL

Since 1995. Close proximity. Namics. 6

6 LOCATIONS Frankfurt

Hamburg

Zurich Belgrade

Munich

Saint Gallen

Page 7: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

SECOND

Proposition.

Page 8: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

Namics.

Proposition.

Data in databases is typically created according to a schema.

Proposition: To analyze big data collections, a schema might be used “in reverse” in order to match patterns in an existing data collection.

Are there patterns that are similar to a schema/type system? Is pattern matching in these case a type checking task?

Then, pattern matching could be performed as “duck typing”.

Bottom line: We are speaking about a rich “schema”/“type system” here, including higher-level concepts, and constraints, rules, laws.

8

Page 9: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

Namics.

Some Background from Semiotics and from Philosophy.

Peirce’s triadic Semiotics.

Building up Firstness, Secondness, and Thirdness concepts.

Cassirer’s Epistemology.

Co-evolution of instances and their classification.

Plus, from a computer science perspective, comp. Joseph Goguen.

Two kinds of classifications: formal structure and meaning.

9

Page 10: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

Namics.

If I had to solve it: M3L as a “Pattern Recognition” Language.

The Minimalistic Meta Modeling Language (M3L) provides such rich modeling capabilities.

It may be worth the attempt to map it to (existing) Big Data collections.

10

Page 11: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

Namics.

M3L as a “Pattern Recognition” Language. First Try.

Trivial example: Detect recurring visitors in tracking data. Q: Do we have recurring visitors? (And how many?)

Visitor { Visit is a Time from …; Identifier is a Cookie from …; } RecurringVisitor is a Visitor { Visitor1 is a Visitor; Visitor2 is a Visitor; Identifier from Visitor1 is the Identifier from Visitor2; Visits; } |= RecurringVisitor is the Visitor1, the Visitor2 { Visit from Visitor1 is a Visits; Visit from Visitor2 is a Visits; }

11

Page 12: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

Thank you. Namics. © N A M I C S A G 2 0 1 7

[email protected]. Senior Solution Architect.

Page 13: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

Patterns: Problems, Solutions, and Everything In-between About finding the right problem descriptions and the difficulty of proving state-of-the-art solutions.

Page 14: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

http://hci.sbg.ac.at http://hci.sbg.ac.at http://hci.sbg.ac.at

Content

•  Background and Automotive Design

•  Design Patterns – a brief overview

•  Finding the right problem for your solution

•  Proven solutions in a rapidly evolving environment

•  Conclusion

PATTERNS ’17 Panel 2 02/21/17

Page 15: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

http://hci.sbg.ac.at http://hci.sbg.ac.at http://hci.sbg.ac.at

Background

Center for Human-Computer-Interaction, University of Salzburg, Austria

Background: General Philosophy of Science and Science of Consciousness Interdisciplinary Workgroup Neurosignaling, Department of Zoology, University of Salzburg Since 2012: Center for HCI Main topics: (Semi-)autonomous vehicles and persuasive interfaces, interface evaluation (Usability and User Experience), definitions and formal approaches in HCI, in-vehicle UIs, theories of consciousness

PATTERNS ’17 Panel 3 02/21/17

Page 16: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

http://hci.sbg.ac.at http://hci.sbg.ac.at http://hci.sbg.ac.at

Background – Center for HCI in Salzburg

PATTERNS ’17 Panel 4 02/21/17

established as a research

unit at the ICT&S

Center 2005

education science

communication science

psychology

sociology

division at the

Department of

Computer Sciences

2015

computer linguistics

computer science multimedia

interface design

interaction design ~35

people

special institution

at the University of

Salzburg 2016

Page 17: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

http://hci.sbg.ac.at http://hci.sbg.ac.at http://hci.sbg.ac.at

Automotive Design

PATTERNS ’17 Panel 5 02/21/17

* © Arno Laminger 2015

Page 18: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

http://hci.sbg.ac.at http://hci.sbg.ac.at http://hci.sbg.ac.at

Design Patterns - Overview

Short Definition: •  A (design) pattern is a structured documentation to

a proven solution to a reoccurring problem, embedded in at least one of the contexts it occurs in.

Some advantages: •  they capture expertise and make it accessible to

non-experts •  their names collectively form a vocabulary that

helps developers communicate better.

PATTERNS ’17 Panel 6 02/21/17

Page 19: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

http://hci.sbg.ac.at http://hci.sbg.ac.at http://hci.sbg.ac.at

Finding the right Problem

Evolution of the problem description 1)  Where should what be displayed?

PATTERNS ’17 Panel 7 02/21/17

Page 20: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

http://hci.sbg.ac.at http://hci.sbg.ac.at http://hci.sbg.ac.at

Finding the right Problem

Evolution of the problem description 1)  Where should what be displayed?

2)  Where should displays be positioned in the cockpit?

PATTERNS ’17 Panel 8 02/21/17

Page 21: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

http://hci.sbg.ac.at http://hci.sbg.ac.at http://hci.sbg.ac.at

Finding the right Problem

Evolution of the problem description 1)  Where should what be displayed?

2)  Where should displays be positioned in the cockpit?

3)  Where does a driver look first? Which areas are quicker to access for the eyes than others?

PATTERNS ’17 Panel 9 02/21/17

Page 22: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

http://hci.sbg.ac.at http://hci.sbg.ac.at http://hci.sbg.ac.at

Finding the right Problem

Final Problem Statement: •  Information sources are spread throughout the

cockpits of cars (instrument cluster, center console, in and around the steering wheel). Differently sized displays in different positions make it difficult for the driver to locate the right information at the right time, thus being a potential source of distraction for the driver.

PATTERNS ’17 Panel 10 02/21/17

Page 23: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

http://hci.sbg.ac.at http://hci.sbg.ac.at http://hci.sbg.ac.at

Finding the right Problem

Final Problem Statement: •  Information sources are spread throughout the

cockpits of cars (instrument cluster, center console, in and around the steering wheel). Differently sized displays in different positions make it difficult for the driver to locate the right information at the right time, thus being a potential source of distraction for the driver.

PATTERNS ’17 Panel 11 02/21/17

Page 24: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

http://hci.sbg.ac.at http://hci.sbg.ac.at http://hci.sbg.ac.at

Finding the right Problem

What happened? •  As we figure out regarding what we expect from a good

solutions, the problem statement gets more and more refined.

PATTERNS ’17 Panel 12 02/21/17

Page 25: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

http://hci.sbg.ac.at http://hci.sbg.ac.at http://hci.sbg.ac.at

Finding the right Problem

What happened? •  As we figure out regarding what we expect from a good

solutions, the problem statement gets more and more refined.

•  Ref. Philosophy of Science: A question is a good question if we can state what an answer needs to fulfill in order to answer the question either positively or negatively

PATTERNS ’17 Panel 13 02/21/17

Page 26: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

http://hci.sbg.ac.at http://hci.sbg.ac.at http://hci.sbg.ac.at

Finding the right Problem

What happened? •  As we figure out regarding what we expect from a good

solutions, the problem statement gets more and more refined.

•  Ref. Philosophy of Science: A question is a good question if we can state what an answer needs to fulfill in order to answer the question either positively or negatively

•  E.g. What is (the) truth? Is it an object, a linguistic construct, a mathematical function on a set of syntax?

PATTERNS ’17 Panel 14 02/21/17

Page 27: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

http://hci.sbg.ac.at http://hci.sbg.ac.at http://hci.sbg.ac.at

Proving the solution

What does it mean for a novel technology to be proven?

•  The Rule of Three!

PATTERNS ’17 Panel 15 02/21/17

Page 28: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

http://hci.sbg.ac.at http://hci.sbg.ac.at http://hci.sbg.ac.at

Proving the solution

What does it mean for a novel technology to be proven?

•  The Rule of Three!

•  But:

PATTERNS ’17 Panel 16 02/21/17

Page 29: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

http://hci.sbg.ac.at http://hci.sbg.ac.at http://hci.sbg.ac.at

Proving the solution

What does it mean for a novel technology to be proven?

•  The Rule of Three!

•  But: •  Do prototypical implementations count?

PATTERNS ’17 Panel 17 02/21/17

Page 30: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

http://hci.sbg.ac.at http://hci.sbg.ac.at http://hci.sbg.ac.at

Proving the solution

What does it mean for a novel technology to be proven?

•  The Rule of Three!

•  But: •  Do prototypical implementations count?

•  What if the solution is based on one implementation and lab data?

PATTERNS ’17 Panel 18 02/21/17

Page 31: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

http://hci.sbg.ac.at http://hci.sbg.ac.at http://hci.sbg.ac.at

Proving the solution - Proposal

A novel technology solution is proven if: •  It satisfies the Rule of Three

or:

PATTERNS ’17 Panel 19 02/21/17

Page 32: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

http://hci.sbg.ac.at http://hci.sbg.ac.at http://hci.sbg.ac.at

Proving the solution - Proposal

A novel technology solution is proven if: •  It satisfies the Rule of Three

or: •  It has been successfully implemented at least once in a

setting close to its real application context

and

PATTERNS ’17 Panel 20 02/21/17

Page 33: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

http://hci.sbg.ac.at http://hci.sbg.ac.at http://hci.sbg.ac.at

Proving the solution - Proposal

A novel technology solution is proven if: •  It satisfies the Rule of Three

or: •  It has been successfully implemented at least once in a

setting close to its real application context

and

•  There is compelling evidence to expect the solution to work in other application instances.

PATTERNS ’17 Panel 21 02/21/17

Page 34: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

http://hci.sbg.ac.at http://hci.sbg.ac.at http://hci.sbg.ac.at

Summing Up

•  Good answers require good questions being asked

PATTERNS ’17 Panel 22 02/21/17

Page 35: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

http://hci.sbg.ac.at http://hci.sbg.ac.at http://hci.sbg.ac.at

Summing Up

•  Good answers require good questions being asked

•  It should be possible to provide a provenness-criterion more suitable for rapidly changing domains and environments.

PATTERNS ’17 Panel 23 02/21/17

Page 36: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

http://hci.sbg.ac.at http://hci.sbg.ac.at http://hci.sbg.ac.at

Summing Up

•  Good answers require good questions being asked

•  It should be possible to provide a provenness-criterion more suitable for rapidly changing domains and environments.

Let me know what you th ink!

PATTERNS ’17 Panel 24 02/21/17

Page 37: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

Contact Mag. Alexander G. Mirnig Center for Human-Computer Interaction University of Salzburg Jakob-Haringer-Straße 8 / Techno 5 5020 Salzburg, Austria +43 662 8044/4834 [email protected] http://hci.sbg.ac.at/mirniga

Page 38: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

Patterns and Big Data: Finding a Needle in a Haystack Detecting Patterns in Big Social Network Data

Charles PerezPSB Paris School of Business,

Chair D3 Digital, Data, Design, Paris, France

email:[email protected]

PATTERNS 2017, February 23, 2017 - Athens, Greece

Page 39: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

Social network analysis before Big Data

• Euler, 1741: Concept of graph (7 Bridges of Konigsberg) • Moreno, 1930’s: modelling of social relationships as a graph

(sociogram, sociometry) • Simmel, 1950’s: importance of nature of relationships more than a

group itself • Barnes, 1954: Social network (Parish in western Norway) • Milgram, 1967: Small worlds (Familiar Stranger) • White, 1976: Matrix algebra • Granovetter, 1973: Strength of weak ties (micro vs macro in

sociology)

Sociology, Psychology, Mathematics

The main issue was to grab/find social data

Page 40: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

Social network analysis & patterns today A big data issue

• Social networks (e.g. society) are digitalised (OSN) • Permanent connectivity: online social networks, smartphones • Permanent memory: activities are stored as computer data (with

context: content, space, time, etc.)

• OSN are composed of billions of users with billions of messages exchanged everyday (Big Data)

• New challenges & opportunities for patterns discovery

• Implications to business, management, sociology, computer science, mathematics, etc.

The issue is to get value/patterns from Big Data

Page 41: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

Network science: a key to patterns discovery?

• Modelling complex systems with a basic graph model denoted G(N,E) • N = Nodes representing social actors • E = Edges representing interactions

Page 42: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

OSN patterns discovered in Big Data graphs

• Many OSN have been proved to be: • Small worlds: small

average distance and high clustering coefficient

• Scale-free: Power law degree distribution

• Applications to contagion, influence, buzz, etc.

Ingr

edie

nts

: Tim

e &

pref

eren

tial a

ttach

men

t

From : Network Science ALBERT-LÁSZLÓ BARABÁSI

Page 43: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

Questions to be answered and considered for Big data approach to social network analysis

• Generally • What data is valuable for patterns search? • How to handle Big Data approach with privacy of users?

• Social Network Analysis • Identification of the right connections / interactions? • Identification of the right analysis scale (space/time)? • Automation of patterns detection? • What is the source of preferential attachment? • What are the limits to human patterns modelling?

• Open questions • Are there patterns everywhere in OSN? • What is the validity, lifetime of patterns? • The scaling of patterns (e.g. migration models)? • What is the place of randomness in patterns?

Page 44: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

Patterns and Big DataFinding a Needle in a Haystack

Some Skeptical Thoughts

Herwig Mannaert

University of Antwerp

Normalized Systems Institute

PATTERNS 2017

February 23

Page 45: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

1

My Research

• Study modular structures under change:

- using systems theoretic stability, thermodynamicentropy, and plain combinatorics

- to avoid “combinatorial effects” that impede reuse:

• duplications through lack of separation of concerns

• ripple effects through various types of coupling

• Examples:

- Software: law of increasing complexity / lack of reuse

- Education: duplications in content and descriptions

- Financial: creation of duplicate ledgers due to reporting

- Legislation: increasing amount of impacts of new laws

- …

Page 46: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

2

~ Patterns and Big Data

• Importance of patterns:

- In order to avoid combinatorial effects, we canderive very stringent constraints

- Designing evolvable modular structures withoutcombinatorial effects requires domain patterns

• E.g. software, education, financial reporting

• Challenges of big data:

- In order to master complexity, engineers needhierarchical structure

• E.g microprocessors, rockets, software systems

- Structure is in general both multi-dimensional andhas many layers of abstraction

Page 47: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

3

Thought 1: On Causality

Once upon a time:

Not every correlation is causal. For instance, the correlationbetween people falling down the stairs and having gray hair,is not causal.

— Professor Probability Theory

Page 48: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

4

Thought 1: On Causality

• The continued quest for causal relations isbased on the discovery of correlations:

- Student major in high school as a prerequisite

- Work absence as a measure of poor health

- …

• Could it be that the sheer massive amount ofcorrelations that are being probed today, couldjeopardize the validity of traditional measuresfor statistical relevance ?

Page 49: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

5

Thought 2: On Relevance

Once upon a time:

The human genome is complex and often startling. Forinstance, the correlation seems higher between a humangenome and that of a banana, than with the one of a frog.

— Professor Human Genetics

Page 50: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

6

Thought 2: On Relevance

• The continued quest for causal relationsand/or patterns is based on low-level data:

- Individuals clicking on individual links

- Individuals registering an interest

- …

• Could it be that on a bit-per-bit basis, thecorrelation between a Shakespeare play andan adult movie is higher, than the correlationbetween two Shakespeare plays ?

Page 51: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

7

Thought 2: On Relevance

From bitsto bit groups Digital Assets

Text Audio/VideoFrom charsto char words

uk fr …From wordsto sentences

pla

y

poem

Page 52: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

8

Thought 3: On Accordance

Once upon a time:

The purpose of information systems is to create a mirror ofthe world. In this way, we establish a digital reproduction ofthe world, and make it available for scientific research.

— Professor Information Systems

Page 53: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

9

Thought 3: On Accordance

• The continued quest for data patterns and/orpredictions is based on existing models:

- Entries of existing databases

- Traces of existing transaction systems

- …

• Could it be that the mirrors of informationsystems today exhibit severe scattering anddiffraction, due to increasing complexity andstructure degradation ?

Page 54: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

10

Thought 4: On Suitability

Once upon a time:

Why do we need a live transaction system to tell us the stateof a specific IoT device ? The big data lake is able to tell useverything we want to know.

— Software Solutions Architect

Page 55: Panel Computation World 2017. Patterns and Big Data ... · PANEL ON PATTERNS/CONTENT Patterns and Big Data: Finding a Needle in a Haystack. ATHENS, GREECE, 23RD OF FEBRUARY 2017 Herwig

11

Thought 4: On Suitability

• The continued quest for data patterns and/orpredictions is based on history data:

- Entries of existing databases

- Traces of existing transaction systems

- …

• Could it be that sometimes, some people turnto statistical estimations and predictions, whenthe required information is simply available orcan easily be computed ?