32
© King.com Ltd 2013 Commercially confidential Making Big Data Useful Data Speaker Anna Bongenhielm

Making Big Data Useful Data - GOTO Con

  • Upload
    others

  • View
    8

  • Download
    0

Embed Size (px)

Citation preview

© King.com Ltd 2013 – Commercially confidential

Making Big Data

Useful DataSpeaker Anna Bongenhielm

© King.com Ltd 2013 – Commercially confidential

© King.com Ltd 2013 – Commercially confidential

Anna BongenhielmProducer Data Analytics @ King

4

Little over 4 years with King

Mainly worked with

reporting and downstream

data chain

Personal favorite King

game right now – Candy

Crush Soda Saga

© King.com Ltd 2013 – Commercially confidential

We make great games

About King

Page 5

• We have developed more than 200 fun titles and offer games in over 200 countries

and regions around the world.

• 340 million average monthly unique users (Q2 2015).• Studios in Stockholm,

London, Barcelona,

Bucharest, Malmo, Berlin,

Singapore and Seattle.

• Offices in San Francisco,

Malta, Tokyo, Seoul and

Shanghai.

© King.com Ltd 2013 – Commercially confidential

The evolution of King

About King

Page 6

• Founded in 2003

• Originally games were only available through our site and portals including AOL and Yahoo!

Online skill Social Mobile

• Launched first game on

Facebook in Q2 2011• Launched first game on

mobile H2 2012

© King.com Ltd 2013 – Commercially confidential

Presentation overview1. The Daily Pipeline

2. Brief Reporting History

3. Dealing with Data Size in the Reporting Layer

4. Governing the User End of a Big Data Environment

5. Getting the Stakeholders the Right Thing

7

Credit goes to Jonathan Palmer and Jakob Berglund who

have contributed several of the slides in this presentation

© King.com Ltd 2013 – Commercially confidential

The Daily Pipeline

8

© King.com Ltd 2013 – Commercially confidential 9

Qlikview says

no

Infobright

CE says no

10 node

Hadoop

80 nodes

40 nodes

20 nodes

Yes, our data volume has grown significantly over time…

Fast changing times

© King.com Ltd 2013 – Commercially confidential

Tracking measures what players do

10

What is tracking really?

We currently use ~415 event types

(415 measurement points)

For example…

…when players start the game

…when players start a level

…who players interact with

…what players buy

…and a lot of other stuff

More than a 100 million daily players…

…create over than 15 billion events

daily!

© King.com Ltd 2013 – Commercially confidential

Our fancy data pipeline!

(much wow!)

11

What does our pipeline look like?

Game

servers

Log

server

Report

s

Data

scientis

ts

Data Warehouse

TSV log

files

Dimensional

model

Raw

dataETL

Hadoop

ExaSolution

wow such amaze much boxes very statistics so mystery

© King.com Ltd 2013 – Commercially confidential

Brief Reporting History

12

© King.com Ltd 2013 – Commercially confidential

In the beginning:

13

Most data processed directly in

Qlikview

Few developers, all located in the

same office

No need for a DWH

Information and dimensionality on a

user level basis

© King.com Ltd 2013 – Commercially confidential

The problems we face with size:

14

Adding more games

Adding new types of data

Knowing what to look for and

when to group the data

© King.com Ltd 2013 – Commercially confidential

Our QMC:

15

© King.com Ltd 2013 – Commercially confidential

Dealing with Data Size in the Reporting Layer

16

© King.com Ltd 2013 – Commercially confidential

Original architecture

17

Storage

QVD

Visualization

© King.com Ltd 2013 – Commercially confidential

Occasional architecture

18

Storage

Visualization

© King.com Ltd 2013 – Commercially confidential

Architecture with a logic layer

19

Storage

Logic

Data

QVD

Visualization

Master Data

© King.com Ltd 2013 – Commercially confidential

Architecture with a cohort layer

20

Storage

Logic

Data

Cohort

QVD

Visualization

Master Data

© King.com Ltd 2013 – Commercially confidential

User experience

21

Performance

Dependability

Trust

© King.com Ltd 2013 – Commercially confidential

Governing the User End of a Big Data Environment

22

© King.com Ltd 2013 – Commercially confidential

Our Framework: What issues does it

address?

• Governance

• Development overhead

• Scalability

• User engagement

23

© King.com Ltd 2013 – Commercially confidential

How do we do this?

• Global KPI expression dictionary

• Common code/tool sharing

• Holistic monitoring

• Governed folder structure

• Design consistency

24

© King.com Ltd 2013 – Commercially confidential

Tools – QV Helper

25

© King.com Ltd 2013 – Commercially confidential

Tools – Master Portal

26

Governing a big data environment

© King.com Ltd 2013 – Commercially confidential

Tools – Usage Insight

27

Governing a big data environment

© King.com Ltd 2013 – Commercially confidential

Tools – Usage Insight

28

Governing a big data environment

© King.com Ltd 2013 – Commercially confidential

Getting the Stakeholders the Right Thing

29

© King.com Ltd 2013 – Commercially confidential

Trade off between specialization and generalization

30

Don’t get locked in too much on

how it should be

Don’t be afraid to try things, and

iterate improvement

Make sure that it serves the

right purpose, not all

stakeholders need all

information – Smart Data

Don’t let future use limit what

you develop and use today

© King.com Ltd 2013 – Commercially confidential 31

Questions?

[email protected]

© King.com Ltd 2013 – Commercially confidential