DOLEAD & GOOGLE EVENT – JUL 2016 WWW.DOLEAD.COM
Play with Data (1)
Dolead & Google
Good news: our brain’s memory capacity is 10 times larger than we thought
Basically the whole Internet
SALK INSTITUTE - 20 January 2016
Plan
Hadrien Baradel, Product Manager at Dolead
A short history of Big Data
How to define Big Data
How do we find value in data
WHAT IS DATA ?
DATA = EVENT + CONTEXT
VALUE-DRIVING DATA = EVENT + CONTEXT + METRICS
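One way to read these two equations is as a record layout. The sketch below is only an illustration: the class names, field names and sample values are hypothetical and do not come from the talk.

```python
from dataclasses import dataclass, field
from datetime import datetime


@dataclass
class Event:
    """What happened (hypothetical example: a user added a keyword)."""
    name: str
    occurred_at: datetime


@dataclass
class DataPoint:
    """DATA = EVENT + CONTEXT."""
    event: Event
    context: dict


@dataclass
class ValueDrivingDataPoint(DataPoint):
    """VALUE-DRIVING DATA = EVENT + CONTEXT + METRICS."""
    metrics: dict = field(default_factory=dict)


def is_value_driving(point: DataPoint) -> bool:
    """A data point drives value only if at least one metric is attached."""
    return bool(getattr(point, "metrics", None))


# Hypothetical sample values, for illustration only.
raw = DataPoint(
    event=Event("keyword_added", datetime(2016, 7, 1, 10, 30)),
    context={"account": "demo-account", "channel": "search"},
)
measured = ValueDrivingDataPoint(
    event=raw.event,
    context=raw.context,
    metrics={"clicks": 12},
)
print(is_value_driving(raw), is_value_driving(measured))
```

The same event and context appear in both records; only the attached metrics make the second one usable for a decision.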
A SHORT HISTORY of Big Data
IT ALL BEGINS WITH INFORMATION AND LIBRARIES
300 BCE - 48 BCE: the Library of Alexandria is the world’s largest data storage center
IS BIG DATA REALLY NEW?
“Information explosion”
A term first used in 1941 (according to the Oxford English Dictionary)
IT ALL BEGINS WITH INFORMATION AND LIBRARIES
1944 - Fremont Rider speculates that by 2040 the Yale Library will contain 200 million books stored on 6,000 miles of shelves
JUST MISSED SOMETHING
1991 - The birth of the World Wide Web
IS BIG DATA REALLY NEW?
1989 “BIG DATA”
Early use of the term in a magazine article by the author Erik Larson,
commenting on advertisers’ use of data to target customers
IS BIG DATA REALLY NEW?
2010 Eric Schmidt
“As much data is now being created every two days as was created from the beginning of human civilization to the year 2003”
IS BIG DATA REALLY NEW?
2015: Information doubles
More data has been created in the past two years than in the entire previous history of the human race
CHARACTERIZATION of Big Data
BIG DATA DEFINITION: THE 3 Vs
Volume: data quantity
Velocity: data speed
Variety: data types
WHAT IS A ZETTABYTE?
1 zettabyte = 1 000 exabytes
= 1 000 000 petabytes
= 1 000 000 000 terabytes
= 1 000 000 000 000 gigabytes
1 terabyte = 250 DVDs
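The unit ladder on this slide can be checked with a few lines of arithmetic. This is a minimal sketch assuming decimal (SI) prefixes, where each unit is 1 000 times the previous one, and taking the slide's rule of thumb of 250 DVDs per terabyte at face value. The helper name `convert` is made up for the example.

```python
# Decimal (SI) byte units: each step up is a factor of 1 000.
UNITS = ["gigabyte", "terabyte", "petabyte", "exabyte", "zettabyte"]
BYTES_PER_UNIT = {name: 1000 ** (i + 3) for i, name in enumerate(UNITS)}

DVDS_PER_TERABYTE = 250  # rule of thumb taken from the slide


def convert(amount: int, source: str, target: str) -> int:
    """Convert a whole number of `source` units into whole `target` units."""
    total_bytes = amount * BYTES_PER_UNIT[source]
    quotient, remainder = divmod(total_bytes, BYTES_PER_UNIT[target])
    if remainder:
        raise ValueError("result is not a whole number of target units")
    return quotient


# Walk down the ladder from exabytes to gigabytes.
for unit in reversed(UNITS[:-1]):
    print(f"1 zettabyte = {convert(1, 'zettabyte', unit):,} {unit}s")

dvds = convert(1, "zettabyte", "terabyte") * DVDS_PER_TERABYTE
print(f"1 zettabyte is roughly {dvds:,} DVDs")
```

Under these assumptions one zettabyte works out to 250 billion DVDs.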
HOW BIG IS BIG DATA?
[Chart: size of total data, enterprise-managed data and enterprise-created data. Source: IDC]
HOW BIG IS BIG DATA?
2010: 10 gigabytes
Today: 500 terabytes / day
Today: 240 terabytes / flight
HOW FAST IS BIG DATA ?
Structured Data? An example
Before
Structured Data
Generated By companies
Updated every Month
SQL
Unstructured data?
After
Unstructured Data
Generated By Users
Real Time
NoSQL
THIS EXPLOSION OF DECENTRALISED DATA MEANS
[Chart, 2009 to 2014: unstructured file-based data storage vs. structured block-based data storage. Source: IDC]
HOW DO WE FIND VALUE in Big Data
HOW IS IT GROWING ?
Data production will be 44 times greater in 2020 than it was in 2009
70% of data is created by users, 80% is held by companies
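The "44 times" figure can be turned into a yearly growth rate. The sketch below assumes growth compounds at a constant rate between 2009 and 2020, which is a simplification and not something the slide states.

```python
import math

GROWTH_FACTOR = 44    # from the slide: 2020 production / 2009 production
YEARS = 2020 - 2009   # 11 yearly steps

# Assumption: constant compound growth, so (1 + r) ** YEARS == GROWTH_FACTOR.
annual_multiplier = GROWTH_FACTOR ** (1 / YEARS)
annual_rate = annual_multiplier - 1

# Time needed to double at that constant rate.
doubling_time_years = math.log(2) / math.log(annual_multiplier)

print(f"Implied yearly growth: {annual_rate:.1%}")
print(f"Implied doubling time: {doubling_time_years:.2f} years")
```

Under that assumption the implied growth is about 41% per year, meaning production doubles roughly every two years.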
DATA FOR ORGANISATIONS AND BUSINESSES: YOU ALREADY HAVE DATA
Internalized data
External structured data
External unstructured data
YOU HAVE A LOT OF DECISIONS TO MAKE
Marketing Channel
Budget
Targeting
New Product Development
USING DATA IS GOOD FOR YOUR BUSINESS
64% of the companies that invest in “analytics” outperform the others on average (S&P 500)
HOW DO WE FIND VALUE IN DATA
DATA IS NOT THE GOAL
How do we know that we made a great feature?
“If you want to be a long-term success, build a great product”
Sam Altman, Y Combinator
How do we track metrics? And how can we be sure that all departments use the same metrics?
What is a key metric? How do we choose it?
What we had before the new feature
What we get with the ability to apply keywords to a group
And what we have learned
Opportunity Type          < 1 w    1        2        3        4        5
Add Keywords              85.71%   42.86%   28.57%   28.57%   14.29%   14.29%
Feb 23rd, 2016 - Apr 11th, 2016
Opportunity Type          < 1 w    1        2        3        4        5
Add Adgrouping Keywords   88.24%   58.82%   47.06%   47.06%   41.18%   35.29%
Apr 13th, 2016 - Jun 18th, 2016
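The two rows can be compared column by column. The sketch below copies the percentages from the slide and computes the difference in percentage points. It assumes the columns are retention after a given number of weeks (the "< 1 w" header suggests this, but the slide does not say so), and the function name `point_lift` is made up for the example.

```python
PERIODS = ["< 1 w", "1", "2", "3", "4", "5"]

# Percentages copied from the two rows on the slide.
ADD_KEYWORDS = [85.71, 42.86, 28.57, 28.57, 14.29, 14.29]
ADD_ADGROUPING_KEYWORDS = [88.24, 58.82, 47.06, 47.06, 41.18, 35.29]


def point_lift(before, after):
    """Percentage-point difference per column, rounded to 2 decimals."""
    if len(before) != len(after):
        raise ValueError("rows must have the same number of columns")
    return [round(a - b, 2) for b, a in zip(before, after)]


lift = point_lift(ADD_KEYWORDS, ADD_ADGROUPING_KEYWORDS)
for period, points in zip(PERIODS, lift):
    print(f"{period:>5}: {points:+.2f} points")
```

The gap widens in the later columns. The percentages are multiples of 1/7 in the first row and 1/17 in the second, which suggests small cohorts. That is an inference from the numbers, not something the slide states.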
TO CONCLUDE : A BIG DATA DEFINITION
Technology
Maximising computation power and algorithmic accuracy to gather, analyse, link and compare large data sets
Analysis
Identify patterns
Mythology ?
Widespread belief that large data sets offer a higher form of intelligence and knowledge that can generate insights that were previously impossible, with the aura of truth, objectivity and accuracy
DOLEAD
We make digital advertising easy