Upload
gabriel-gaster
View
47
Download
1
Embed Size (px)
Citation preview
Northwestern Data Visualization| @gabegaster | 2015 may
what is data science?
Northwestern Data Visualization| @gabegaster | 2015 may
what is data science?
Northwestern Data Visualization| @gabegaster | 2015 may
what is data science?
who is a data scientist?
Northwestern Data Visualization| @gabegaster | 2015 may
what is data science?
who is a data scientist?
review of literature
Northwestern Data Visualization| @gabegaster | 2015 may
what is data science?
who is a data scientist?
review of literature
Northwestern Data Visualization| @gabegaster | 2015 may
what is data science?
review of literature
Northwestern Data Visualization| @gabegaster | 2015 may
what is data science?
review of literature
Northwestern Data Visualization| @gabegaster | 2015 may
what is data science?
who is a data scientist?
Northwestern Data Visualization| @gabegaster | 2015 may
what is data science?
who is a data scientist?“a scientist who can code”
Northwestern Data Visualization| @gabegaster | 2015 may
what is data science?
who is a data scientist?“a scientist who can code”
• lower barrier to attack new problems
Northwestern Data Visualization| @gabegaster | 2015 may
what is data science?
who is a data scientist?“a scientist who can code”
• lower barrier to attack new problems • repeatable analysis
Northwestern Data Visualization| @gabegaster | 2015 may
what is data science?
who is a data scientist?“a scientist who can code”
• lower barrier to attack new problems • repeatable analysis • freedom to think about problems new ways
Northwestern Data Visualization| @gabegaster | 2015 may
what is data science?
Northwestern Data Visualization| @gabegaster | 2015 may
what is data science?
using emerging technologies to approach problems scientifically
Northwestern Data Visualization| @gabegaster | 2015 may
what is data science?
using emerging technologies to approach problems scientifically
which were difficult to answer before
Northwestern Data Visualization| @gabegaster | 2015 may
which were difficult to answer before
Northwestern Data Visualization| @gabegaster | 2015 may
computing has progressed
which were difficult to answer before
Northwestern Data Visualization| @gabegaster | 2015 may
1950computing has progressed
Northwestern Data Visualization| @gabegaster | 2015 may
1950
cost of new analysis
computing has progressed
Northwestern Data Visualization| @gabegaster | 2015 may
1950
cost of new analysis years
computing has progressed
Northwestern Data Visualization| @gabegaster | 2015 may
1950
cost of new analysis years
todaycomputing has progressed
Northwestern Data Visualization| @gabegaster | 2015 may
1950
cost of new analysis years
today
v
computing has progressed
Northwestern Data Visualization| @gabegaster | 2015 may
1950
cost of new analysis years
today
hoursv
v
computing has progressed
Northwestern Data Visualization| @gabegaster | 2015 may
1950
cost of new analysis years
today
same person thinking about the problem can conduct experiments to answer it
hoursv
v
computing has progressed
Northwestern Data Visualization| @gabegaster | 2015 may
computing has progressed
Northwestern Data Visualization| @gabegaster | 2015 may
open-source code
computing has progressed
Northwestern Data Visualization| @gabegaster | 2015 may
open-source code
standing on shoulders of giants
computing has progressed
Northwestern Data Visualization| @gabegaster | 2015 may
open-source code
standing on shoulders of giants
computing has progressed
Northwestern Data Visualization| @gabegaster | 2015 may
open-source code
standing on shoulders of giants
computing has progressed
Northwestern Data Visualization| @gabegaster | 2015 may
open-source code
standing on shoulders of giants
reinventing the wheel
computing has progressed
Northwestern Data Visualization| @gabegaster | 2015 may
open-source code
standing on shoulders of giants
reinventing the wheel
computing has progressed
Northwestern Data Visualization| @gabegaster | 2015 may
what is data science?
using emerging technologies to approach problems scientifically
which were difficult to answer before
Northwestern Data Visualization| @gabegaster | 2015 may
what is data science?
using emerging technologies to approach problems scientifically
knowing what is possible
which were difficult to answer before
Northwestern Data Visualization| @gabegaster | 2015 may
what is data science?
using emerging technologies to approach problems scientifically
which were difficult to answer before
knowing what is possible
doing something useful
Northwestern Data Visualization| @gabegaster | 2015 may
what is data science?
using emerging technologies to approach problems scientifically
which were difficult to answer before
knowing what is possible
doing something useful
HOW
Northwestern Data Visualization| @gabegaster | 2015 may
what is data science?
using emerging technologies to approach problems scientifically
which were difficult to answer before
knowing what is possible
doing something useful
HOW WHY
Northwestern Data Visualization| @gabegaster | 2015 may
what is data science?
using emerging technologies to approach problems scientifically
which were difficult to answer before
knowing what is possible
doing something useful
Northwestern Data Visualization| @gabegaster | 2015 may
what is data science?
using emerging technologies to approach problems scientifically
which were difficult to answer before
knowing what is possible
doing something useful
usingnew good
the righttools
Northwestern Data Visualization| @gabegaster | 2015 may
what is data science?
using emerging technologies to approach problems scientifically
which were difficult to answer before
knowing what is possible
doing something useful
usingnew good
the rightasking whytools
Northwestern Data Visualization| @gabegaster | 2015 may
what is data science?
using emerging technologies to approach problems scientifically
which were difficult to answer before
knowing what is possible
doing something useful
usingnew good
the rightasking whytools
Northwestern Data Visualization| @gabegaster | 2015 may
what is data science?
using emerging technologies to approach problems scientifically
which were difficult to answer before
knowing what is possible
doing something useful
usingnew good
the rightasking whytools
Northwestern Data Visualization| @gabegaster | 2015 may
what is data science?
using emerging technologies to approach problems scientifically
which were difficult to answer before
knowing what is possible
doing something useful
usingnew good
the rightasking whytools WHY
Northwestern Data Visualization| @gabegaster | 2015 may
what is data science?
using emerging technologies to approach problems scientifically
which were difficult to answer before
knowing what is possible
doing something useful
usingnew good
the rightasking whytools WHY
WHY
Northwestern Data Visualization| @gabegaster | 2015 may
why why whywhat is data science?
Northwestern Data Visualization| @gabegaster | 2015 may
why why whywhat is data science?
science is about asking why
Northwestern Data Visualization| @gabegaster | 2015 may
why why whywhat is data science?
science is about asking whystart there
Northwestern Data Visualization| @gabegaster | 2015 may
an anecdote
Northwestern Data Visualization| @gabegaster | 2015 may
Northwestern Data Visualization| @gabegaster | 2015 may
Northwestern Data Visualization| @gabegaster | 2015 may
Northwestern Data Visualization| @gabegaster | 2015 may
an examplefrom the real world
Northwestern Data Visualization| @gabegaster | 2015 may
• e
an example
Northwestern Data Visualization| @gabegaster | 2015 may
• what
what data do you have? how can we make a network?
Northwestern Data Visualization| @gabegaster | 2015 may
• what
what data do you have? how can we make a network?
Northwestern Data Visualization| @gabegaster | 2015 may
Northwestern Data Visualization| @gabegaster | 2015 may
Northwestern Data Visualization| @gabegaster | 2015 may
goal: save money
Northwestern Data Visualization| @gabegaster | 2015 may
goal: save money
Northwestern Data Visualization| @gabegaster | 2015 may
goal: save money
Northwestern Data Visualization| @gabegaster | 2015 may
goal: save money
Northwestern Data Visualization| @gabegaster | 2015 may
goal: save moneytask: find needle in the haystack (without poking yourself)
Northwestern Data Visualization| @gabegaster | 2015 may
abou
t pat
ent
not
abou
t pat
ent
goal: save moneytask: find needle in the haystack (without poking yourself)
Northwestern Data Visualization| @gabegaster | 2015 may
abou
t pat
ent
not
abou
t pat
ent
turn over to plaintiffdon’t
turn over to plaintiff
adverse inference
goal: save moneytask: find needle in the haystack (without poking yourself)
Northwestern Data Visualization| @gabegaster | 2015 may
abou
t pat
ent
not
abou
t pat
ent
turn over to plaintiffdon’t
turn over to plaintiff
adverse inference
give away trade secrets
goal: save moneytask: find needle in the haystack (without poking yourself)
Northwestern Data Visualization| @gabegaster | 2015 may
abou
t pat
ent
not
abou
t pat
ent
turn over to plaintiffdon’t
turn over to plaintiff
adverse inference
give away trade secrets
goal: save moneytask: find needle in the haystack (without poking yourself)
Northwestern Data Visualization| @gabegaster | 2015 may
turn over to plaintiffdon’t
turn over to plaintiff
goal: save moneytask: find needle in the haystack (without poking yourself)
Northwestern Data Visualization| @gabegaster | 2015 may
Northwestern Data Visualization| @gabegaster | 2015 may
Northwestern Data Visualization| @gabegaster | 2015 may
Northwestern Data Visualization| @gabegaster | 2015 may
goal: save moneyprototype
Northwestern Data Visualization| @gabegaster | 2015 may
goal: save moneyprototype — design for lawyers
Northwestern Data Visualization| @gabegaster | 2015 may
Sexier. Less nerdy. Tailored. design for transparency
Northwestern Data Visualization| @gabegaster | 2015 may
http://www.daegis.com/judicial-acceptance-of-tar/
Northwestern Data Visualization| @gabegaster | 2015 may
another examplecontests
Northwestern Data Visualization| @gabegaster | 2015 may
another example
Northwestern Data Visualization| @gabegaster | 2015 may
Northwestern Data Visualization| @gabegaster | 2015 may
task:
Northwestern Data Visualization| @gabegaster | 2015 may
classify schizophrenia w MRItask:
Northwestern Data Visualization| @gabegaster | 2015 may
why?classify schizophrenia w MRItask:
Northwestern Data Visualization| @gabegaster | 2015 may
why?classify schizophrenia w MRItask:
improve understanding of disease
Northwestern Data Visualization| @gabegaster | 2015 may
why?classify schizophrenia w MRItask:
improve understanding of diseasehow?
Northwestern Data Visualization| @gabegaster | 2015 may
why?classify schizophrenia w MRItask:
improve understanding of diseasehow? … outside contest purview
Northwestern Data Visualization| @gabegaster | 2015 may
why? outside contest purview
Northwestern Data Visualization| @gabegaster | 2015 may
why? outside contest purview
Northwestern Data Visualization| @gabegaster | 2015 may
why? outside contest purview
kaggle
Northwestern Data Visualization| @gabegaster | 2015 may
why? outside contest purview
kagglegetting data
& making usable
Northwestern Data Visualization| @gabegaster | 2015 may
why? outside contest purview
kagglegetting data
& making usable
WHY
Northwestern Data Visualization| @gabegaster | 2015 may
timeline of contest
Accuracy of Classification
Northwestern Data Visualization| @gabegaster | 2015 may
timeline of contest
AUC
Accuracy of Classification
Northwestern Data Visualization| @gabegaster | 2015 may
what is AUC?AU
C
Northwestern Data Visualization| @gabegaster | 2015 may
AUCwhat is AUC? Area Under Curve
Northwestern Data Visualization| @gabegaster | 2015 may
AUCwhat is AUC? Area Under Curve
what curve?
Northwestern Data Visualization| @gabegaster | 2015 may
AUCwhat is AUC? Area Under Curve
what curve? Receiver Operating Characteristic
Northwestern Data Visualization| @gabegaster | 2015 may
AUCwhat is AUC? Area Under Curve
what curve? Receiver Operating Characteristic
Northwestern Data Visualization| @gabegaster | 2015 may
AUCwhat is AUC? Area Under Curve
what curve? Receiver Operating Characteristic
Northwestern Data Visualization| @gabegaster | 2015 may
balances:
AUCwhat is AUC? Area Under Curve
what curve? Receiver Operating Characteristic
Northwestern Data Visualization| @gabegaster | 2015 may
balances: True Positive RateFalse Positive Rate
AUCwhat is AUC? Area Under Curve
what curve? Receiver Operating Characteristic
Northwestern Data Visualization| @gabegaster | 2015 may
balances: True Positive RateFalse Positive Rate
AUCwhat is AUC? Area Under Curve
what curve? Receiver Operating Characteristic
Northwestern Data Visualization| @gabegaster | 2015 may
AUCwhat is AUC?
balances: True Positive RateFalse Positive Rate
Area Under Curve
what curve? Receiver Operating Characteristic
Northwestern Data Visualization| @gabegaster | 2015 may
why?AUCwhat is AUC?
balances: True Positive RateFalse Positive Rate
Area Under Curve
what curve? Receiver Operating Characteristic
Northwestern Data Visualization| @gabegaster | 2015 may
why?…
AUCwhat is AUC?
balances: True Positive RateFalse Positive Rate
Area Under Curve
what curve? Receiver Operating Characteristic
Northwestern Data Visualization| @gabegaster | 2015 may
why?…
upshot:
AUCwhat is AUC?
balances: True Positive RateFalse Positive Rate
Area Under Curve
what curve? Receiver Operating Characteristic
Northwestern Data Visualization| @gabegaster | 2015 may
why?…
choice of metric matters a LOT
upshot:
in practice
AUCwhat is AUC?
balances: True Positive RateFalse Positive Rate
Area Under Curve
what curve? Receiver Operating Characteristic
Northwestern Data Visualization| @gabegaster | 2015 may
timeline of contest
Accuracy of Classification
AUC
Northwestern Data Visualization| @gabegaster | 2015 may
timeline of contest
Accuracy of Classification
AUC
random guess
Northwestern Data Visualization| @gabegaster | 2015 may
timeline of contest
Accuracy of Classification
AUC
random guess
basic SVM
Northwestern Data Visualization| @gabegaster | 2015 may
timeline of contest
goal?
Accuracy of Classification
AUC
random guess
basic SVM
Northwestern Data Visualization| @gabegaster | 2015 may
timeline of contest
goal: depends on why
Accuracy of Classification
AUC
random guess
basic SVM
Northwestern Data Visualization| @gabegaster | 2015 may
random guess
basic SVM
timeline of contest
Accuracy of Classification
AUC
Northwestern Data Visualization| @gabegaster | 2015 may
me
timeline of contest
Accuracy of Classification
AUC
Northwestern Data Visualization| @gabegaster | 2015 may
me
timeline of contest
Accuracy of Classification
AUC turned out to place 9th — because overfitting
Northwestern Data Visualization| @gabegaster | 2015 may
me
timeline of contest
Accuracy of Classification
AUC turned out to place 9th — because overfitting
very common problem
Northwestern Data Visualization| @gabegaster | 2015 may
timeline of contest
Accuracy of Classification
worth it?
AUC
Northwestern Data Visualization| @gabegaster | 2015 may
Northwestern Data Visualization| @gabegaster | 2015 may
Northwestern Data Visualization| @gabegaster | 2015 may
Northwestern Data Visualization| @gabegaster | 2015 may
Northwestern Data Visualization| @gabegaster | 2015 may
Northwestern Data Visualization| @gabegaster | 2015 may
Northwestern Data Visualization| @gabegaster | 2015 may
Northwestern Data Visualization| @gabegaster | 2015 may
Northwestern Data Visualization| @gabegaster | 2015 may
Northwestern Data Visualization| @gabegaster | 2015 may
We need to reduce the costs of Service Requests. They are too expensive.
!!!!
Thousands of engineers around the world, 24-7 read through emails and hardware log files to determine
the cause of failure of a server. This is an expensive process. We've tried to automate it. We can now
automatically resolve 7% of new Service Requests. But we want more. That's why we bought a few
super computers with TBs of memory.
client
an example !
from the industrial internet
Northwestern Data Visualization| @gabegaster | 2015 may
Why? Why do you need to set up a hadoop architecture to do clustering? What will this help you achieve? !!!!!How do you handle Service Requests? !
We need to reduce the costs of Service Requests. They are too expensive.
!!!!
Thousands of engineers around the world, 24-7 read through emails and hardware log files to determine
the cause of failure of a server. This is an expensive process. We've tried to automate it. We can now
automatically resolve 7% of new Service Requests. But we want more. That's why we bought a few
super computers with TBs of memory.
client
Northwestern Data Visualization| @gabegaster | 2015 may
Why? Why do you need to set up a hadoop architecture to do clustering? What will this help you achieve? !!!!!!!
We need to reduce the costs of Service Requests. They are too expensive.
!!!!
Thousands of engineers around the world, 24-7 read through emails and hardware log files to determine
the cause of failure of a server. This is an expensive process. We've tried to automate it. We can now
automatically resolve 7% of new Service Requests. But we want more. That's why we bought a few
super computers with TBs of memory.
client
Northwestern Data Visualization| @gabegaster | 2015 may
Why? Why do you need to set up a hadoop architecture to do clustering? What will this help you achieve? !!!!!How do you handle Service Requests? !
We need to reduce the costs of Service Requests. They are too expensive.
!!!!
Thousands of engineers around the world, 24-7 read through emails and hardware log files to determine
the cause of failure of a server. This is an expensive process. We've tried to automate it. We can now
automatically resolve 7% of new Service Requests. But we want more. That's why we bought a few
super computers with TBs of memory.
client
Northwestern Data Visualization| @gabegaster | 2015 may
Why? Why do you need to set up a hadoop architecture to do clustering? What will this help you achieve? !!!!!How do you handle Service Requests? !
We need to reduce the costs of Service Requests. They are too expensive.
!!!!
Thousands of engineers around the world, 24-7 read through emails and hardware log files to determine
the cause of failure of a server. This is an expensive process. We've tried to automate it. We can now
automatically resolve 1% of new Service Requests. But we want more. That's why we bought a few
super computers with TBs of memory.
client
Northwestern Data Visualization| @gabegaster | 2015 may
clienttools are not everything
Northwestern Data Visualization| @gabegaster | 2015 may
clienttools are not everything
but it is important to know the right tool for the job
Northwestern Data Visualization| @gabegaster | 2015 may
clienttools are not everything
but it is important to know the right tool for the job
Northwestern Data Visualization| @gabegaster | 2015 may
clienttools are not everything
but it is important to know the right tool for the job
Northwestern Data Visualization| @gabegaster | 2015 may
clienttools are not everything
but it is important to know the right tool for the job
don’t start w hadoop unless you have to. !
Northwestern Data Visualization| @gabegaster | 2015 may
clienttools are not everything
but it is important to know the right tool for the job
don’t start w hadoop unless you have to. !
probably you don’t have to.
Northwestern Data Visualization| @gabegaster | 2015 may
client
How did you automate resolving Service Requests? !!!!!!!!!!!
Northwestern Data Visualization| @gabegaster | 2015 may
client
How did you automate resolving Service Requests? !!!!!!!!!!!
A group of senior engineers thought about different use cases and came up with a list of conditions that, if any
are met, lead to predetermined solutions.
Northwestern Data Visualization| @gabegaster | 2015 may
client
How did you automate resolving Service Requests? !!!!!!!!!!!
A group of senior engineers thought about different use cases and came up with a list of conditions that, if any
are met, lead to predetermined solutions. !
Took a year to create. !
Northwestern Data Visualization| @gabegaster | 2015 may
client
How did you automate resolving Service Requests? !!!!!!!!!!!
A group of senior engineers thought about different use cases and came up with a list of conditions that, if any
are met, lead to predetermined solutions. !
Took a year to create. !
We’ve been keeping track of every solved request for several years now.
Northwestern Data Visualization| @gabegaster | 2015 may
client
How did you automate resolving Service Requests? !!!!!!!!!!!
A group of senior engineers thought about different use cases and came up with a list of conditions that, if any
are met, lead to predetermined solutions. !
Took a year to create. !
We’ve been keeping track of every solved request for several years now.
from sklearn import naive_bayes as nb!nb.GaussianNB().fit(historical_requests,!! ! ! ! ! ! historical_decisions)
Northwestern Data Visualization| @gabegaster | 2015 may
client
This works really well! But we can’t use it. !!!!!!
Northwestern Data Visualization| @gabegaster | 2015 may
client
This works really well! But we can’t use it. !!!!!!
Oh. Why is that?
Northwestern Data Visualization| @gabegaster | 2015 may
client
This works really well! But we can’t use it. !!!!!!
Engineers don’t trust the predictions.
Oh. Why is that?
Northwestern Data Visualization| @gabegaster | 2015 may
client
This works really well! But we can’t use it. !!!!!!
Engineers don’t trust the predictions.
Oh. Why is that?
Northwestern Data Visualization| @gabegaster | 2015 may
an example
just for fun
Northwestern Data Visualization| @gabegaster | 2015 may
an example
just for fun
(a bit more depth this time)
Northwestern Data Visualization| @gabegaster | 2015 may
a typical trip for me
Northwestern Data Visualization| @gabegaster | 2015 may
Bus transit times = a LIE
Northwestern Data Visualization| @gabegaster | 2015 may
Chicago is a grid city
Northwestern Data Visualization| @gabegaster | 2015 may
Difficult
Public Transit on the grid
=+ Diagonals
Northwestern Data Visualization| @gabegaster | 2015 may
Difficult
Public Transit on the grid
=+ Diagonals
2+ buses = FAIL
Northwestern Data Visualization| @gabegaster | 2015 may
Adding bikes to public transit
= win
Northwestern Data Visualization| @gabegaster | 2015 may
show how has divvy changed where people
can go
viz Goal:
Northwestern Data Visualization| @gabegaster | 2015 may
show how has divvy changed where people
can goshow where people
actually go
viz Goal:
Northwestern Data Visualization| @gabegaster | 2015 may
How to show where people go from here?
Northwestern Data Visualization| @gabegaster | 2015 may
one way is with an arrow
A B
How to show where people go from here?
Northwestern Data Visualization| @gabegaster | 2015 may
good for abstract relationships
one way is with an arrow
Northwestern Data Visualization| @gabegaster | 2015 may
hard to decipher
Northwestern Data Visualization| @gabegaster | 2015 may
lines between pts?
@flowingdata
Northwestern Data Visualization| @gabegaster | 2015 may
emphasizes traffic
@flowingdata
lines between pts?(the lines superimpose)
Northwestern Data Visualization| @gabegaster | 2015 may
emphasizes traffic
@flowingdata
lines between pts?
beautiful map(the lines superimpose)
Northwestern Data Visualization| @gabegaster | 2015 may
emphasizes traffic
@flowingdata
lines between pts?
beautiful map(the lines superimpose)
— but not suited for this goal
Northwestern Data Visualization| @gabegaster | 2015 may
lines between pts?
Northwestern Data Visualization| @gabegaster | 2015 may
lines between pts?
Northwestern Data Visualization| @gabegaster | 2015 may
lines between pts?
how to represent stations?
Northwestern Data Visualization| @gabegaster | 2015 may
lines between pts?
on a maphow to represent stations?
Northwestern Data Visualization| @gabegaster | 2015 may
but how?
lines between pts?
on a maphow to represent stations?
Northwestern Data Visualization| @gabegaster | 2015 may
can use gradient — to show gradual differences
between stationsLondon transit map
@mySociety
Northwestern Data Visualization| @gabegaster | 2015 may
@mbostock
or use natural borders?
London transit map
@mySociety
Northwestern Data Visualization| @gabegaster | 2015 may
what regions?
Northwestern Data Visualization| @gabegaster | 2015 may
each point is related to the closest station
what regions?
Northwestern Data Visualization| @gabegaster | 2015 may
each point is related to the closest station
what regions?
—> Voronoi
Northwestern Data Visualization| @gabegaster | 2015 may
each point is related to the closest station
what regions?
—> Voronoihuh?
Northwestern Data Visualization| @gabegaster | 2015 may
each point is related to the closest station
what regions?
—> Voronoihuh? http://alexbeutel.com/webgl/voronoi.html
Northwestern Data Visualization| @gabegaster | 2015 may
each point is related to the closest station
what regions?
—> Voronoihuh? http://alexbeutel.com/webgl/voronoi.html
Find the closest station — that’s my region!
Northwestern Data Visualization| @gabegaster | 2015 may
Czech beer
Northwestern Data Visualization| @gabegaster | 2015 may
Czech beer
starbucks
@gabegaster | http://bitly.com/bundles/gabegaster/1
recap: stations are voronoi tiles
@gabegaster | http://bitly.com/bundles/gabegaster/1
recap: stations are voronoi tiles
bubbles
@gabegaster | http://bit.ly/1pdP2Tb
recap: stations are voronoi tiles
too many edges to show…
@gabegaster | http://bit.ly/1pdP2Tb
recap: stations are voronoi tiles
too many edges to show… !
what about using color?
@gabegaster | http://bit.ly/1pdP2Tb
how touse color?
• two color scale
colors v colors
@gabegaster | http://bit.ly/1pdP2Tb
how touse color?
binned v gradient
colors v colors
• two color scale
@gabegaster | http://bit.ly/1pdP2Tb
• hard to read • differences subtle
gradient
@gabegaster | http://bit.ly/1pdP2Tb
how touse color?
• two colors not many • binned not gradient
binned v gradient
colors v colors
@gabegaster | http://bit.ly/1pdP2Tb
how touse color?
• two colors not many • binned not gradient
binned v gradient
colors v colors
binned
@gabegaster | http://bit.ly/1pdP2Tb
how touse color?
• two colors not many • binned not gradient • transparent empty bin
binned v gradient
colors v colors
binned
@gabegaster | http://bit.ly/1pdP2Tb
how touse color?
• two colors not many • binned not gradient • transparent empty bin • iterate
binned v gradient
colors v colors
binned
@gabegaster | http://bitly.com/bundles/gabegaster/1
• dispersion (where people can / do go)
with these choices, the viz emphasizes:
• exploration
@gabegaster | http://bitly.com/bundles/gabegaster/1
what can we learn from this?
urban vs lake frontcommute vs party
vs train
there’s me, too.hipster vs yuppie
@gabegaster | http://bit.ly/1pdP2Tb
Great! This looks great! !
But it’s too much at once.
@gabegaster | http://bit.ly/1pdP2Tb
Great! This looks great! !
But it’s too much at once.
make it interactive
@gabegaster | http://bit.ly/1pdP2Tb
Great! This looks great! !
But it’s too much at once.
make it interactive
divvy.datasco.pe
Northwestern Data Visualization| @gabegaster | 2015 may
new question
Northwestern Data Visualization| @gabegaster | 2015 may
How are stations different?
Northwestern Data Visualization| @gabegaster | 2015 may
How are stations different?
who uses it
Northwestern Data Visualization| @gabegaster | 2015 may
How are stations different?
when is the station usedwho uses it
Northwestern Data Visualization| @gabegaster | 2015 may
How are stations different?
when is the station usedhow it used
who uses it
Northwestern Data Visualization| @gabegaster | 2015 may
How are stations different?
when is the station usedhow it used
who uses it
use the time signature of a station
Northwestern Data Visualization| @gabegaster | 2015 may
Time Signature of a station
Northwestern Data Visualization| @gabegaster | 2015 may
Time Signature of a station
http://divvy.datasco.pe/multiline/
Northwestern Data Visualization| @gabegaster | 2015 may
Northwestern Data Visualization| @gabegaster | 2015 may
Northwestern Data Visualization| @gabegaster | 2015 may
questions?
Northwestern Data Visualization| @gabegaster | 2015 may
in conclusion
Northwestern Data Visualization| @gabegaster | 2015 may
Northwestern Data Visualization| @gabegaster | 2015 may
Northwestern Data Visualization| @gabegaster | 2015 may
Northwestern Data Visualization| @gabegaster | 2015 may
Northwestern Data Visualization| @gabegaster | 2015 may
Northwestern Data Visualization| @gabegaster | 2015 may
thanks!@gabegaster