Presented by:
Data Visualization
Jeffrey A. Shaffer
Vice President, IT and Analytics, Unifund
Adjunct Professor, University of Cincinnati
Visualization and Data Presentation
www.wayfind.com/napoleon.html
Visualization and Data PresentationOn Minard’s graph:
“This is one of the worse graphs ever made. He [Tufte]'s very happy because it shows five different pieces of information on three axis and if you study it for fifteen minutes it really is worth a thousand words. I don't think that's what graphs are for. I think they try to make a point in two seconds for people who are too lazy to read the forty words underneath.”
- Seth Godin
“Dumbing it down for the executive”
Recreation of Jorge Camoes Redesign (www.ExcelCharts.com)
Visualization and Data Presentation
Image retrieved from the Library of Congress. http://hdl.loc.gov/loc.gmd/g3201e.ct000242
Drawing of the approximate representation of world emigration for the year 1858 by Charles Minard, published in 1862
Early 19th Century: Charles Minard Visualization and Data Presentation
Every Few Months
Never
Every Few Weeks
Every Few Days
Daily
Default in Excel
42%
29%
19%
8%
Every Few Months
Never
Every Few Weeks
Every Few Days
Daily 42%
29%
19%
8%
2%
Every Few Months Never Every Few Weeks Every Few Days Daily
Frequency
42%
29%
19%
8%
2%
Every Few Months
Never
Every Few Weeks
Every Few Days
Daily
Frequency
TargetEvery Few
MonthsNever
Every FewWeeks
Every FewDays
Daily
Frequency 42% 29% 19% 8% 2%
Cumulative % 42% 71% 90% 98% 100%
71%
0%
20%
40%
60%
80%
100%
0%
10%
20%
30%
40%
50%
Cumulative %
Frequency71% of occurrences are caused by low frequency
Time Utilization Chart Visualization and Data Presentation
The KISS Principle:
Keep it Simple, Stupid!
“It seems that perfection is reached not when there is nothing to add, but when there is nothing left to take away.”
- Antoine de Saint Exupéry
“When we understand that slide, we’ll have won the war.” -General Stanley McChrystal
Politics in the Middle East
Clear - easily seen; sharply defined
• who's the audience? what's the message?
• clarity more important than aesthetics
Clean - thorough; complete; unadulterated
• labels, axis, gridlines, formatting, right chart type, color choice, etc.
Concise - brief but comprehensive
• not minimalist but not verbose
Captivating - to attract and hold by beauty or excellence
• does it capture attention? is it interesting? does it tell the story?
The Shaffer 4 C’s of Data Visualization
Visualization and Data Presentation
Figure out the messageThen apply the tools
Figure out the messageThen apply the tools
85%
15%
Percentage of Chart which Looks like PacMan
Looks Like PacMan
Does Not Look LikePacMan
Looks like PacmanDoes not look like
Pacman
% 15% 85%
0%
25%
50%
75%
100%% of Chart that looks like Pacman
0%
1/3
2/3
Rectangle?
0%
25%
50%
75%
100%
Full Container 3/4 Container 1/2 Container 1/4 Container
Ingeniousness
Amount of Pringles Left
How Ingenious a Pringles Can is:
This is
Awesome!
Hmm, this
is getting
awkward.
Crap, my
hand got
stuck!
@#$% it,
I'll just
drink them.
0%
25%
50%
75%
100%
Full Container 3/4 Container 1/2 Container 1/4 Container
Ingeniousness
Amount of Pringles Left
How Ingenious a Pringles Can is:
This is
Awesome!
Hmm, this
is getting
awkward.
Crap, my
hand got
stuck!
@#$% it,
I'll just
drink them.
Source: www.HamiltonCountyAuditor.org
PacMan Eating Skittles
What is the message?
$500.07
$31.55
Food expenditure for one week
Germany Ecuador
Germany Ecuador
$500.07 $31.55
$0.00
$200.00
$400.00
$600.00
Food expenditure for one week
$500.07
$31.55
Germany
Ecuador
Food expenditure for one week
Germany: The Melander family of BargteheideFood expenditure for one week: $500.07 Source: “Hungry Planet”
By Peter Menzel and Faith D’Aluisio
Ecuador: The Ayme family of TingoFood expenditure for one week: $31.55 Source: “Hungry Planet”
By Peter Menzel and Faith D’Aluisio
Elle and Nina
Elle Nina
Stretching our thinking about data…
…what about this?
Stretching our thinking about data…
…what about this?
A Data Visualizationof your checking account
Checking Acct #123456789: $843.62
Checking Acct #123456789: -$43.17
Current Balance
Visualization and Data Presentation
Sparklines (Tufte):
Visualization and Data Presentation
Sparklines:
• Small, high-resolution graphics embedded in a context of words, numbers, images. Sparklines are data-intense, design-simple, word-sized graphics. (New in Excel 2010)
Visualization and Data Presentation
Sparklines:
Visualization and Data Presentation
Sparktweets
Latest Developments
Bandlines and Sparkstrips
Invented by Stephen Few (January 2013)
Visualization and Data Presentation
Trellis Charts (Small Multiples):
• "Illustrations of postage-stamp size are indexed by category or a label, sequenced over time like the frames of a movie, or ordered by a quantitative variable not used in the single image itself."
Visualization and Data Presentation
Box and Whiskers or Box Plot (Tukey):
Visualization and Data Presentation
Box and Whiskers or Box Plot (Tukey):
Box Plot vs. PDF
Visualization and Data Presentation
Common Mistakes:
• Chance is measured as %
– there is a 10% chance
• Probability is a number between 0 and 1
– The probability is .3
• Correlation is a number between -1 and 1
• Principal ($) not Principle (Think PayPal)
• Judgment and Management
• We pay $.035 for the widget not 3.5%
• Number Formatting and Rounding
(Excel example)
Color Vision Deficiency (CVD)aka Colorblind
Color Blind
Protanomaly (1 out of 100 males) – “red weak”
Deuteranomaly (5 out of 100 males) – “green weak”
Dichromasy (2 out of 100 males) -no perceptible difference between red, orange, yellow, and green
Protanopia (1 out of 100 males) -use light/dark
Deuteranopia (one out of 100 males) – “green blind”
= 10% of males are Color Blind
Main issue is Red and Green
Color
Bucket A (2.6%) Bucket B (4.2%) Bucket C (93.2%)
Paying 22.53% 19.93% 14.95%
On-time 51.27% 27.42% 71.01%
Late 18.63% 14.60% 1.63%
Stale 7.57% 38.05% 12.41%
0%
25%
50%
75%
% of Accts
VisCheckwww.vischeck.com
Deuteranope Simulation
VisCheckwww.vischeck.com
Protanope Simulation
Protanope Simulation
Bucket A (2.6%) Bucket B (4.2%) Bucket C (93.2%)
Paying 22.53% 19.93% 14.95%
On-time 51.27% 27.42% 71.01%
Late 18.63% 14.60% 1.63%
Stale 7.57% 38.05% 12.41%
0%
25%
50%
75%
% of Accts
Protanope Simulation
Created by:
Steve Wexler
www.datarevelations.com
Example in Practice
Compare and Contrast
Visualization and Data Presentation
Showing performance vs. target (ex. actual to forecast)
• Bar Chart with Target line
– Easy to build
– Good when single target for all categories
Bullet Graph – What is it?
Not this kind of chart…
Bullet Graph:
Target A
Target B
Target C
Invented by Stephen Few
Visualization and Data Presentation
Two Variations on the Bullet Graph:
• Remove performance bands
• Now it’s simply actual to target
• Easier to understand
• Dual Axis Bullet Graph
– Shows both value and %
Visualization and Data Presentation
Geospatial:
• Location can relate disparate data
• A Map is worth a thousand cells
• Easy to see problem areas
• By State, County, Zip, Block
• Even down to Address
Foreclosures (July 2008)
Source: RealtyTrac (www.RealtyTrac.com)
Source: Fox News (http://foxnewsinsider.com/2012/09/14/breaking-
university-of-texas-in-austin-orders-evacuation-due-to-threats-on-campus/)
Breaking News: Missouri invades Arkansas,Mississippi and Alabama Land Swap
Pharmacy Growth?
What the data actually shows:
2006 2007 2008 2009 2010 2011
77.8% 79.0% 79.5% 79.7% 80.0% 80.0%
Pharmacies 1,920 1,964 1,973 1,967 1,966 1,948
Stores 2,468 2,486 2,481 2,468 2,458 2,435
The Real Story
The Real Story What is the real message?
1. From 2007 to 2010 - closed stores that don’t have pharmacies thus increasing the % of stores with a pharmacy.
2. Increased Pharmacy Sales from $6.5B to $7.3B with fewer stores and fewer pharmacies
They missed a very compelling Story12% increase in Sales with Lower Costs!
Putting it all together
Our Tableau Server
Operations Dashboard
2008 Feltron Reporthttp://feltron.com
2007 Feltron Reporthttp://feltron.com
Compare and Contrast
Healthcare Examples
Compare and Contrast
Compare and Contrast
CitationsFrom Rome Reborn: The Vatican Library & Renaissance Culture, Library of Congress website. Accessed on 2/18/2012. http://www.loc.gov/exhibits/vatican/math.html
Few, Stephen (2007), Data Visualization: Past, Present, and Future, p.3
Wikipedia entry René Descartes, used under the Creative Commons-Share Alike 3.0 Unported license. Accessed on 2/18/2012. http://en.wikipedia.org/wiki/Rene_descartes
Wikipedia entry Cartesian Coordinate System, used under the Creative Commons-Share Alike 3.0 Unported license. Accessed on 2/18/2012 http://en.wikipedia.org/wiki/Cartesian_coordinate_system
Wikipedia entry William Playfair, used under the Creative Commons-Share Alike 3.0 Unported license. Accessed on 2/18/2012 http://en.wikipedia.org/wiki/William_Playfair
Tufte, Edward R. (2001), The Visual Display of Quantitative Information, Second Edition, p. 9
The Oxford Dictionary of National Biography, ©2004-12 Oxford University Press. Retrieved on 2/18/2012. http://www.oxforddnb.com/view/printable/22370
Friendly, Michael (2001), Gallery of Data Visualization, Electronic document, http://www.datavis.ca/gallery/, Accessed: 02/19/2012 22:20:39
Tufte, Edward R. (2001), The Visual Display of Quantitative Information, Second Edition, p. 40
Bertin, Jacques (1983), Semiology of Graphics, English translation by William J. Berg, pp. 42, 65
Wikipedia entry Exploratory Data Analysis, used under the Creative Commons-Share Alike 3.0 Unported license. Accessed on 2/19/2012.http://http://en.wikipedia.org/wiki/Exploratory_data_analysis
Wikipedia entry Edward Tufte, used under the Creative Commons-Share Alike 3.0 Unported license. Accessed on 2/19/2012. http://en.wikipedia.org/wiki/Edward_Tufte
Make sure to connect online and stay in touch:
Follow on Twitter:@HighVizAbility
Connect with us on LinkedIn:www.linkedin.com/in/jeffreyshaffer/
Websites:www.dataplusscience.com/workshop.htmlwww.dataplusscience.com/http://makingdatameaningful.com/
Facebook:https://www.facebook.com/DataPlusScience
Email:[email protected]