Data Visualization How to

Embed Size (px)

Citation preview

  • 8/20/2019 Data Visualization How to

    1/55

    Data Visualization

    A primer on best practices

    Lazaro Gamio | @LazaroGamio

    https://twitter.com/lazarogamiohttps://twitter.com/lazarogamio

  • 8/20/2019 Data Visualization How to

    2/55

    What we’re going to go over

    1. Evaluating data2. Finding stories in your data3. Finding a shape for your story

  • 8/20/2019 Data Visualization How to

    3/55

    Where I’m coming from, and the type o

    we do at @PostGraphics.

    Some context

  • 8/20/2019 Data Visualization How to

    4/55

    http://www.washingtonpost.com/graphics/world/world-happiness-2015/

  • 8/20/2019 Data Visualization How to

    5/55

    http://www.washingtonpost.com/graphics/world/world-happiness-2015/http://www.washingtonpost.com/graphics/world/scaling-everest/

  • 8/20/2019 Data Visualization How to

    6/55

    http://www.washingtonpost.com/graphics/world/scaling-everest/http://www.washingtonpost.com/wp-srv/special/national/cia-interrogation-report/word-count/

  • 8/20/2019 Data Visualization How to

    7/55

    http://www.washingtonpost.com/wp-srv/special/national/cia-interrogation-report/word-count/

  • 8/20/2019 Data Visualization How to

    8/55

    http://www.washingtonpost.com/graphics/health/how-fast-does-measles-spread/

  • 8/20/2019 Data Visualization How to

    9/55

    Feel free to interrupt

  • 8/20/2019 Data Visualization How to

    10/55

    Data needs to be found and vetted

    the same way sources are.

    Evaluating data

  • 8/20/2019 Data Visualization How to

    11/55

    It doesn’t make the process any easi

    There’s so much of it

  • 8/20/2019 Data Visualization How to

    12/55

  • 8/20/2019 Data Visualization How to

    13/55

    That’s also just one of many places wh

    data lives. Here’s a random exampl

    Gasp.

  • 8/20/2019 Data Visualization How to

    14/55

  • 8/20/2019 Data Visualization How to

    15/55

    And neither will my audience.

    I have no idea what this

  • 8/20/2019 Data Visualization How to

    16/55

  • 8/20/2019 Data Visualization How to

    17/55

    384,499 rows of goodness.It was a 63 mb spreadsh

  • 8/20/2019 Data Visualization How to

    18/55

    There’s just more to edit and sift thro

    Lots of data ≠ Easy dat

  • 8/20/2019 Data Visualization How to

    19/55

    Take it out to dinner and a movie.Ask about its hopes, dreams and aspira

     You need to spend timwith your data.

  • 8/20/2019 Data Visualization How to

    20/55

    Data is more than the final product.1. Who collected the data?2. Who inputted the data?3. Who formatted the data?

    4. How is the data being delivered?5. Who’s giving it to you?6. Is your data recent?

    Who is your source?

  • 8/20/2019 Data Visualization How to

    21/55

    Population of Azerbaij

    CIA WorldFactbook

    Wikipedia

  • 8/20/2019 Data Visualization How to

    22/55

    World Bank

    Population Referenc

  • 8/20/2019 Data Visualization How to

    23/55

  • 8/20/2019 Data Visualization How to

    24/55

  • 8/20/2019 Data Visualization How to

    25/55

    1. Data is often very specific.2. Data is really big.3. Your editing process begins at selecti

    dataset.

    4. You need to put in a lot of work to prodata into something digestible.

    Takeaways

  • 8/20/2019 Data Visualization How to

    26/55

    What do you want to learn? Most

    importantly, what do you want your reto learn?

    What does your data sa

  • 8/20/2019 Data Visualization How to

    27/55

    Questions to ask

    1. Who’s the ‘mostest?’2. Who’s the ‘leastest?’3. What is the ‘middle?’4. Who are the outliers? Why?

    5. Is there a trend over time?6. Does this trend correlate with anothe

    trend?

  • 8/20/2019 Data Visualization How to

    28/55

    1. It may be a good story.

    2. Your data may be flawed.3. It may mean nothing.

    Investigate outliers

  • 8/20/2019 Data Visualization How to

    29/55

  • 8/20/2019 Data Visualization How to

    30/55

    Sometimes large values distort your m

    and can skew your perspective of the dDon’t add to the confusion!

    Mean vs. Median

  • 8/20/2019 Data Visualization How to

    31/55

    Mean: $513,000Median: $350,000

  • 8/20/2019 Data Visualization How to

    32/55

  • 8/20/2019 Data Visualization How to

    33/55

  • 8/20/2019 Data Visualization How to

    34/55

    Do you know why there

    is a variation? You should.

    Trends

  • 8/20/2019 Data Visualization How to

    35/55

    Correlation ≠ Causatio

  • 8/20/2019 Data Visualization How to

    36/55

    1. Ask answerable questions of your dat2. Learn the limitations of your dataset3. Once you have findings, double-chec4. Then, Triple-check.

    Takeaways

  • 8/20/2019 Data Visualization How to

    37/55

  • 8/20/2019 Data Visualization How to

    38/55

    Before you start

    1. Do you really need a chart?2. Are you charting discrete quantities?3. Are you charting a time series?4. What do you want to draw attention

    5. Is your data geographical?6. If so, does location make a difference

  • 8/20/2019 Data Visualization How to

    39/55

    The most important thing is knowing exwhat you want to tell. The previous two

    are the most important.

    Content is king

  • 8/20/2019 Data Visualization How to

    40/55

    Are you trying to show one important v

    with a couple of other values for cont

    Charting the mostest

  • 8/20/2019 Data Visualization How to

    41/55

    The Mostest.

  • 8/20/2019 Data Visualization How to

    42/55

    Do you have data that changes over ti

    Chronological data

  • 8/20/2019 Data Visualization How to

    43/55

    A simple trend

  • 8/20/2019 Data Visualization How to

    44/55

    While maps are often useful, they are o

    not the best tool for your story.

    Resist the urge to map

  • 8/20/2019 Data Visualization How to

    45/55

    Gun de

    People married tosomeone they metin high school

    http://www.slate.com/articles/news_and_politics/crime/2012/12/gun_death_tally_every_american_gun_death_since_newtown_sandy_hook_shooting.html

  • 8/20/2019 Data Visualization How to

    46/55

    Do it for the children.

    Don’t make

    population maps

  • 8/20/2019 Data Visualization How to

    47/55

    Have two datasets that make sense tog

    and help tell the same story?

    Two Charts

  • 8/20/2019 Data Visualization How to

    48/55

    Two trends.

    T k

  • 8/20/2019 Data Visualization How to

    49/55

    1. Keep it simple. Your readers willappreciate it.

    2. Pick the chart that best proves your p3. Tell the reader what they should be lo

    at.4. If appropriate, tell the story in multip

    charts.

    Takeaways

  • 8/20/2019 Data Visualization How to

    50/55

    Excel, Google Docs, Tableau, Datawrap

    Matplotlib, D3, Adobe Illustrator, et

    Tools

  • 8/20/2019 Data Visualization How to

    51/55

    Google Docs/Excel

  • 8/20/2019 Data Visualization How to

    52/55

    Great options

  • 8/20/2019 Data Visualization How to

    53/55

    D3.js

  • 8/20/2019 Data Visualization How to

    54/55

    Questions?

  • 8/20/2019 Data Visualization How to

    55/55

    Lazaro Gamio | @LazaroGamio

    Thanks!

    https://twitter.com/lazarogamio