Upload
jen-stirrup
View
641
Download
0
Embed Size (px)
Citation preview
Datavis Barista: How to choose what dataviz tool, and when
Jen StirrupFounder, Data Relish
Level: Intermediate
http://www.designyourway.net/blog/inspiration/when-infographics-go-bad-or-how-not-to-design-data-visualization/
Why R?• most widely used data analysis software - used by 2M + data scientist,
statisticians and analysts• Most powerful statistical programming language• flexible, extensible and comprehensive for productivity• Create beautiful and unique data visualisations - as seen in New York Times,
Twitter and Flowing Data• Thriving open-source community - leading edge of analytics research• Fills the talent gap - new graduates prefer R.
Growth in Demand• Rexer Data Mining survey, 2007 - 2013
• R is the highest paid IT skill Dice.com, Jan 2014
• R most used-data science language after SQL - O'Reilly, Jan 2014
• R is used by 70% of data miners. Rexer, Sept 2013
Growth in Demand• R is #15 of all programming languages. REdMonk, Jan
2014
• R growing faster than any other data science language. KDNuggs.
• R is in-memory and limited in the size of data that you can process.
What do I need to install?• Install R – www.r-project.org• Install Rstudio – www.rstudio.com
• Handy Shortcuts • Tab – autocomplete of available functions• Control and Up Arrow – History• Control and enter – executes the line of code
What tools do we have in R?• 80% of your time will be spent preparing and wrangling data• The remainder of your time will be spent complaining about it.
• dplyr: the essential data manipulation toolset• In data wrangling, what are the main tasks? • – Filtering rows
– Selecting columns of data– Adding new variables– Sorting– Aggregating
Kibana• It is highly customizable dashboarding• It is constituted of panels:
– Time picker / Query / Filtering– Charts / Table / Text
Flexible analytics and visualization platformReal-time summary and charting of streaming dataIntuitive interface for a variety of usersInstant sharing and embedding of dashboards
To better understand large volumes of data..
• easily create bar charts• line and scatter plots• Histograms• pie charts• maps.
To better understand large volumes of data..
• easily create bar charts• line and scatter plots• Histograms• pie charts• maps.
Default Chart TypesChart Type Basis Values Types PurposeHistogram Timestamp based Count, Mean, Total Barlines, stacks,
percentagesQueries
Table Paging Fields list Highlighting, sorting
Fine grained analysis
Pie Charts Terms Missing terms, other
Doughnut, legends, tables
Proportion