Data Prep 101

Alex Ziko, Data Analyst

James Cousins, Senior Statistical Analyst

January 15th, 2019

Itinerary

• Who We Are

• Define Data Prep

• Data Prep Challenges

• Overcoming the Challenges

• Software Demonstration

• Upcoming Events

About Rapid Insight

Founded in 2002 and headquartered in Conway, NH

Predictive analytics and data preparation software company empowering professionals of all skill levels to turn raw data into actionable insights

Serving hundreds of customers worldwide, ranging from healthcare to higher education

The Veera platform enables users to easily build predictive models, perform advanced data analysis, and share insights

Code-free (but code-friendly) self-service analytics platform

Meet Your Presenters

Alexander Ziko, Data Analyst and Customer Success Manager

As a Data Analyst and Customer Success Manager, Alex works closely with customers to help them understand their data using the Veera platform. Alex holds a BS from Green Mountain College and an MBA from Franklin Pierce University. When not in the office, Alex is usually found on the coast of Maine or in the mountains of New Hampshire.

James Cousins, Senior Statistical Analyst

As a Senior Statistical Analyst, James works directly with organizations bringing data to bear in decision-making, building analytic capacity along the way. His work has involved hundreds of organizations, from a single analyst to teams of more than ten. James holds a B.S. in Mathematics from Dickinson College and is pursuing his M.S. in Data Analytics from Johnson and Wales University.

Data Prep /da·ta prep/ [noun]

1. The process of transforming data from its raw form into information that is useful for reporting, analysis, and predictive analytics

2. The duty that often occupies 80% of a data analyst’s workload

Projects Reliant on Data Prep

• Reporting

• Predictive Modeling

• Ad Hoc Analysis

Data Prep Challenges

• Disparate Data Sources

• Messy Data Entry

• Manual Processes

• Varying Expertise

The Ideal Data Analyst Toolkit

• Data Access

• Intuitive Tools

• Scheduled Processes

Merging

When your data is scattered across multiple datasets, merging allows you to combine the relevant parts of those data sources to create a new dataset.
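As a rough illustration outside of Veera, a merge can be sketched in plain Python as a key-based join. The `students` and `enrollments` tables, the `student_id` key, and the `left_merge` helper are all hypothetical, invented only for this example.

```python
# Hypothetical example data: two sources that share a student_id key.
students = [
    {"student_id": 1, "name": "Ana"},
    {"student_id": 2, "name": "Ben"},
    {"student_id": 3, "name": "Cy"},
]
enrollments = [
    {"student_id": 1, "program": "Biology"},
    {"student_id": 3, "program": "History"},
]


def left_merge(left, right, key):
    """Join each left record to its matching right record on `key`.

    Left records with no match are kept, and fields that exist only in
    `right` are filled with None. Assumes `key` is unique within `right`.
    """
    lookup = {row[key]: row for row in right}
    right_fields = {field for row in right for field in row if field != key}
    merged = []
    for row in left:
        match = lookup.get(row[key], {})
        combined = dict(row)
        for field in right_fields:
            combined[field] = match.get(field)
        merged.append(combined)
    return merged


merged = left_merge(students, enrollments, "student_id")
```

This sketch keeps every record from the first dataset (a "left" merge). Whether unmatched records should be kept or dropped depends on the project.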

Appending

Stacking two datasets to create one larger dataset is called appending. When appending data, the datasets typically contain the same (or very similar) fields.
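A minimal sketch of appending in plain Python, assuming two hypothetical yearly extracts whose fields mostly overlap. The dataset names and the `append_datasets` helper are illustrative only.

```python
# Hypothetical example data: two yearly extracts with nearly the same fields.
applicants_2017 = [{"id": 1, "state": "NH"}, {"id": 2, "state": "ME"}]
applicants_2018 = [{"id": 3, "state": "VT", "source": "web"}]


def append_datasets(*datasets):
    """Stack datasets row-wise, aligning on the union of their fields.

    Fields missing from a record are filled with None so that every
    output row has the same columns.
    """
    fields = []
    for dataset in datasets:
        for row in dataset:
            for field in row:
                if field not in fields:
                    fields.append(field)
    return [
        {field: row.get(field) for field in fields}
        for dataset in datasets
        for row in dataset
    ]


appended = append_datasets(applicants_2017, applicants_2018)
```

Filling the missing `source` field with `None` is one possible choice for handling "very similar" fields. Dropping fields that are not shared would be another.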

Filtering

By filtering a dataset, you are narrowing it down to just a specific group of records.
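Filtering could be sketched in plain Python as keeping only the records that satisfy a condition. The `donors` data, the threshold of 100, and the `filter_records` helper are assumptions made up for this example.

```python
# Hypothetical example data: donor records with a state and a gift total.
donors = [
    {"donor_id": 1, "state": "NH", "total_gifts": 250.0},
    {"donor_id": 2, "state": "ME", "total_gifts": 40.0},
    {"donor_id": 3, "state": "NH", "total_gifts": 75.0},
]


def filter_records(records, predicate):
    """Return only the records for which predicate(record) is true."""
    return [row for row in records if predicate(row)]


# Keep New Hampshire donors who have given at least 100 in total.
nh_major_donors = filter_records(
    donors,
    lambda row: row["state"] == "NH" and row["total_gifts"] >= 100,
)
```

The original list is left unchanged, so the same source can feed several differently filtered outputs.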

DeDuping

To dedupe is to remove duplicates from a dataset. Selection rules can be made to dedupe on specific conditions.
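One way to picture a selection rule is "keep the most recent record per ID". The sketch below assumes hypothetical `visits` data with ISO-formatted date strings (which sort correctly as text). The `dedupe` helper and its `prefer` parameter are illustrative names, not part of any product.

```python
# Hypothetical example data: patient A appears twice.
visits = [
    {"patient_id": "A", "visit_date": "2018-03-01", "clinic": "North"},
    {"patient_id": "B", "visit_date": "2018-05-12", "clinic": "South"},
    {"patient_id": "A", "visit_date": "2018-11-20", "clinic": "East"},
]


def dedupe(records, key, prefer):
    """Keep one record per `key` value.

    When several records share a key, the selection rule keeps the one
    with the largest prefer(record) value.
    """
    best = {}
    for row in records:
        current = best.get(row[key])
        if current is None or prefer(row) > prefer(current):
            best[row[key]] = row
    return list(best.values())


# Selection rule: keep each patient's most recent visit.
latest_visits = dedupe(visits, "patient_id", prefer=lambda row: row["visit_date"])
```

Swapping the `prefer` function changes the rule, for example to keep the earliest record or the one with the fewest missing fields.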

Data Cleansing

To cleanse a column is to edit or replace values within the column's cells.
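As a small illustration, cleansing often means standardizing inconsistent entries and flagging blanks. The `raw_states` values, the `REPLACEMENTS` table, and the `"UNKNOWN"` placeholder below are assumptions chosen for this sketch.

```python
# Hypothetical example data: one column with inconsistent manual entries.
raw_states = ["NH", " nh", "New Hampshire", "N.H.", "", "ME"]

# Assumed mapping from known variants to a standard code.
REPLACEMENTS = {"NEW HAMPSHIRE": "NH", "N.H.": "NH"}


def cleanse(value, missing="UNKNOWN"):
    """Trim whitespace, upper-case, replace known variants, flag blanks."""
    cleaned = value.strip().upper()
    if not cleaned:
        return missing
    return REPLACEMENTS.get(cleaned, cleaned)


clean_states = [cleanse(value) for value in raw_states]
```

Keeping the replacement rules in a lookup table makes them easy to review and extend as new variants appear in the data.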

Rename

Renaming allows you to enter a new name for your columns.
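In code, renaming can be sketched as applying a mapping from old column names to new ones. The column names and the `rename_columns` helper here are hypothetical.

```python
# Hypothetical example data with terse source-system column names.
records = [{"STU_ID": 1, "FNAME": "Ana", "gpa": 3.6}]

# Assumed mapping of old names to friendlier ones.
NEW_NAMES = {"STU_ID": "student_id", "FNAME": "first_name"}


def rename_columns(rows, mapping):
    """Return copies of the rows with columns renamed per `mapping`.

    Columns not listed in `mapping` keep their original names.
    """
    return [
        {mapping.get(column, column): value for column, value in row.items()}
        for row in rows
    ]


renamed = rename_columns(records, NEW_NAMES)
```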

Transforming

To transform a column is to perform an operation that creates a new outcome: this could be a new variable entirely, or a different version of the original column.
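The sketch below shows both kinds of transformation on hypothetical applicant data: `age` is a new variable derived from a birth date, and `gpa_band` is a different (binned) version of the original `gpa` column. The as-of date and the 3.0 cutoff are arbitrary choices for illustration.

```python
from datetime import date

# Hypothetical example data.
applicants = [
    {"id": 1, "birth_date": date(2000, 6, 30), "gpa": 3.7},
    {"id": 2, "birth_date": date(1999, 1, 10), "gpa": 2.4},
]

AS_OF = date(2019, 1, 15)  # arbitrary reference date for the age calculation


def age_on(birth_date, as_of):
    """Whole years between birth_date and as_of."""
    had_birthday = (as_of.month, as_of.day) >= (birth_date.month, birth_date.day)
    return as_of.year - birth_date.year - (0 if had_birthday else 1)


def gpa_band(gpa):
    """Bin a numeric GPA into two coarse categories."""
    return "3.0+" if gpa >= 3.0 else "below 3.0"


# Add the derived columns while leaving the original columns in place.
transformed = [
    {**row, "age": age_on(row["birth_date"], AS_OF), "gpa_band": gpa_band(row["gpa"])}
    for row in applicants
]
```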

Aggregating

Aggregating allows you to select specific variables and calculate summary statistics.
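A minimal sketch of aggregation in plain Python: group hypothetical gift records by donor and compute a count, sum, and mean per group. The `gifts` data and the `aggregate` helper are assumptions for this example.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical example data: one row per gift.
gifts = [
    {"donor_id": 1, "amount": 50.0},
    {"donor_id": 2, "amount": 20.0},
    {"donor_id": 1, "amount": 150.0},
]


def aggregate(records, group_key, value_key):
    """Group records by `group_key` and summarize `value_key` per group."""
    groups = defaultdict(list)
    for row in records:
        groups[row[group_key]].append(row[value_key])
    return {
        group: {"count": len(values), "sum": sum(values), "mean": mean(values)}
        for group, values in groups.items()
    }


summary = aggregate(gifts, "donor_id", "amount")
```

The result has one entry per donor rather than one per gift, which is the shape typically needed for reporting or for building one-row-per-entity modeling datasets.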

Transposing

By transposing, you can turn your rows into columns (and your columns into rows).
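For a small table stored as a list of rows, transposing can be sketched with Python's built-in `zip`. The `table` contents are hypothetical, and the sketch assumes every row has the same number of cells.

```python
# Hypothetical example data: header row followed by one row per term.
table = [
    ["term", "applied", "admitted"],
    ["Fall", 120, 80],
    ["Spring", 45, 30],
]


def transpose(rows):
    """Turn rows into columns; assumes all rows have the same length."""
    return [list(column) for column in zip(*rows)]


transposed = transpose(table)
```

Transposing twice returns the original layout, which makes for a quick sanity check.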

Connect: Integrate data in any format, from virtually any source

Prepare: Create step-by-step processes using easy, drag-and-drop visual workflows with no coding required

Analyze: Build and schedule jobs to run automatically, or run on-demand analyses

Share: Write back to databases, create and disseminate reports, publish dashboards to visual analytics tools such as Tableau, or output datasets for predictive modeling

Software Demonstration

Upcoming Events

Expert Tips on the Data Prep Process

January 29, 2 PM ET / 11 AM PT

Join Senior Statistical Analyst James Cousins as he discusses some of the most common data preparation projects. He will explore ways you can make your data tasks more reliable, more accurate, and faster. While these tips benefit all industries, you can expect real use cases from healthcare, higher education, and fundraising.

www.rapidinsightinc.com/blog/webinars/

Questions?

Thank You!

For more information, visit www.rapidinsight.com

[email protected]@rapidinsightinc.com

[email protected]