Upload
cviga
View
218
Download
0
Embed Size (px)
Citation preview
8/12/2019 SQL2014 Updating Your Skills MVA Module 3
1/12
03 | Managing Data Quality
Graeme Malcolm | Data Technology Specialist, Content Master
Pete Harris | Learning Product Planner, Microsoft
8/12/2019 SQL2014 Updating Your Skills MVA Module 3
2/12
Data Quality Services
Data Cleansing
Data Matching
Module Overview
8/12/2019 SQL2014 Updating Your Skills MVA Module 3
3/12
The Data Quality Problem
Business decisions rely on trusted data Data quality issues in data sources can lead to inaccu
reporting and analysis Invalid data values (e.g. Californa)
Inconsistencies ( e.g. California and CA Duplicate business entities (e.g. Jim Corbin, James Corb
SQL Server DQS is a knowledge-based solution for: Data Cleansing
Data Matching
8/12/2019 SQL2014 Updating Your Skills MVA Module 3
4/12
Knowledge Bases and Domains
Knowledge base: Repository of knowledge about data
Domain validation rules
Matching policies
Domains: Specific to a data field (or a composite field)
Contain values and validation rules for members Valid(e.g. California and CA for a Statedomain) Invalid(e.g. 90261 for a Statedomain Error(e.g. Californa for a State domain)
Define rules to correct values to leadingvalues
8/12/2019 SQL2014 Updating Your Skills MVA Module 3
5/12
Demo: Creating a Knowledge Base
In this demonstration, you will see how to:
Create a Knowledge Base
Perform Knowledge Discovery
Perform Domain Management
8/12/2019 SQL2014 Updating Your Skills MVA Module 3
6/12
Data Cleansing Projects
1.Select a knowledge base
2.Map data columns todomains
3.Review suggestions and
corrections4.Export results
8/12/2019 SQL2014 Updating Your Skills MVA Module 3
7/12
Demo: Cleansing Data
In this demonstration, you will see how to:
Create a Data Cleansing Project
View Cleansed Data
8/12/2019 SQL2014 Updating Your Skills MVA Module 3
8/12
Cleansing Data in SSIS
SSIS includes a DQS leansing Transformation
8/12/2019 SQL2014 Updating Your Skills MVA Module 3
9/12
Data Matching
Define matching rules for business entities in a
matching policy
Rules match entities based on domains: Similarity: Similar or exact match
Weight: Percentage to apply if match succeeds
Prerequisite: Mandatory domain match for rule to succeed
If the combined weight of all matches meets or exceeds the minimum matching score, the entities are duplicates
8/12/2019 SQL2014 Updating Your Skills MVA Module 3
10/12
Demo: Matching Data
In this demonstration, you will see how to:
Create a Matching Policy
Create a Data Matching Project
View Data Matching Results
8/12/2019 SQL2014 Updating Your Skills MVA Module 3
11/12
Module Summary
Create a Knowledge Base for your data Use data cleansing to ensure consistent, correct dom
Use data matching to identify duplicate data entities
8/12/2019 SQL2014 Updating Your Skills MVA Module 3
12/12
2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, Office, Azure, System Center, Dynamics and other produc t names are or may be registered trade
U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this pre
must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of
the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.