18
® IBM Software Group © IBM Corporation IBM Information Server Understand - Information Analyzer

® IBM Software Group © IBM Corporation IBM Information Server Understand - Information Analyzer

Embed Size (px)

Citation preview

Page 1: ® IBM Software Group © IBM Corporation IBM Information Server Understand - Information Analyzer

®

IBM Software Group

© IBM Corporation

IBM Information Server

Understand - Information Analyzer

Page 2: ® IBM Software Group © IBM Corporation IBM Information Server Understand - Information Analyzer

IBM Software Group

IBM Information ServerDelivering information you can trust

Understand

Cleanse Transform Deliver

Discover, model, and govern information

structure and content

Standardize, merge,and correct information

Combine and restructure

information for new uses

Synchronize, virtualize and move information for in-

line delivery

ParallelProcessing Connectivity Metadata DeploymentAdministration

Platform Services

Support for Service-Oriented Architectures

2

Page 3: ® IBM Software Group © IBM Corporation IBM Information Server Understand - Information Analyzer

IBM Software Group

3

The IBM Solution: IBM Information ServerDelivering information you can trust

Cleanse Transform Deliver

Parallel Processing

Rich Connectivity to Applications, Data, and Content

IBM Information Server

Unified Deployment

Unified Metadata Management

Understand

Information AnalyzerData profiling for understanding what data you have and how it relates to other data, plus data analysis for measuring and monitoring ongoing

data quality.

Page 4: ® IBM Software Group © IBM Corporation IBM Information Server Understand - Information Analyzer

IBM Software Group

4

Data ProfilingCritical Problems: You don’t know what data is really in your

legacy systems Sources have changed or are new and

unknown

Why? Data values and relationships are

inconsistent and divergent from documented rules

Incomplete and missing documentation Data sources are never static and

frequently change without warning

Alternative Approach Labor intensive, resource devouring

process Never review 100% of data elements No infrastructure to support maintenance No standardized approach across

projects 1st generation tools document but don’t

address the problem resolution

Mainframe manufacturing system

Demographic

Contact

Billing / Accounts

External Lists

Distribution

ERP from acquisition

Parts BOM

Data SourcesData Sources

Page 5: ® IBM Software Group © IBM Corporation IBM Information Server Understand - Information Analyzer

IBM Software Group

5

About Information Analyzer

Automates your data discovery process

Enables you to understand your data before starting development

Eliminates the risk and uncertainty of using bad data

Useful in any type of data migration project

Analyzes every data attribute and reverse engineers the true meta data of your source

Reduces time to analyze data

Mainframe manufacturing system

Demographic

Contact

Billing / Accounts

External Lists

Distribution

ERP from acquisition

Parts BOM

Data SourcesData Sources

Page 6: ® IBM Software Group © IBM Corporation IBM Information Server Understand - Information Analyzer

IBM Software Group

6

IBM Information Analyzer

Reduce Time to Value of Data Projects

Increase the Productivity of Data Personnel

Assess Data Quality & Consistency across the Enterprise

Results sharable across IBM Information Server

Data Profiling: the process of analyzing a data sources to determine its content, quality and structure

Page 7: ® IBM Software Group © IBM Corporation IBM Information Server Understand - Information Analyzer

IBM Software Group

What does Information Analyzer provide?

Source System Analysis Provides the key understanding of the source data

Column & Domain analysis

Table/Primary Key analysis

Foreign Key analysis

Cross-Domain analysis

Iterative AnalysisLeverages the analysis to facilitate iterative tests

Baseline analysis

7

Foreign Key &Cross-Domain Analysis

Primary Key Analysis

Co

lum

nA

na

lysis

Source 1 Source 2

Page 8: ® IBM Software Group © IBM Corporation IBM Information Server Understand - Information Analyzer

IBM Software Group

Source System Analysis

Column & Domain analysis Infers from content a column’s classification, physical properties, and

frequency distribution

Table/Primary Key analysisValidates the uniqueness of the identified key column, which allows

us to ensure that a given row of data can be clearly identified and related to other data

Cross-Domain & Foreign Key analysisSyncronizes the structure, relationships and integrity of data

environments by finding and validating otherwise unknown relationships and identifying critical integrity violations that need to be rectified.

8

Page 9: ® IBM Software Group © IBM Corporation IBM Information Server Understand - Information Analyzer

IBM Software Group

Column Analysis: Tabular View

9

Page 10: ® IBM Software Group © IBM Corporation IBM Information Server Understand - Information Analyzer

IBM Software Group

Column Analysis: Chart View

Frequency DistributionView Frequency Distribution either in Tabular or in Graph

Add user defined value to Frequency Distribution

Generate Reference Tables

Sort and Filter Frequency Data

10

Page 11: ® IBM Software Group © IBM Corporation IBM Information Server Understand - Information Analyzer

IBM Software Group

Column Analysis: Properties

PropertiesSix property values are inferred for each column: Data Type,

Length, Precision, Scale, Nullability and Cardinality Type.

Distribution of data types, lengths, precisions and scales is displayed graphical.

11

Page 12: ® IBM Software Group © IBM Corporation IBM Information Server Understand - Information Analyzer

IBM Software Group

Primary Key Analysis Results

Reviewing DuplicatesView Summary of Distinct and

Duplicated Values

Display list of all Primary Key values and #/% Duplicated.

12

Page 13: ® IBM Software Group © IBM Corporation IBM Information Server Understand - Information Analyzer

IBM Software Group

Cross Domain Analysis Results

13

Page 14: ® IBM Software Group © IBM Corporation IBM Information Server Understand - Information Analyzer

IBM Software Group

14

Baseline Differences

Detailed results for the column level.

Results include the column level summaries of distinctions for both Structure (Defined and Inferred) and Content.

Baseline Analysis

Page 15: ® IBM Software Group © IBM Corporation IBM Information Server Understand - Information Analyzer

IBM Software Group

Sharing Analysis across Information Server

15

Page 16: ® IBM Software Group © IBM Corporation IBM Information Server Understand - Information Analyzer

IBM Software Group

16

Company Facts :• Largest distributor in North America

• Four major acquisitions in last two years• 12,000 branded products• 30,000 clients• 11 operating centers

Integration of supply chain management systems

Profit margin analysis systems

Field expansion, and take along project

Staff changes and limited documentation related to acquired systems

Only 7% of data being analyzed, but bad data causing 20% of cost overruns

Estimate 10k hours and $650k in costs to support first four projects

80% productivity gain for analyzing data sources

$504,000 annual savings in lower development and maintenance costs

Repeatable process for all future projects that ensures good, actionable data

Project Goals Challenges Results

ROI: Food Distribution.

Page 17: ® IBM Software Group © IBM Corporation IBM Information Server Understand - Information Analyzer

IBM Software Group

17

ROI: Top US Life Insurance Company

Competitive pressures requires the company to further enhance an existing competitive advantage – 360 degree customer view and 24\7 data availability. .

Detailed customer data resided in ten disparate legacy systems with little to no documentation. Presenting raw detailed data 24\7 was impossible.

Leveraging IBM allows for consistent data formats, validate data domains, define business rules linking policy data.

Better customer visibility.

Reduced costs by eliminating expensive and time-consuming investigations of detailed data.

Redeploying an investigator saves $130k annually.

Project Goals Challenges Results

Company Facts :

• #1 Largest Life Insurance Company in USA

• 138US$ billion in assets under management

• Offer complete like of life insurance, investment, retirement and related products

Page 18: ® IBM Software Group © IBM Corporation IBM Information Server Understand - Information Analyzer

®

IBM Software Group

© IBM Corporation

Thank You