`
AI-Powered Data Cataloging Virtual Summit
Informatica EDC Adoption Best Practices
Khuan Tan
Senior Director, Product Management
2 © Informatica. Proprietary and Confidential.2
Need for Enterprise Data Catalog
Improve Customer Experience
Gain Competitive Advantage
Accelerate Time-to-Market
Foundation for Digital Transformation
Enterprise Data Catalog Overview
A central place to collect, index, relate, annotate, and share knowledge about data assets
Broad Metadata Sources
• Technical
• Operational
• Usage
Business Context
• Glossary
• Policies
• Process
Wisdom of Crowd
• Comments
• Ratings
• Behavior
Knowledge Graph
AI Curated Catalog
• Structure Discovery, Profiling and Domain Discovery, Similarity Clustering, Recommendations
Business and Crowd Sourced Curation
• Business Glossary Associations, Business Classifications, Annotations, Comments
Enterprise Data Catalog
Data Governance [Data Stewards/Data Owners]
• Associate business glossary to technical objects
• Verify business to technical lineage
• Track key data elements compliance
Self Service Analytics [Data Analysts/Data Scientists]
• Google for enterprise data assets
• Data lineage, holistic relationship view
• Trust with data profile
• Access to data
Data Asset Management [Architects/Developers]
• Analyze column-level lineage and change impact
• View transformation logic
• Data asset and BI usage
EDC Value Across Key Personas
Accelerate digital transformations with faster data discovery, shorter time-to-insight, and reduced time-to-market
• Data Analysts to easily search for, discover, and understand data assets for analytics
• Data Scientists to understand the impact on analytical models of changes in data structure, content, and values
• Data Stewards to effectively curate and certify data assets, analyze data lineage, and track key data elements for compliance
• Data Architects to efficiently manage data assets, comprehend data transformations, and assess change impact
Personas served by the Central Catalog: Data Steward, Data Owner, Developer, Data Architect, Business Analyst, Data Analyst, Data Scientist
Path to Success with EDC
1. Formulate Program Strategy
2. Implement Program Strategy
3. Refine & Expand
Formulate your Program Strategy
Define a program strategy that:
• Aligns with and supports your corporate vision
• Focuses on delivering business impact with use cases linked to business drivers
• Addresses user pain points
• Communicates program value and long-term direction
Corporate Vision → Identify Business Drivers/Outcomes → Define Use Case and Identify Pain Points → Define Your Program Strategy
Example: Program Strategy Based on Vision, Business Drivers, Use Cases, and Pain Points
Corporate Vision
• Aspire to provide the best health care for our customers
Business Drivers
• External drivers
  - Increasing regulation
  - Support online patient self-service
• Internal drivers
  - 360 view of customers
  - Revenue growth
Use Cases
• Lineage of key data elements
• Customer data quality
• Migrate DWH to cloud
• Self-service analytics with big data
Challenges and Pain Points
• Creating the data lineage compliance report of key data elements takes two months of manual effort each year
• Data quality is questionable, and it is not clear who is responsible
• No common business definitions, causing data inconsistencies
• Data analysts and data scientists spend >75% of their time finding trusted, relevant data sets for analytics
Data Catalog Program Strategy
• Democratize data asset knowledge and usage through a central data catalog to support cross-functional collaboration, self-service analytics, IT modernization, and data governance
Guide to Implementing your Program Strategy
Avoid the pitfall of implementing the data catalog as a technical, IT-driven project that scans all sources with user adoption as an afterthought, and avoid the trap of providing the “all or nothing” solution demanded by your users. Instead, learn from the pilot, deliver quick wins, and expand.
Start small with Pilot Project
• Define success criteria
• Evaluate use cases against success criteria
• Select use case for project
Ingest and Enrich Metadata
• Start by ingesting 3 to 4 key metadata resources for the pilot
• Enrich your data catalog to provide business context and help users quickly find the right data asset
Assign Users and Responsibility
• Create users and user groups
• Assign user groups and permissions to resources
• Configure user privileges to prevent viewing of sensitive data in column data profiles
Train Users, Track Usage, Communicate Often
• Train your users
• Set user expectations
• Monitor usage and measure business impact
• Communicate program progress and direction with users and stakeholders
Refine and Expand
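As one way to make "monitor usage" concrete, the sketch below computes two simple adoption measures from a list of usage events: distinct active users per ISO week, and the share of trained users who have used the catalog at least once. The event tuples, user names, and function names are all hypothetical; in practice the events would come from whatever activity or audit data your deployment makes available, and this is not an EDC API.

```python
from datetime import date

# Hypothetical usage events: (user, role, day, action).
# These are illustrative values, not output from any EDC log or API.
events = [
    ("ana", "Data Analyst", date(2024, 3, 4), "search"),
    ("ana", "Data Analyst", date(2024, 3, 5), "view_asset"),
    ("raj", "Data Steward", date(2024, 3, 5), "certify"),
    ("li", "Data Scientist", date(2024, 3, 12), "search"),
    ("ana", "Data Analyst", date(2024, 3, 12), "search"),
]

# Hypothetical set of users who completed catalog training.
trained_users = {"ana", "raj", "li", "sam"}


def weekly_active_users(events):
    """Map (ISO year, ISO week) to the set of distinct users active that week."""
    weeks = {}
    for user, _role, day, _action in events:
        iso = day.isocalendar()
        weeks.setdefault((iso[0], iso[1]), set()).add(user)
    return weeks


def adoption_rate(events, trained):
    """Fraction of trained users who appear in at least one usage event."""
    if not trained:
        return 0.0
    active = {user for user, _role, _day, _action in events}
    return len(active & trained) / len(trained)


if __name__ == "__main__":
    for week, users in sorted(weekly_active_users(events).items()):
        print(week, sorted(users))
    print("adoption rate:", adoption_rate(events, trained_users))
```

With the sample data, three of the four trained users appear in the events, so the adoption rate is 0.75. Tracking the same two numbers week over week gives a simple baseline for the progress updates shared with users and stakeholders.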
Enrich Your Data Catalog
Why enrich catalog?
• Think of the data catalog as your enterprise knowledge repository of data assets
• Ingesting technical metadata from sources is the first step, but on its own it has limited value in helping your users find data assets, understand the business context, and use data assets appropriately
• Enriching your catalog amplifies the value of your data assets
What are the options?
• EDC provides extensible options to enrich your catalog, such as:
  - Business terms
  - Data profiling
  - Data domain
  - Relationships
  - Similarity
  - Synonyms
  - User reviews
  - User ratings
  - Asset certification
  - Custom attributes
  - and more…
Where to start?
• Start enriching critical data assets with:
  - Business title and description
  - Business term
  - Data owner, data steward
  - Asset certification
  - Data profile
  - Custom attributes, such as business usage and category
  - Data domains, such as PII and PHI
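The starter enrichments above can be tracked as a per-asset checklist so that gaps on critical assets are visible. The following is a minimal sketch under stated assumptions: the `Asset` class, the field names in `STARTER_ENRICHMENTS`, and the sample asset are hypothetical and do not reflect EDC's internal model or API.

```python
from dataclasses import dataclass, field

# Hypothetical keys mirroring the "Where to start?" list; not EDC attribute names.
STARTER_ENRICHMENTS = (
    "business_title",
    "description",
    "business_term",
    "data_owner",
    "data_steward",
    "certified",
    "data_profile",
    "custom_attributes",
    "data_domains",
)


@dataclass
class Asset:
    """A catalog asset with whatever enrichments have been applied so far."""
    name: str
    critical: bool = False
    enrichments: dict = field(default_factory=dict)


def missing_enrichments(asset):
    """Return the starter enrichments that are absent or empty on the asset."""
    return [key for key in STARTER_ENRICHMENTS if not asset.enrichments.get(key)]


def enrichment_backlog(assets):
    """For critical assets only, map asset name to its missing enrichments."""
    return {
        a.name: missing_enrichments(a)
        for a in assets
        if a.critical and missing_enrichments(a)
    }


if __name__ == "__main__":
    customer = Asset(
        name="CUSTOMER",
        critical=True,
        enrichments={
            "business_title": "Customer",
            "description": "Master customer record",
            "data_owner": "jdoe",
            "data_domains": ["PII"],
        },
    )
    staging = Asset(name="STG_TMP", critical=False)
    print(enrichment_backlog([customer, staging]))
```

Restricting the backlog to assets flagged as critical follows the advice to start with critical data assets rather than trying to enrich everything at once.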
Practical Steps for Enriching Data Catalog
Catalog Admin UI
1. Create a resource to ingest Business Glossary
2. Create custom attributes
3. Create a resource to ingest source metadata (e.g., DB, BI, ETL). Also configure:
   a) Assign data owners
   b) Profile data
   c) Discover data domain
   d) Assign and propagate custom attribute values
   e) Auto-associate business terms
Note: A resource is a catalog object that represents an external data source or metadata repository from which scanners extract metadata.
Catalog End-User UI
4. Assign object business title and description individually or bulk upload via export/import
5. Certify data assets
6. User reviews, ratings, Q&A
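For the bulk upload option in step 4, business titles and descriptions are often prepared in a spreadsheet and then turned into an import file. The sketch below builds such a CSV in memory. The column names (`asset_path`, `business_title`, `description`) and the function name are assumptions for illustration only; the actual file layout should be taken from the catalog's own export rather than from this example.

```python
import csv
import io

# Hypothetical column layout; use the columns from your catalog's export instead.
DEFAULT_FIELDS = ("asset_path", "business_title", "description")


def build_bulk_update(rows, fieldnames=DEFAULT_FIELDS):
    """Serialize asset rows to CSV text, rejecting rows without a business title."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    for row in rows:
        if not row.get("business_title"):
            raise ValueError(f"missing business_title for {row.get('asset_path')!r}")
        # Fill absent optional columns with an empty string.
        writer.writerow({name: row.get(name, "") for name in fieldnames})
    return buf.getvalue()


if __name__ == "__main__":
    rows = [
        {
            "asset_path": "sales_db/public/cust",
            "business_title": "Customer",
            "description": "Master customer record",
        },
        {"asset_path": "sales_db/public/ord", "business_title": "Order"},
    ]
    print(build_bulk_update(rows))
```

Validating rows before writing keeps incomplete entries out of the import file, which is easier than correcting assets one by one after an upload.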
Sample Enrichments
• Asset Certification
• Data Domain
• Data Owner
• Business Terms
• Custom Attributes
• Data Profile
• Business Title
Train and Support Your Users
1. Create Training Plan by User Roles
• Create training curriculum by user roles
• Train your support staff first; they will be your internal functional and technical EDC experts
• Leverage Informatica University on-demand training, IPS Adoption Services
• Customize training for your users
2. Create Training Content and Videos
• Engage your subject matter experts to define hands-on training content
• Compare and contrast the current process with using EDC to demonstrate the benefits of using EDC
• Communicate the bigger picture with your program strategy and set user expectations
• Create short 3-4 minute videos as a how-to reference for performing common tasks
3. Leverage Business Champions to Train and Drive User Adoption
• Create business ownership by leveraging your champions as influencers to drive adoption in their teams
• Leverage your company’s enablement experts
• Create a training image in the cloud to quickly spin up training environments and support future training upgrades
Summary
Formulate Strategy
• Align with corporate business direction
• Plan strategically, start with users in mind
Start with a Pilot
• Keep first rollout simple and focused; aim for quick wins
• Make catalog relevant to users with business context information
Train Users
• Train and show users how data catalog helps them produce results
• Engage early adopters as your champions
Refine and Expand
• Conduct feedback workshops, monitor usage
• Communicate success and expand program
Fireside Chat
Khuan Tan, Senior Director of Product Management, Enterprise Data Catalog
Dan Rothstein, Senior IT Data Architect
EDC Adoption – Lessons Learned
• Business-driven, not IT-driven
• High-level stakeholder support
• Start small, focusing on one business area
• Gradually expand, driven by business needs
• Ensure the technical side (e.g. scanning new data sources) doesn’t get too far ahead of the business side (e.g. business glossary association)
• Ensure business stakeholders are actively engaged in every step
Resources
• Informatica EDC: Learn more about EDC
• Informatica Network: Informatica Product Documentation, Releases, Communities, Support, and Knowledge Base articles
• Informatica University: Success Academy for EDC user role based on-demand training
• EDC Community: EDC Discussions, Documents, Videos, Blogs, How-Tos
• EDC On-line Documentation: Access the EDC on-line guide by clicking on the Help “?” icon on the Catalog User UI
• EDC Concepts*: Key concepts in EDC include catalog, resource, scanner, data domain, etc. A great starting point for learning about EDC
• EDC on GitHub: EDC REST API samples on GitHub
• Product Availability Matrix: Metadata scanner, OS, Repository DB support matrix
• Informatica Professional Services: Provides EDC business adoption and consulting services
`
Thank You