30
Guidelines for Getting Started with Teradata from Barry 1. Management Summary.................................................. 1 2. Getting Started..................................................... 2 3. What Have We Learned?.............................................. 13 4. The Author......................................................... 13 Appendix A. Data Model Templates......................................14 1. Management Summary There are three Phases in the lifecycle of a typical Enterprise Data Warehouse :- 1. Getting Started 2. Assessment of an operational Data Warehouse 3. Business As Usual (BAU) – changes and enhancements This Paper discusses Phase 1 – Getting Started – and has the objective of promoting discussion of Best Practice. The later two Phases are described in a separate Paper which is available on demand from Database Answers at [email protected] . We use Transportation as the Application to discuss our Approach but it applies equally well to any other area for which an Industry Data Model is available from Database Answers :- http://www.databaseanswers.org/data_models/index.htm Phase 1 – Getting Started This incorporates a Blueprint and will incorporate any appropriate Teradata Industry Data Model. It will then involve design of a Conceptual High-level Model and Subject Areas Models. This Chapter provides material that is suitable for presentations, preparation of a business case, clarification of the requirements or to establish the scope of a Data Warehouse. Page 1 31-05-2022 08:01

databaseanswers.orgdatabaseanswers.org/downloads/Guidelines_for_Gettin… · Web viewA Common Data Model can be used to design a Data Warehouse in a series of incremental Steps that

  • Upload
    others

  • View
    0

  • Download
    0

Embed Size (px)

Citation preview

Page 1: databaseanswers.orgdatabaseanswers.org/downloads/Guidelines_for_Gettin… · Web viewA Common Data Model can be used to design a Data Warehouse in a series of incremental Steps that

Guidelines for Getting Started with Teradata from Barry

1. Management Summary................................................................................................................................1

2. Getting Started.............................................................................................................................................2

3. What Have We Learned?............................................................................................................................13

4. The Author..................................................................................................................................................13

Appendix A. Data Model Templates...............................................................................................................14

1. Management SummaryThere are three Phases in the lifecycle of a typical Enterprise Data Warehouse :-

1. Getting Started

2. Assessment of an operational Data Warehouse

3. Business As Usual (BAU) – changes and enhancements

This Paper discusses Phase 1 – Getting Started – and has the objective of promoting discussion of Best Practice.

The later two Phases are described in a separate Paper which is available on demand from Database Answers at [email protected].

We use Transportation as the Application to discuss our Approach but it applies equally well to any other area for which an Industry Data Model is available from Database Answers :-

http://www.databaseanswers.org/data_models/index.htm

Phase 1 – Getting Started

This incorporates a Blueprint and will incorporate any appropriate Teradata Industry Data Model.

It will then involve design of a Conceptual High-level Model and Subject Areas Models.

This Chapter provides material that is suitable for presentations, preparation of a business case, clarification of the requirements or to establish the scope of a Data Warehouse.

Presentations can be prepared for specific Industry Areas using this material, supplemented with ‘Kick-Start’ Data Model Templates in Appendix A for Banking, Communications, Retail, Transport and Web Visit Statistics.

Page 1 20-05-2023 14:36

Page 2: databaseanswers.orgdatabaseanswers.org/downloads/Guidelines_for_Gettin… · Web viewA Common Data Model can be used to design a Data Warehouse in a series of incremental Steps that

Guidelines for Getting Started with Teradata from Barry

2. Getting Started2.1 What is this ?This Chapter describes how to get started with a Teradata Project.

2.2 Why is it important ?It is important because it helps us to understand the overall process and concepts of a Data Warehouse.

2.3 A Blueprint

This is a Blueprint for an Architecture where rooms become Subject Areas in a Data Model.

Page 2 20-05-2023 14:36

Page 3: databaseanswers.orgdatabaseanswers.org/downloads/Guidelines_for_Gettin… · Web viewA Common Data Model can be used to design a Data Warehouse in a series of incremental Steps that

Guidelines for Getting Started with Teradata from Barry

2.3B Another Blueprint

This is a Blueprint for an Architecture where rooms become Subject Areas in a Data Model.

Page 3 20-05-2023 14:36

Page 4: databaseanswers.orgdatabaseanswers.org/downloads/Guidelines_for_Gettin… · Web viewA Common Data Model can be used to design a Data Warehouse in a series of incremental Steps that

Guidelines for Getting Started with Teradata from Barry

Page 4 20-05-2023 14:36

Page 5: databaseanswers.orgdatabaseanswers.org/downloads/Guidelines_for_Gettin… · Web viewA Common Data Model can be used to design a Data Warehouse in a series of incremental Steps that

Guidelines for Getting Started with Teradata from Barry

2.4 A Transport Enterprise Data Model

This shows an extract of the Teradata Transport and Logistics Data Model.

It acted as a starting-point for the creation of a version of the Model for Maersk, which was the Teradata Customer.

The yellow area in this diagram shows an EDM with separate Subject Areas :-

Agreement, Asset, Event, Geography, Party, Service, Shipment and Transportation.

These are shown in larger sizes around the yellow rectangle.

TOP-LEVEL BUSINESS DATA MODEL FOR MAERSK LINE

SERVICE

GEOGRAPHY

ExampleGeo Types :

TRANSPORTATION

AGREEMENT

Example Party Roles :

PARTY

ExamplePartyRoles :

ASSET SHIPMENTEVENT

Business Process Management

Work Process Specification

Workflow:- Actual- Planned

Business Metric

Work Process Actual

Business Process Management

Work Process Specification

Workflow:- Actual- Planned

Work Process Actual

Financial Planning, Analysis, Management and Consuklting

Financial information:- Actual- Multiple Plans

Geographic Defined Area

Site

Country

City

Zone

Service

Geography

Transportation

Asset

Event

Party

Agreement

Shipment

Dated Voyage

Shipment Route Link

Shipment Route Point

Shipment Route

Dated Voyage

Shipment Route Link

Shipment Route Point

Shipment Route

Transport Plan

AgreementTariff

Service Contract

String Role

PartySegment

Vendor

Customer

Employee

Internal Organisation

Carrier

Vessel

Asset

Equipment

Container

Equipment Size/TYpe

Shipment

Transport Document

Shipment Equipment Assignment Cargo Package

Cargo

Cargo Conditioning RequirementEvent

Dated Site Call

Equipment Event

Business Process Workflow Business Process Workflow SLABusiness Process Workflow GL Account Chart of Accounts Balance

Account Transaction

Actual Balance

Financial Plan Balance

Allocation (Rule)

Communication Mode

Calendar & Time

Currency

Unit of Measure

Commodity

Service

StringZone Deployment Plan

Transport Mode Transport Link

Non-Dated Zone Call

Rotation

Rotation Service Assignment

Service

StringZone Deployment Plan

Transport Mode Transport Link

Non-Dated Zone Call

Rotation

Rotation Service Assignment

BPM Data KPI Data

Key Perfpormance Indicator

Financial Data Reference Data

Commodity

Page 5 20-05-2023 14:36

Page 6: databaseanswers.orgdatabaseanswers.org/downloads/Guidelines_for_Gettin… · Web viewA Common Data Model can be used to design a Data Warehouse in a series of incremental Steps that

Guidelines for Getting Started with Teradata from Barry

2.4B A Canonical Data Model

This shows the Database Answers Canonical Data Model which is on this page :-

http://www.databaseanswers.org/data_models/canonical_data_models/index.htm

Page 6 20-05-2023 14:36

Page 7: databaseanswers.orgdatabaseanswers.org/downloads/Guidelines_for_Gettin… · Web viewA Common Data Model can be used to design a Data Warehouse in a series of incremental Steps that

Guidelines for Getting Started with Teradata from Barry

2.5 A Data Architecture and BIReady

This shows six Layers in a Data Architecture for Data Warehouses.

On the left-hand side we show the layers in a Teradata Data Architecture.

On the right-hand side, we use a product called BIReady that provides the functionality within the dotted lines which corresponds to the Data Integration and Data Marts Stages in the Road Map.

In other words, BIReady automates the implementation of ETL and ELT functions, the Data Warehouse and the Data Marts.

BIReady facilities are shown in the dotted square

Page 7 20-05-2023 14:36

Data Governance

Semantic Data Model

Business Intelligence (BI)

Operational Data Stores

3NF Data Warehouse

Dimensional Models

Data Marts

Page 8: databaseanswers.orgdatabaseanswers.org/downloads/Guidelines_for_Gettin… · Web viewA Common Data Model can be used to design a Data Warehouse in a series of incremental Steps that

Guidelines for Getting Started with Teradata from Barry

2.6 Stages in loading Data into a Warehouse

There is a logical sequence to the Stages of loading data into a Warehouse.

The lowest levels of data provide a foundation. The lowest level is Reference Data, such as Currencies and Languages.

Then, after these have been identified, sources and loaded we can move to the Master Data.

This includes Products, Services, Customers and so on.

Then we come to the more dynamic data such as Sales, Transactions and so on.

Best Practice states that we should set up load procedures that can be run so that data at each Stage can be deleted and replaced repetitively.

This shows three Stages of data in a typical Data Warehouse.

Page 8 20-05-2023 14:36

Stage 2) Master Data Products/Services, Customers etc.

Stage 3) Customer Sales (Adjustments, Transactions, etc.)

Stage 1) Reference Data (Currencies, Languages ,etc.)

Page 9: databaseanswers.orgdatabaseanswers.org/downloads/Guidelines_for_Gettin… · Web viewA Common Data Model can be used to design a Data Warehouse in a series of incremental Steps that

Guidelines for Getting Started with Teradata from Barry

2.7 Business Intelligence (Top-down)

Key Performance Indicators (’KPIs’) provide an excellent starting-point for Business Intelligence.

Basic KPIs can be defined for Profitability and Operational Performance.

These can be enhanced in collaboration with the business users to establish commitment.

Profitability can iniitially be defined as a basic difference between Profits and Revenue.

Performance can be measurement of Deliveries being ’On-Time’ against plans agreed in Service Level Agreements (SLAs).

The initial Data Architecture will include this basic structure of KPI and Source Data.

This will provide a simple but flexible framework which can be enhanced in a controlled manner.

We will use this as a starting-point for discussions to establish the requirements with the business users.

Page 9 20-05-2023 14:36

Profitability

Revenue minus Costs

Revenue Data Source

Costs Data Source

Performance

Deliveries against Plans

Dates and Deliverie

Contracts and SLAs

Page 10: databaseanswers.orgdatabaseanswers.org/downloads/Guidelines_for_Gettin… · Web viewA Common Data Model can be used to design a Data Warehouse in a series of incremental Steps that

Guidelines for Getting Started with Teradata from Barry

2.8 Build the Data Warehouse IncrementallyA Common Data Model can be used to design a Data Warehouse in a series of incremental Steps that provide a validation with each Step.

2.8.1 Canonical Data Model

This shows a Common Data Model that provides an excellent Starting Point for building an Enterprise Data Warehouse.

2.8.2 Event Message

We can define Events with corresponding Messages.

For each Message, we can add to the design of our Data Warehouse.

The strength of this approach is that it validates the design of the Data Warehouse.

This Common Data Model defines a Message for each Event.

Page 10 20-05-2023 14:36

Page 11: databaseanswers.orgdatabaseanswers.org/downloads/Guidelines_for_Gettin… · Web viewA Common Data Model can be used to design a Data Warehouse in a series of incremental Steps that

Guidelines for Getting Started with Teradata from Barry

2.8.3 Example Event Message (1)

For example, a Customer purchases one Product at a Store :-

Date Store ID Customer Name Product Quantity Amount

The Business Rules would say :-

1. Each Sale is made to one Customer

2. Each Sale can involve just one Product

3. Each Sale is made at a specific Store

4. Each Sale is made at a specific valid Date and Time which is defined in a Calendar

Therefore we can define this fragment of a normalised Data Model :-

Page 11 20-05-2023 14:36

Stores

Sales

Products

Calendar Customers

Page 12: databaseanswers.orgdatabaseanswers.org/downloads/Guidelines_for_Gettin… · Web viewA Common Data Model can be used to design a Data Warehouse in a series of incremental Steps that

Guidelines for Getting Started with Teradata from Barry

2.8.4 Event Message (2)

In this Event, the Customer buys two Products. Therefore our initial Data Model must be enhanced to have a separate Table for ‘Products Sold’.

When we continue to include more Events, we enhance our Model until it is complete.

Date Store ID Customer Name

Product 1 Quantity 1 Amount 1 Product 2 Quantity 2 Amount 2 Total Amount

This Event changes one of the Business Rules :-

1. Each Sale can involve multiple Products

Therefore we add a separate entity called ’Products Sold’ to our Data Model :-

Page 12 20-05-2023 14:36

Stores

Product

SalesCalendar

Products Sold

Customers

Page 13: databaseanswers.orgdatabaseanswers.org/downloads/Guidelines_for_Gettin… · Web viewA Common Data Model can be used to design a Data Warehouse in a series of incremental Steps that

Guidelines for Getting Started with Teradata from Barry

2.9 Generic Dimensional Model

The existence of Dimensions and Facts makes it possible to define a Generic Design for a range of Dimensional Models.

This example shows six Dimensions and six Facts.

Each of these Dimensions and Facts can be translated from the generic names to specifics, such as Customer, Product, Location, Time Period and so on.

2.10 Simple Data Architecture

A Data Architecture should be composed of separate Components with separate and clearly-defined functionality. The 3NF Data Warehouse implements data integration and provides a ‘Single View of the Truth’ (SVOT). The objective is to show the major Building Blocks in a simple clean Architecture.

A more detailed version is shown in the Data Warehouse Bus Architecture in Chapter 3.

2.11 Sample Kick-Start Data Warehouses

Some sample Data Warehouses are included in Appendix A for a starting-point in discussions.

Page 13 20-05-2023 14:36

ODS Staging Area

3NF Data Warehouse)

Dimensional Models (Data Marts)

Semantic Layer

Page 14: databaseanswers.orgdatabaseanswers.org/downloads/Guidelines_for_Gettin… · Web viewA Common Data Model can be used to design a Data Warehouse in a series of incremental Steps that

Guidelines for Getting Started with Teradata from Barry

3. What Have We Learned?We have learned some very important rules that will help us to get started in an organized manner and put in place a sound foundation that we can build on in a safe and stable manner.

4. The AuthorBarry Williams is the founder and principal consultant with Database Answers.

His company has been providing advice and assistance to blue-chip clients for over 20 years.

His particular interest is in advancing the role of data models as a way of improving communication between the business user community and data management professionals.

As part of this role he publishes best practice on his Database Answers Web site at

http://www.databaseanswers.org/

Page 14 20-05-2023 14:36

Page 15: databaseanswers.orgdatabaseanswers.org/downloads/Guidelines_for_Gettin… · Web viewA Common Data Model can be used to design a Data Warehouse in a series of incremental Steps that

Guidelines for Getting Started with Teradata from Barry

Appendix A. Data Model TemplatesA.1 What is this ?These Templates provide introductory Data Models for some specific Applications.

A.2 Why is it important ?These Templates provide ’Kick-Start’ samples to get up and running quickly.

They also provide samples that can be used to assess existing Data Models for ‘fit for purpose’.

A.3 Banking

A.3.1 Data Warehouse

This shows theTemplate EDW for the analysis of retail Banking.

Page 15 20-05-2023 14:36

Page 16: databaseanswers.orgdatabaseanswers.org/downloads/Guidelines_for_Gettin… · Web viewA Common Data Model can be used to design a Data Warehouse in a series of incremental Steps that

Guidelines for Getting Started with Teradata from Barry

A.3.2 Dimensional Model

This shows theTemplate Dimensional Model for retail Banking.

Page 16 20-05-2023 14:36

Page 17: databaseanswers.orgdatabaseanswers.org/downloads/Guidelines_for_Gettin… · Web viewA Common Data Model can be used to design a Data Warehouse in a series of incremental Steps that

Guidelines for Getting Started with Teradata from Barry

A.3.3 Operational Data Stores (ODS)

This shows the ODS for the analysis of retail Banking.

Page 17 20-05-2023 14:36

Page 18: databaseanswers.orgdatabaseanswers.org/downloads/Guidelines_for_Gettin… · Web viewA Common Data Model can be used to design a Data Warehouse in a series of incremental Steps that

Guidelines for Getting Started with Teradata from Barry

A.4 Communications

A.4.1 Data Warehouse

This shows theTemplate EDW for Communications and, in particular, Telephone Billing .Systems.

Page 18 20-05-2023 14:36

Page 19: databaseanswers.orgdatabaseanswers.org/downloads/Guidelines_for_Gettin… · Web viewA Common Data Model can be used to design a Data Warehouse in a series of incremental Steps that

Guidelines for Getting Started with Teradata from Barry

A.4.2 Dimensional Model

This shows theTemplate Dimensional Model for Communications.

Page 19 20-05-2023 14:36

Page 20: databaseanswers.orgdatabaseanswers.org/downloads/Guidelines_for_Gettin… · Web viewA Common Data Model can be used to design a Data Warehouse in a series of incremental Steps that

Guidelines for Getting Started with Teradata from Barry

A.4.3 Operational Data Stores (ODS)

This shows the ODS for Communications.

Page 20 20-05-2023 14:36

Page 21: databaseanswers.orgdatabaseanswers.org/downloads/Guidelines_for_Gettin… · Web viewA Common Data Model can be used to design a Data Warehouse in a series of incremental Steps that

Guidelines for Getting Started with Teradata from Barry

A.5 Retail Sales

A.5.1 Data Warehouse

This shows the EDW for the analysis of Retail Sales.

Page 21 20-05-2023 14:36

Page 22: databaseanswers.orgdatabaseanswers.org/downloads/Guidelines_for_Gettin… · Web viewA Common Data Model can be used to design a Data Warehouse in a series of incremental Steps that

Guidelines for Getting Started with Teradata from Barry

A.5.2 Dimensional Model

This shows the Dimensioanl Model for the Retail Sales.

A.5.3 Operational Data Stores (ODS)

This diagram shows a sample of extract data for Retail Sales.

Page 22 20-05-2023 14:36

Page 23: databaseanswers.orgdatabaseanswers.org/downloads/Guidelines_for_Gettin… · Web viewA Common Data Model can be used to design a Data Warehouse in a series of incremental Steps that

Guidelines for Getting Started with Teradata from Barry

A.6 Transport

A.6.1 Data Warehouse

This shows the EDW for the analysis of Transport.

Page 23 20-05-2023 14:36

Page 24: databaseanswers.orgdatabaseanswers.org/downloads/Guidelines_for_Gettin… · Web viewA Common Data Model can be used to design a Data Warehouse in a series of incremental Steps that

Guidelines for Getting Started with Teradata from Barry

A.6.2 Dimensional Model

This shows the Dimensional Model for the Transport Subject Area.

Page 24 20-05-2023 14:36

Page 25: databaseanswers.orgdatabaseanswers.org/downloads/Guidelines_for_Gettin… · Web viewA Common Data Model can be used to design a Data Warehouse in a series of incremental Steps that

Guidelines for Getting Started with Teradata from Barry

A.6.3 Operational Data Stores (ODS)

This diagram shows a sample of extract data for Transport.

Page 25 20-05-2023 14:36

Page 26: databaseanswers.orgdatabaseanswers.org/downloads/Guidelines_for_Gettin… · Web viewA Common Data Model can be used to design a Data Warehouse in a series of incremental Steps that

Guidelines for Getting Started with Teradata from Barry

A.7 Web Visit Statistics

A.7.1 Data Warehouse

This shows the EDW for the analysis of Transport.

A.7.2 Dimensional Model

This diagram shows a Dimensional Model for Web Visit Stats.

Page 26 20-05-2023 14:36

Page 27: databaseanswers.orgdatabaseanswers.org/downloads/Guidelines_for_Gettin… · Web viewA Common Data Model can be used to design a Data Warehouse in a series of incremental Steps that

Guidelines for Getting Started with Teradata from Barry

A.7.3 Operational Data Stores (ODS)

This diagram shows a sample of extract data for Web Visit Stats.

Page 27 20-05-2023 14:36