Upload
arijit-samanta
View
214
Download
0
Embed Size (px)
Citation preview
8/11/2019 SBIOVR
1/507
Introduction to theSAS
Business
Intelligence Client Tools
Course Notes
8/11/2019 SBIOVR
2/507
Introduction to the SASBusiness Intelligence Client Tools Course Notes was developed by
Kari Richardson and Eric Rossland. Additional contributions were made by Mark Craver, Christine Vitron,
Bobbie Wagoner, and Kathy Wisniewski. Editing and production support was provided by the
Curriculum Development and Support Department. Hardware support was provided by the Education
Information Systems Department.
SAS and all other SAS Institute Inc. product or service names are registered trademarks or trademarks
of SAS Institute Inc. in the USA and other countries. indicates USA registration.
Other brand and product names are trademarks of their respective companies.
Introduction to the SASBusiness Intelligence Client Tools Course Notes
Copyright 2004 by SAS Institute Inc., Cary, NC 27513, USA. All rights reserved. Printed in the
United States of America. No part of this publication may be reproduced, stored in a retrieval system,
or transmitted, in any form or by any means, electronic, mechanical, photocopying, or otherwise, without
the prior written permission of the publisher, SAS Institute Inc.
Book code 60366, course code SBIOVR_003, prepared date 09Nov04.
8/11/2019 SBIOVR
3/507
For Your Information iii
Table of Contents
Course Description .......................... .......................... ........................... .......................... ........ vi
Prerequisites....................... ........................... .......................... ........................... ................... vii
General Conventions............................ ........................... .......................... ........................... . viii
Chapter 1 Introduction to the SAS
Intelligence Platform................................. 1-1
1.1 SAS Business Intelligence Overview ........................... .......................... ....................... 1-3
1.2 Business Intelligence Security............................ .......................... ........................... ..... 1-7
Chapter 2 Overview of the SAS
Business Intelligence Clients........................ 2-1
2.1 Introduction.............. ........................... ........................... .......................... ................... 2-3
2.2 BI Tools Java Clients....................... .......................... ........................... .................... 2-7
2.3 BI Tools Windows Clients............. .......................... ........................... ..................... 2-17
2.4 BI Tools Browser Clients........................... ........................... .......................... ........ 2-22
Chapter 3 Introduction to the Orion Star Data ................................................... 3-1
3.1 Overview....................... .......................... ........................... ........................... .............. 3-3
3.2 Data Models..... .......................... ........................... .......................... .......................... 3-12
3.3 Case Study ........................ .......................... ........................... .......................... ......... 3-16
Chapter 4 Overview of the SAS
Management Console.................................... 4-1
4.1 Introduction to the SAS Management Console....................................... ....................... 4-3
4.2 How SAS Management Console Works............ ........................... ........................... ...... 4-6
4.3 Introduction to SAS Management Console Plug-Ins ......................... .......................... 4-12
4.4 Working with the User Interface......... ......................... ......................... ...................... 4-15
8/11/2019 SBIOVR
4/507
iv For Your Information
Chapter 5 Using SAS
ETL Studio ...................................................................... 5-1
5.1 SAS ETL Studio Overview......... ........................... ............................ .......................... 5-3
5.2 The SAS ETL Studio Interface .......................... ........................... .......................... ..... 5-9
5.3 Advanced SAS ETL Studio Features (Self-Study) ........................ ............................ .. 5-84
5.4 Solutions to Exercises........ .......................... ........................... ........................... ........ 5-91
Chapter 6 SAS
OLAP Cube Studio.................................................................... 6-1
6.1 Overview of SAS OLAP Cube Studio........................... ............................ ................... 6-3
6.2 SAS OLAP Cube Studio Interface ........................ ........................... .......................... 6-17
6.3 SAS OLAP Cube Studio and Microsoft Excel......................... ........................... ........ 6-44
6.4 Solutions to Exercises........ .......................... ........................... ........................... ........ 6-56
Chapter 7 Working with SAS
Stored Processes............................................... 7-1
7.1 An Overview of Stored Processes............... .......................... ........................... ............. 7-3
7.2 Creating and Registering a Stored Process............................ .......................... ............ 7-17
7.3 Creating a Stored Process from a SAS ETL Studio Job (Self-Study).......................... 7-33
7.4 Solutions to Exercises........ .......................... ........................... ........................... ........ 7-40
Chapter 8 SAS
Enterprise Guide
3................................................................... 8-1
8.1 New Features of SAS Enterprise Guide 3....... .......................... .......................... .......... 8-3
8.2 Exploring SAS Enterprise Guide...................................... ......................... ................... 8-9
8.3 Using the Stored Process Manager .......................... .......................... ......................... 8-24
8.4 Working with OLAP Cubes in SAS Enterprise Guide........... ......................... ............. 8-42
Chapter 9 Using SAS
Add-In for Microsoft Office............................................. 9-1
9.1 Overview of the SAS Add-In for Microsoft Office........................ ................................ 9-3
8/11/2019 SBIOVR
5/507
For Your Information v
9.2 Using the SAS Add-In for Microsoft Office................. .......................... ....................... 9-6
9.3 Solutions to Exercises........ .......................... ........................... ........................... ........ 9-43
Chapter 10 SAS
Information Map Studio .......................................................... 10-1
10.1 Overview of the SAS Information Map Studio ......................... ......................... ......... 10-3
10.2 Using SAS Information Map Studio.............. .......................... .......................... ....... 10-12
10.3 Solutions to Exercises........ .......................... ........................... ........................... ...... 10-52
Chapter 11 Using SAS
Web Report Studio....................................................... 11-1
11.1 Overview of SAS Web Report Studio....... ............................ ........................... ........... 11-3
11.2 Using SAS Web Report Studio.................................... ........................... .................... 11-5
11.3 Solutions to Exercises.......................... .......................... ........................... ................11-27
Chapter 12 Using SAS
Information Delivery Portal.......................................... 12-1
12.1 Overview of the SAS Information Delivery Portal ........................ ......................... ..... 12-3
12.2 Using the SAS Information Delivery Portal ......................... ......................... ............ 12-14
8/11/2019 SBIOVR
6/507
vi For Your Information
Course Description
This training provides participants with a hands-on overview of features in SAS 9 in the areas of
foundation technologies, data warehousing, intelligent storage and business intelligence.
To learn more
A full curriculum of general and statistical instructor-based training is available
at any of the Institutes training facilities. Institute instructors can also provide
on-site training.
For information on other courses in the curriculum, contact the SAS Education
Division at 1-919-531-7321, or send e-mail to [email protected]. You can also
find this information on the Web at support.sas.com/training/ as well as in the
Training Course Catalog.
For a list of other SAS books that relate to the topics covered in this
Course Notes, USA customers can contact our SAS Publishing Department at
1-800-727-3228 or send e-mail to [email protected]. Customers outside the
USA, please contact your local SAS office.
Also, see the Publications Catalog on the Web at support.sas.com/pubs for a
complete list of books and a convenient order form.
8/11/2019 SBIOVR
7/507
For Your Information vii
Prerequisites
None.
8/11/2019 SBIOVR
8/507
viii For Your Information
General Conventions
This section explains the various conventions used in presenting text, SAS language syntax, and examples
in this book.
Typographical Conventions
You will see several type styles in this book. This list explains the meaning of each style:
UPPERCASE ROMAN is used for SAS statements and other SAS language elements when they
appear in the text.
italic identifies terms or concepts that are defined in text. Italic is also used for
book titles when they are referenced in text, as well as for various syntax and
mathematical elements.
bold is used for emphasis within text.
monospace is used for examples of SAS programming statements and for SAS character
strings. Monospace is also used to refer to variable and data set names, fieldnames in windows, information in fields, and user-supplied information.
select indicates selectable items in windows and menus. This book also uses icons
to represent selectable items.
Syntax Conventions
The general forms of SAS statements and commands shown in this book include only that part of the
syntax actually taught in the course. For complete syntax, see the appropriate SAS reference guide.
PROC CHARTDATA = SAS-data-set;
HBAR| VBARchart-variables;
RUN;
This is an example of how SAS syntax is shown in text:
PROCand CHART are in uppercase bold because they are SAS keywords.
DATA= is in uppercase to indicate that it must be spelled as shown.
SAS-data-set is in italic because it represents a value that you supply. In this case, the value must bethe name of a SAS data set.
HBARand VBARare in uppercase bold because they are SAS keywords. They are separated by avertical bar to indicate they are mutually exclusive; you can choose one or the other.
chart-variablesis in italic because it represents a value or values that you supply.
represents optional syntax specific to the HBAR and VBAR statements. The anglebrackets enclose the slash as well as optionsbecause if no options are specified you do not include
the slash.
RUNis in uppercase bold because it is a SAS keyword.
8/11/2019 SBIOVR
9/507
For Your Information ix
8/11/2019 SBIOVR
10/507
x For Your Information
8/11/2019 SBIOVR
11/507
Chapter 1 Introduction to theSASIntelligence Platform
1.1 SAS Business Intelligence Overview..........................................................................1-3
1.2 Business Intelligence Security ....................................................................................1-7
8/11/2019 SBIOVR
12/507
1-2 Chapter 1 Introduction to the SAS Intelligence Platform
8/11/2019 SBIOVR
13/507
1.1 SAS Business Intelligence Overview 1-3
1.1 SAS Business Intelligence Overview
3
What Is Business Intelligence?Business Intelligence is a set of software tools andapplications that enable business users and analysts tointeract with their company data in an easy, efficient, andeffective manner.
SAS Business Intelligence includes
a set of client applications designed for a specific typeof business or analyst
SAS server processes designed to provide specific
types of services for the client applications a centralized metadata management facility.
4
Multiple Tier Environment
The SAS Intelligence Platform consists of a multiple tierenvironment that is typically represented by the
client tier
middle tier
server tier.
ClientClient
TierTier
MiddleMiddle
TierTier
ServerServer
TierTier
8/11/2019 SBIOVR
14/507
1-4 Chapter 1 Introduction to the SAS Intelligence Platform
5
Client TierIn the most basic terms, if an application is installed onthe machine that the user is sitting at, that machine is the
client tier.
There are three different types of SAS client applications.
JavaJavaClientsClients
run in a Java runrun in a Java run--time environment (JRE) and aretime environment (JRE) and areinstalled on the machine where they will be used.installed on the machine where they will be used.
WindowsWindows
ClientsClientsrun in the Microsoft Windows environment and arerun in the Microsoft Windows environment and are
installed on the machine where they will be used.installed on the machine where they will be used.
...
BrowserBrowser
ClientsClientsrun in a Web browser by connecting to a Javarun in a Web browser by connecting to a Java
Application Server orApplication Server or ServletServlet Container on the middleContainer on the middle
tier. Only the Web browser is installed on the localtier. Only the Web browser is installed on the localmachine.machine.
The SAS Business Intelligence (BI) client tools cannot execute SAS code on their own. They have to
request code submission and other services from a SAS server.
6
Middle TierThe middle tier is where the Web applications reside and
execute. This tier also contains the infrastructure thatsupports the execution of these applications, such as the
Java Application Server or the Java Servlet Container
SAS Web Infrastructure Kit (WIK)
webDAV server.
Java Application Server orJava Servlet Container
Java Application Server or
Java Servlet Container
SAS Web Infrastructure KitSAS Web Infrastructure Kit
webDAV ServerwebDAV Server
WebDAV (Web-Based Distributed Authoring and Versioning) is a set of extensions to the HTTPprotocol that allows users to collaboratively edit and manage files on remote Web servers.
8/11/2019 SBIOVR
15/507
1.1 SAS Business Intelligence Overview 1-5
7
Server TierThe server tier is the machine where one or moreSAS servers is installed and accessed by the BI tools.
There are different types of SAS servers, including:
MetadataMetadata
ServerServerenables centralized metadata delivery andenables centralized metadata delivery and
management to SAS applications across themanagement to SAS applications across theenterprise.enterprise.
WorkspaceWorkspace
ServerServerexecutes SAS code on behalf of the clientexecutes SAS code on behalf of the client
applications.applications.
...
Stored ProcessStored Process
ServerServerexecutes and delivers results from SAS Storedexecutes and delivers results from SAS Stored
Processes.Processes.
OLAP ServerOLAP Server deliversdelivers presummarizedpresummarized cubes of data to OLAPcubes of data to OLAPclients.clients.
8
SAS Intelligence Platform
ClientTier
MiddleTier
ServerTier
SAS ETL StudioSAS OLAP Cube StudioSAS Management ConsoleSAS Information Map Studio
Web Infrastructure KitWeb Infrastructure Kit
Java ServletContainer
Java Servlet
ContainerwebDAVServer
webDAV
Server
HTTP ServerHTTP Server SDKSDK
SAS Enterprise GuideSAS Add-In for Microsoft Office
SAS Web Report Studio
SAS Information Delivery Portal
Workspace
Server
WorkspaceServer
Stored Process
Server
Stored ProcessServer
OLAP
Server
OLAPServer
MetadataServer
MetadataServer
SAS9Foundation
SAS9Foundation
SAS/CONNECTServer
SAS/CONNECTServer
This is simplified diagram of the SAS Intelligence Platform.
8/11/2019 SBIOVR
16/507
1-6 Chapter 1 Introduction to the SAS Intelligence Platform
9
xxxxSAS InformationDelivery Portal
xxxxSAS Web
Report Studio
xxxSAS Add-In forMicrosoft Office
xxxxSAS Enterprise
Guide
xxxSAS Information
Map Studio
xxSAS OLAP
Cube Studio
xxxSAS ETL Studio
SAS/
CONNECTServer
OLAPServer
Stored
ProcessServer
WorkspaceServer
MetadataServer
SAS Foundation Servers
Client Application
Clients Use of Foundation Servers
8/11/2019 SBIOVR
17/507
1.2 Business Intelligence Security 1-7
1.2 Business Intelligence Security
11
Security ConceptsIn the SAS Intelligence environment, security consists of:
AuthenticationAuthentication the process of verifying the identity of athe process of verifying the identity of a
person or process within the guidelinesperson or process within the guidelines
of a specific security policyof a specific security policy
AuthorizationAuthorization the process of determining whichthe process of determining which
permissions the user has for whichpermissions the user has for which
resources.resources.
...
Authentication is a prerequisite for authorization.
12
Security Concepts
An authentication provider is a technology that serversor applications can use to verify that users are who theysay they are.
By default, the authentication provider for a SAS server is
the host operating system of the machine on which theserver is running.
The SAS Metadata Server and SAS OLAP Server can also use alternative authentication providers.SAS Web applications run under third-party servers that can use a variety of authentication providers.
8/11/2019 SBIOVR
18/507
1-8 Chapter 1 Introduction to the SAS Intelligence Platform
8/11/2019 SBIOVR
19/507
Chapter 2 Overview of theSASBusiness Intelligence Clients
2.1 Introduction ..................................................................................................................2-3
2.2 BI Tools Java Clients.................................................................................................2-7
2.3 BI Tools Windows Clients .......................................................................................2-17
2.4 BI Tools Browser Clients.........................................................................................2-22
8/11/2019 SBIOVR
20/507
2-2 Chapter 2 Overview of the SAS Business Intelligence Clients
8/11/2019 SBIOVR
21/507
2.1 Introduction 2-3
2.1 Introduction
3
Client Tier ReviewIn the most basic terms, if an application is installed onthe machine that the user is sitting at, that machine is theclient tier.
There are three types of SAS client applications.
Java Clients
SAS ETL StudioSAS OLAP Cube StudioSAS Management ConsoleSAS Information Map Studio
Windows Clients
SAS Enterprise GuideSAS Add-In for Microsoft Office
Browser Clients
SAS Web Report StudioSAS Information Delivery Portal
...
4
Information ConsumersPower UsersIT SupportBusiness Analyst (Jacques)
No DBMS or programming
Strong Excel
Ad hoc queries
OLAP
Create reports
Publish reports
Custom reports
Understands business metrics
IT Admin (Ahmed)
Useradministration
Reportingadministration
Softwareadministration
Report Administrator(Robert)
Report Builder
Schedule reports
Monitor queue
Business Viewmanager
Understandsphysicaldata model
SQL programmer
Data Modeler(Marcel)
Power User (Gloria)
Some DBMS and programming Strong Excel
Ad hoc queries
Custom reports
Modeling
Analytics
Detail data Understands business domain
Business Intelligence Personas
Operational Consumers
Sales
Marketing
Customer Service
Finance Tech Support
C-level Execs
Annotation
Middle Management(Henri)
Drill down Manipulation
Annotation
8/11/2019 SBIOVR
22/507
2-4 Chapter 2 Overview of the SAS Business Intelligence Clients
5
IT Support
IT Admin (Ahmed)
Useradministration
Reportingadministration
Softwareadministration
Report Administrator(Robert)
Report Builder Schedule reports Monitor queue
Business Viewmanager
Understandsphysicaldata model
SQL programmer
Data Modeler(Marcel)
BI Clients Used by Information Technology
SAS InformationMap Studio
SAS ETL Studio
SAS Management Console
...
SAS Web Report Studio
These slides represent an example of which BI tools might be used by each group and does notserve as a recommendation.
6
Power Users
BI Clients Used by Power Users
SAS ETL Studio
SAS Enterprise Guide
SAS Add-In forMicrosoft Office SAS OLAP Cube Studio
...
SAS InformationMap Studio
SAS Web Report Studio
Business Analyst (Jacques)
No DBMS or programming
Strong Excel
Ad hoc queries
OLAP
Create reports
Publish reports Custom reports
Understands business metrics
Power User (Gloria)
Some DBMS and programming
Strong Excel
Ad hoc queries
Custom reports
Modeling
Analytics
Detail data
Understands business domain
8/11/2019 SBIOVR
23/507
2.1 Introduction 2-5
7
Information Consumers
Operational Consumers
Sales Marketing Customer Service Finance Tech Support
C-level Execs
Annotation
Middle Management(Henri)
Drill down
Manipulation
Annotation
BI Clients Used by Information Consumers
SAS Add-In forMicrosoft Office
SAS Web Report Studio
...
SAS InformationDelivery Portal
8
SAS Display ManagerThe traditional SAS windowing environment, SAS DisplayManager, is still available in SAS9 to develop and runSAS programs.
8/11/2019 SBIOVR
24/507
2-6 Chapter 2 Overview of the SAS Business Intelligence Clients
9
SAS Enterprise GuideSAS programs can also be developed using thepoint-and-click interface in SAS Enterprise Guide.
10
SAS 9.1 BI ClientsThe remainder of this chapter is a quick introduction to thebusiness intelligence client tools in SAS9.
All of the tools will be covered in-depth in the followingchapters:
SAS Management Console Chapter 4
SAS ETL Studio Chapter 5
SAS OLAP Cube Studio Chapter 6
SAS Enterprise Guide Chapter 8
SAS Add-In for Microsoft Office Chapter 9
SAS Information Map Studio Chapter 10
SAS Web Report Studio Chapter 11
SAS Information Delivery Portal Chapter 12
SAS Stored Processes will be discussed in Chapter 7.
8/11/2019 SBIOVR
25/507
2.2 BI Tools Java Clients 2-7
2.2 BI Tools Java Clients
12
BI Tools Java Clients
SAS Java clients run in a Java run-time environment
(JRE) and are installed on the machine where they will beused.
Client Tier
SAS ETL StudioSAS OLAP Cube StudioSAS Management ConsoleSAS Information Map Studio
SAS Enterprise GuideSAS Add-In for Microsoft Office
SAS Web Report StudioSAS Information Delivery Portal
13
SAS Management Console
8/11/2019 SBIOVR
26/507
2-8 Chapter 2 Overview of the SAS Business Intelligence Clients
14
SAS Management ConsoleThe SAS Management Console provides a singleinterface for many administrative tasks for the SAS
System. Specific administrative tasks are supported byplug-ins tothe SASManagementConsole.
15
Metadata Manager Plug-InThe Metadata Manager plug-in provides a mechanismto control the SAS Metadata Server.
8/11/2019 SBIOVR
27/507
2.2 BI Tools Java Clients 2-9
16
Data Library Manager Plug-InData resources can be identified and managed using theData Library Manager plug-in.
By identifying data sources in metadata, the administrator has the ability to control access and change
the location of data sources without changing the applications that use the data. Resources, tables, reports,
and stored processes can be secured by user or by group.
17
User Manager Plug-In
The User Manager plug-in is used to define and manageusers and groups.
A user can be a member of one or more groups.
8/11/2019 SBIOVR
28/507
2-10 Chapter 2 Overview of the SAS Business Intelligence Clients
18
Stored Process Manager Plug-InThe Stored Process Manager plug-in is used to registerstored processes, which are a special kind of SAS
program that can be run from a variety of SAS9 clients.
Many properties can be specified when defining a SAS program as a stored process, including where it
should run, where the program is stored, the name of the program, and how the results of the program will
behave.
19
Server Manager Plug-In
SAS servers and spawners can be defined, configured,and managed using the Server Manager plug-in.
8/11/2019 SBIOVR
29/507
8/11/2019 SBIOVR
30/507
2-12 Chapter 2 Overview of the SAS Business Intelligence Clients
22
SAS ETL Studio WizardsSAS ETL Studio Wizards can simplify common tasks forthe user including the definition of source and target
tables and the creation of OLAP cubes.
...
23
SAS ETL Studio Process EditorThe Process Editor provides a graphical user interface(GUI) for designing the jobs and creating the SAS codefor loading the tables in the data warehouse.
8/11/2019 SBIOVR
31/507
2.2 BI Tools Java Clients 2-13
24
SAS ETL Studio Source EditorThe Source Editor is used for viewing and editing the SAScode created for the jobs designed in the Process Editor.
25
SAS OLAP Cube Studio
8/11/2019 SBIOVR
32/507
2-14 Chapter 2 Overview of the SAS Business Intelligence Clients
26
SAS OLAP Cube StudioSAS OLAP Cube Studio is used to create OLAP cubes.The Cube Designer provides a point-and-click interface
for cube creation.
The same Cube Designer is also available in SAS ETL Studio.
27
SAS OLAP Cube StudioSource tables can be defined in SAS OLAP Cube Studiousing the same Source Designer found in SAS ETLStudio.
8/11/2019 SBIOVR
33/507
8/11/2019 SBIOVR
34/507
2-16 Chapter 2 Overview of the SAS Business Intelligence Clients
30
SAS Information Map StudioAn information map can create relationships betweentables and organize data efficiently for end users.
8/11/2019 SBIOVR
35/507
2.3 BI Tools Windows Clients 2-17
2.3 BI Tools Windows Clients
32
BI Tools Windows Clients
SAS Windows clients run in the Microsoft Windows
environment and are installed on the machine where theywill be used.
Client Tier
SAS ETL StudioSAS OLAP Cube Studio
SAS Management ConsoleSAS Information Map Studio
SAS Enterprise GuideSAS Add-In for Microsoft Office
SAS Web Report StudioSAS Information Delivery Portal
33
SAS Enterprise Guide
8/11/2019 SBIOVR
36/507
2-18 Chapter 2 Overview of the SAS Business Intelligence Clients
34
SAS Enterprise GuideSAS Enterprise Guide Version 3 has been completelyredesigned to be more intuitive and easier to navigate.
35
SAS Enterprise Guide and SAS StoredProcessesSAS Enterprise Guide can be used to run SAS StoredProcesses and is the preferred method for creating them.
8/11/2019 SBIOVR
37/507
2.3 BI Tools Windows Clients 2-19
36
SAS Enterprise Guide and OLAPSAS Enterprise Guide also provides a powerful viewerfor displaying OLAP cubes and navigating through the
dimensions.
37
SAS Add-In for Microsoft Office
8/11/2019 SBIOVR
38/507
2-20 Chapter 2 Overview of the SAS Business Intelligence Clients
38
SAS Add-In for Microsoft OfficeThe SAS Add-In for Microsoft Office provides access toSAS data sets from Excel.
39
SAS Add-In for Microsoft OfficeThe SAS Add-In for Microsoft Office provides manyoptions, including the ability to subset and/or sort data.
8/11/2019 SBIOVR
39/507
2.3 BI Tools Windows Clients 2-21
40
SAS Add-In for Microsoft OfficeThe SAS Add-In for Microsoft Office also allows SASStored Processes to be run on a server and the results
returned toeither Excelor Word.
SAS does not need to be installed on the local machine in order to access SAS data sets or SAS Stored
Processes using the SAS Add-In for Microsoft Office.
8/11/2019 SBIOVR
40/507
8/11/2019 SBIOVR
41/507
2.4 BI Tools Browser Clients 2-23
44
SAS Web Report StudioSAS Web Report Studiois a Web-based,
wizard-drivenreporting tool.
SAS Web Report Studio requires data to be defined in an Information Map and does not work directly
with a data table. Both tabular reports and graphs can be created using the Report Wizard or the Report
Builder.
45
SAS Web Report Studio Creating Reports
The Report Wizard uses a step-by-step process to helpcreate a report.
8/11/2019 SBIOVR
42/507
2-24 Chapter 2 Overview of the SAS Business Intelligence Clients
46
SAS Web Report Studio Viewing Reports
47
SAS Information Delivery Portal
8/11/2019 SBIOVR
43/507
8/11/2019 SBIOVR
44/507
2-26 Chapter 2 Overview of the SAS Business Intelligence Clients
50
SAS ID Portal Pages and PortletsThe pages and portlets of the SAS Information DeliveryPortal can surface all types of information including
information maps stored processes
reports.
51
SAS ID Portal CustomizationThe SAS Information Delivery Portal can be customizedwith logos, company colors, and so on.
8/11/2019 SBIOVR
45/507
Chapter 3 Introduction to theOrion Star Data
3.1 Overview .......................................................................................................................3-3
3.2 Data Models ................................................................................................................3-12
3.3 Case Study..................................................................................................................3-16
8/11/2019 SBIOVR
46/507
3-2 Chapter 3 Introduction to the Orion Star Data
8/11/2019 SBIOVR
47/507
3.1 Overview 3-3
3.1 Overview
3
Course DataThe data used in thiscourse is from Orion StarSports & Outdoors, which
is a fictitious retail company selling sports andoutdoor products
has retail stores in many countries throughout theworld
sells products in retail stores, through catalog mailorders, and over the Internet
utilizes the Orion Star Club to track purchases,enabling analyses of buying patterns and providinga basis for CRM activities and targeted productofferings.
8/11/2019 SBIOVR
48/507
3-4 Chapter 3 Introduction to the Orion Star Data
4
Orion Star OrganizationThe Orion Star organization hierarchy has several levels:
country
company
department
group.
The United States hosts the headquarters as well as most of the administrative functions. Each country
has a retail company associated with it, and all offices were operational within the whole time range of
the data (1998 through early 2003) except the Belgian office, which opened in 1999.
8/11/2019 SBIOVR
49/507
3.1 Overview 3-5
5
Orion Star EmployeesInformation stored for each employee includes:
employee ID
job title
hire date
terminationdate
address
gender
salary.
8/11/2019 SBIOVR
50/507
3-6 Chapter 3 Introduction to the Orion Star Data
6
Approximately 5,500 different sports and outdoorsproducts are offered at Orion Star. The products are
organized intothe hierarchy of:
product
line
productcategory
productgroup
product.
Orion Star Merchandise
8/11/2019 SBIOVR
51/507
3.1 Overview 3-7
7
Orion Star MerchandiseEach product has a cost and sales price. Price changehistory is maintained by recording start and end dates
for each price.
All prices arein U.S. dollars.
8/11/2019 SBIOVR
52/507
3-8 Chapter 3 Introduction to the Orion Star Data
8
Orion Star CustomersOrion Star has approximately 100,000 customers spreadacross many countries. Customer address information is
stored in anumber oftables andincludes
street
city
state
postal code
country.
Customer names, as well as street addresses, are fictitious, but real city names, county/region names, and
country names are used.
This data is controlled by means of pointers (ID columns), as is often seen where geographical data is
gathered from official sources that are maintained externally to the organization. This approach facilitates
changes to information such as postal codes and street names.
8/11/2019 SBIOVR
53/507
3.1 Overview 3-9
9
Orion Star OrdersMost of the approximately 750,000 orders are sales toOrion Star Club members for whom customer information
is recorded.
Each ordercontains oneor more order
lines (one lineper productpurchased).
8/11/2019 SBIOVR
54/507
3-10 Chapter 3 Introduction to the Orion Star Data
10
Orion Star SuppliersAll of the Orion Star products have a pointer to a supplier.There are 64 suppliers, based in different countries, but
only onesupplierper product.
8/11/2019 SBIOVR
55/507
3.1 Overview 3-11
11
Orion Star Information ConsumersThere are many people and groups at Orion Star who useSAS software, including
information systems business users
management andC-level executives.
The Information Systems Department has built a data warehouse by extracting data from Online
Transactional Processing Systems (OLTP) and Enterprise Resource Planning (ERP) systems, as
well as data from external sources.
The business users use the data warehouse
to create data marts
to perform advanced analyses
for querying and reporting.Management and C-level executives use the SAS Business Intelligence tools to access the informationstored in the data warehouse.
8/11/2019 SBIOVR
56/507
3-12 Chapter 3 Introduction to the Orion Star Data
3.2 Data Models
13
Orion Star Data ModelsThe Orion Star data is organized two ways:
NormalizedNormalized
(relational)(relational)
modelmodel
represents the extracts from OLTP,represents the extracts from OLTP,
ERP, and other systems. This data canERP, and other systems. This data can
be transformed, cleansed, and loadedbe transformed, cleansed, and loaded
into a data warehouse.into a data warehouse.
DimensionalDimensional
modelmodelrepresents a data warehouse thatrepresents a data warehouse that
consists of a star schema design. Thisconsists of a star schema design. This
data is well suited for use in businessdata is well suited for use in business
intelligence applications and for ad hocintelligence applications and for ad hocquerying and reporting.querying and reporting.
...
8/11/2019 SBIOVR
57/507
3.2 Data Models 3-13
14
The Normalized (Relational) Data Model
8/11/2019 SBIOVR
58/507
3-14 Chapter 3 Introduction to the Orion Star Data
The normalized data model contains a number of Start_Dateand End_Datecolumns that are used
to ensure an accurate history of changes. Therefore, an item can exist more than once in a particular tablebut only once within a particular time interval.
The Product and Organization tables are organized hierarchically. They use product-level and
organization-level IDs to determine the placement of an entity within the appropriate hierarchy.
8/11/2019 SBIOVR
59/507
3.2 Data Models 3-15
15
The Dimensional Data Model
The dimensional data model is the result of an ETL (Extract, Transform, and Load) process and represents
one possible data mart. The dimensional model is a star schema with a fact table that contains orders. The
dimension tables represent data for time, geography, customers, products, suppliers, and employees.
8/11/2019 SBIOVR
60/507
8/11/2019 SBIOVR
61/507
8/11/2019 SBIOVR
62/507
3-18 Chapter 3 Introduction to the Orion Star Data
8/11/2019 SBIOVR
63/507
8/11/2019 SBIOVR
64/507
8/11/2019 SBIOVR
65/507
8/11/2019 SBIOVR
66/507
4-4 Chapter 4 Overview of the SAS Management Console
4
What Is the SAS Management Console?SAS Management Console's single interface can be usedto perform the administrative tasks required to create and
maintain an integrated environment rather than using aseparate administrative interface for each application inyour enterprise intelligence environment.
5
Tasks Managed by SAS Management ConsoleSAS Management Console can be used to manage avariety of tasks, including
server definitions
library definitions user definitions
resource access controls
metadata repositories
SAS licenses
job schedules
XMLMaps.
8/11/2019 SBIOVR
67/507
4.1 Introduction to the SAS Management Console 4-5
6
Metadata DefinitionsSAS Management Console manages resources andcontrols them by creating and maintaining metadata
definitions for each resource or control.
The metadata definitions createdin SAS Management Console arestored in a repository on a SASMetadata Server, where they areavailable for other applicationsto use.
SAS Metadata Server
7
SAS Metadata ServerThe SAS Metadata Server is a multiuser server thatsurfaces metadata from one or more repositories toSAS Open Metadata Interface client applications via
the SAS Open Metadata Architecture.
The SAS Open Metadata Architecture enables
separate applications to exchange metadata
centralized management of an organization'smetadata resources.
The SAS Open Metadata Architecture is a general-purpose metadata management facility that providescommon metadata services for applications.
8/11/2019 SBIOVR
68/507
8/11/2019 SBIOVR
69/507
4.2 How SAS Management Console Works 4-7
10
Metadata ProfileWhen SAS Management Console is invoked, a metadataprofile is selected that specifies
the metadata server to which metadata definitions willbe written
the active metadata repository
information required to connect to the metadataserver.
When a metadata profile is defined, the server to which the application connects is specified along with
the user ID it uses to connect to the server. When an application opens a metadata profile, it has access
to the metadata on the specified server based on the security credentials of the user ID under which theconnection was made.
8/11/2019 SBIOVR
70/507
4-8 Chapter 4 Overview of the SAS Management Console
11
Connecting to SAS Metadata Servers
SAS Management Console
Metadata server DataServMetadata repository MainReposUser ID - sastest
Metadata Profile
Metadata Server DataServ
MetadataRepository
MainRepos
ARepos
BRepos
Metadata Server DevServ
...
After you open the metadata profile to connect to the metadata server, you can begin using the
SAS Management Console to create metadata definitions. As you create definitions, they are stored
in the active repository on the metadata server.
12
Connecting to SAS Metadata Servers
Each metadata server can contain multiple repositories,and the SAS Management Console can be used to storedefinitions in any of the repositories.
The active repository is the repository that is currentlyselected and is receiving metadata definitions from the
application.
8/11/2019 SBIOVR
71/507
4.2 How SAS Management Console Works 4-9
13
RepositoriesSAS Management Console can be used to set upa hierarchy of repositories on the metadata server,
consisting of different repository types including:
FoundationFoundation
RepositoriesRepositoriesspecify the parent repository for all otherspecify the parent repository for all other
repositories on the server. Thisrepositories on the server. This
repository contains resource definitionsrepository contains resource definitions
that are used throughout the enterprise.that are used throughout the enterprise.
ProjectProject
RepositoriesRepositoriesspecify development repositories thatspecify development repositories that
can be used for change management.can be used for change management.
...
A project repository is a development repository that has dependencies on a foundation or customrepository.
In addition to these two repository types, a third type can also be defined:
Custom
Repositories
specifies repositories that are dependent on the foundation repository or other
custom repositories and accesses and uses resources from the parent(s).
A repository that is dependent upon another repository inherits metadata from the repository on which itis dependent.
8/11/2019 SBIOVR
72/507
8/11/2019 SBIOVR
73/507
8/11/2019 SBIOVR
74/507
4-12 Chapter 4 Overview of the SAS Management Console
4.3 Introduction to SAS Management Console Plug-Ins
19
SAS Management Console Plug-InsSAS Management Console plug-ins are Java files thatprovide specific administrative functions by creatingspecific types of metadata definitions.
SAS Management Console includes a standard set ofplug-ins. Additional plug-ins can be installed to accessand manage other SAS or third-party applications.
20
SAS Management Console Plug-Ins
SAS Management Console plug-ins are grouped intotwo main categories:
Environment Managementplug-ins for configuring SAS resources such asservers, metadata, libraries, and users
Application Managementplug-ins for configuring SAS applications.
When a plug-in is installed, it is added automatically
to the appropriate category.
8/11/2019 SBIOVR
75/507
4.3 Introduction to SAS Management Console Plug-Ins 4-13
21
SAS Management Console Plug-InsSome of the plug-ins provided with SAS ManagementConsole include:
AuthorizationAuthorizationManagerManager
defines access rules to control how usersdefines access rules to control how usersand groups can access metadata definitions.and groups can access metadata definitions.
Data LibraryData Library
ManagerManagercreates definitions for SAS libraries andcreates definitions for SAS libraries and
database schemas.database schemas.
MetadataMetadata
ManagerManagerprovides functions for defining metadataprovides functions for defining metadata
repositories on the active server, managingrepositories on the active server, managing
resource templates, and creating jobs toresource templates, and creating jobs to
promote or replicate a metadata repository.promote or replicate a metadata repository.
ScheduleSchedule
ManagerManager
allows you to create schedules for runningallows you to create schedules for running
jobs created in SAS ETL Studio.jobs created in SAS ETL Studio.
...
Additional plug-ins include:
Server Manager creates definitions for servers, including SAS application servers, database
servers, and enterprise servers.
User Manager creates definitions for users and user groups.
License Manager allows you to view information about the SAS licenses installed on a machine, as
well as details about the machine.
XMLMap Manager allows you to import and create XMLMaps, which are XML files that describe
how the SAS XML LIBNAME engine should interpret XML markup.
8/11/2019 SBIOVR
76/507
4-14 Chapter 4 Overview of the SAS Management Console
22
Plug-In InformationBecause each plug-in has its own functions, theSAS Management Console menu bar and toolbar change
whenever you select a node within the navigation tree.
Different nodes within a single plug-in can activatedifferent menu items and tools.
The Help for each plug-in is only available when the plug-in is active. For example, in order to view the
Help for the Metadata Manager plug-in, you must first select the Metadata Manager plug-in or an object
inside the plug-in in the navigation tree.
8/11/2019 SBIOVR
77/507
4.4 Working with the User Interface 4-15
4.4 Working with the User Interface
24
SAS Management Console User Interface
Menu bar
Toolbar
Context bar
Navigation tree
Display area
Status line.
The SAS Management Console user interface consistsof six major parts:
8/11/2019 SBIOVR
78/507
8/11/2019 SBIOVR
79/507
4.4 Working with the User Interface 4-17
26
SAS Management Console User Interface
Toolbar
...
The toolbar provides a set of five standard tools for
moving up one level in the navigation tree
opening the selected object
copying
pasting
deleting.
In addition, plug-ins can also add tools to the toolbar. The tools that are specific to a plug-in are only
available when the plug-in is selected in the navigation tree. The plug-in can also provide different toolsdepending on the object that is selected under the plug-in. Refer to the Help on each of the plug-ins for
more information about plug-in tools.
8/11/2019 SBIOVR
80/507
4-18 Chapter 4 Overview of the SAS Management Console
27
SAS Management Console User Interface
Context Bar
...
The context bar identifies the active repository, which is the metadata repository to which you are
storing metadata definitions. You can select the repository that you want to access from the Repository
drop-down list.
8/11/2019 SBIOVR
81/507
4.4 Working with the User Interface 4-19
28
SAS Management Console User Interface
Navigation Tree
...
The navigation tree displays an organized list of all of the installed plug-ins and the objects that are
associated with each plug-in. The plug-ins are organized into two main categories:
Environment Management contains plug-ins for defining metadata that applies to the overall
SAS environment (such as servers, libraries, and metadata repositories).
Application Management contains plug-ins for defining metadata that applies to specific
applications.
Selecting a plug-in icon in the navigation tree activates the plug-in and displays information for that
plug-in in the display area. To display the objects associated with the plug-in in the navigation tree,click the plus sign next to the plug-in icon to expand its contents (not all plug-ins are expandable).
8/11/2019 SBIOVR
82/507
4-20 Chapter 4 Overview of the SAS Management Console
29
SAS Management Console User Interface
DisplayArea
...
The display area is used to display detailed information about objects selected in the navigation tree. The
information displayed depends not only on the object selected in the navigation tree, but also on the plug-
in that is active. Some plug-ins (such as the User Manager) utilize the display area for entering metadata.
If you select an object (plug-in or other object) in the navigation tree that contains sub-folders, the display
area lists the folders. If you select a folder in the navigation tree, the display area lists the objects that arecontained in the folder.
If you select an object in the navigation tree that does not contain any sub-folders, the display areadisplays any metadata definitions that are associated with that object. For example, if you select a server
definition in the navigation tree, the display area lists all of the connections that have been defined for the
server.
8/11/2019 SBIOVR
83/507
4.4 Working with the User Interface 4-21
30
SAS Management Console User Interface
Status Line
...
The status line, at the bottom of the SAS Management Console window, displays the following
information:
number of objects selected or the number of objects contained in the currently selected folderor plug-in
domain and user ID defined in the metadata profile used to log on to the active metadata server
machine and port number of the active metadata server.
8/11/2019 SBIOVR
84/507
8/11/2019 SBIOVR
85/507
4.4 Working with the User Interface 4-23
3. Enter the username and password provided by the instructor.
The values shown above will be different than those used in class.
The SAS Management Console opens:
8/11/2019 SBIOVR
86/507
4-24 Chapter 4 Overview of the SAS Management Console
Define a SAS Data Library
4. Expand the Data Library Managerplug-in.
5. Right-click on SAS Librariesand select New Libraryfrom the pop-up menu. The New LibraryWizard opens.
6. Select SAS Base Engine Libraryas the type of library to define.
Select .
7. Specify Orion Star Gold Libraryas theName
of the new library.
Select .
8/11/2019 SBIOVR
87/507
4.4 Working with the User Interface 4-25
8. Specify the properties of this library:
a. Type orgoldin the Libreffield.
b. Select to add a new path to the list of available items.
c. In the New Path Specification window, select .
Using the button only allows you to browse the machine where the SAS
Management Console is running. If the path is located on a different server, it must be
typed in manually.
8/11/2019 SBIOVR
88/507
4-26 Chapter 4 Overview of the SAS Management Console
d. In the Browse window, navigate to C:\Workshop\winsas\sbip\Orion_Star_15AUG03\orgold .
Select .
e. Select in the New Path Specification window.
f. Verify that is path is now in the Selected itemslist.
Select .
8/11/2019 SBIOVR
89/507
4.4 Working with the User Interface 4-27
9. If prompted, select SASMainas the SAS Application server.
Select .
10.Review the settings:
Select .
8/11/2019 SBIOVR
90/507
4-28 Chapter 4 Overview of the SAS Management Console
11.The new library should now appear in the display area of SAS Management Console.
12.To view the LIBNAME statement that will be created from the metadata, right-clickOrion Star Gold Libraryand select Display Libname.
Select .
8/11/2019 SBIOVR
91/507
4.4 Working with the User Interface 4-29
32
Register Metadata for Existing TablesWhen you create metadata to define a library in the SASManagement Console, you define the library only and not
any tables that it contains.
To import metadata to define an existing data source, usethe Source Designer.
33
Source DesignerThe Source Designer is a wizard-driveninterface that enables you to define thephysical layout of existing tables using a
data dictionary or metadata informationfrom the source system.
The result of running the Source Designersuccessfully is a metadata registrationthat describes the data source.
8/11/2019 SBIOVR
92/507
4-30 Chapter 4 Overview of the SAS Management Console
Register Metadata for Existing Tables
This demonstration shows how to register the metadata for the tables in the Source Tables Library.
1. To invoke the Source Designer, right-click Orion Star Gold Libraryand select ImportTables.
2. Select SASMainas the server to connect to, because it is the server where the data resides.
3. Enter the user name and password provided by the instructor.
Select .
8/11/2019 SBIOVR
93/507
8/11/2019 SBIOVR
94/507
4-32 Chapter 4 Overview of the SAS Management Console
6. Review the settings.
Select .
7. The tables are now listed in the display area of SAS Management Console.
8/11/2019 SBIOVR
95/507
4.4 Working with the User Interface 4-33
Set the Authorization Level for the New Library
The metadata about libraries and tables are available to all users by default; however, access to the data
within a table is not. Permissions must be added for the desired users or group of users to be able to viewthe data within a specific table or all tables in a specific library.
8. To set permissions for all tables in the new library, right-click Orion Star Gold Libraryandselect Properties.
9. When the Properties window opens, select Authorization.
8/11/2019 SBIOVR
96/507
4-34 Chapter 4 Overview of the SAS Management Console
10.On the Authorization tab, grant the Read permission to the PUBLIC group by selecting theappropriate check box.
11.Select to save the changes.
12.Select FileExitto close SAS Management Console.
8/11/2019 SBIOVR
97/507
Chapter 5 Using SASETL Studio
5.1 SAS ETL Studio Overview............................................................................................5-3
5.2 The SAS ETL Studio Interface .....................................................................................5-9
5.3 Advanced SAS ETL Studio Features (Self-Study) ....................................................5-84
5.4 Solutions to Exercises ...............................................................................................5-91
8/11/2019 SBIOVR
98/507
5-2 Chapter 5 Using SAS ETL Studio
8/11/2019 SBIOVR
99/507
5.1 SAS ETL Studio Overview 5-3
5.1 SAS ETL Studio Overview
3
What Is SAS ETL Studio?SAS ETL Studio, a Java application, is a visual designtool that helps organizations quickly build, implement,and manage ETL processes from source to destination,regardless of the data sources or platforms.
Users can standardize metadata across the organizationand perform in-depth transformations with minimalprogramming or manual work to meet enterprise dataintegration requirements and to support business andanalytic intelligence.
4
What Is SAS ETL Studio?
SAS ETL Studio enables you to perform the followingtasks:
the Extraction of data from operational data stores
the Transformation of this data
the Loading of the extracted data into your datawarehouse or data mart.
8/11/2019 SBIOVR
100/507
8/11/2019 SBIOVR
101/507
5.1 SAS ETL Studio Overview 5-5
An Example of working with change-management in SAS ETL Studio
The following is a general description of what it is like to work under change-management control in
SAS ETL Studio.
When you open a metadata profile whose default repository is change-managed, metadata in the
change-managed repository is displayed in the Inventory tree and the Custom tree. Metadata in theProject repository is displayed in the Project tree.
The Project tree contains any metadata that has been checked out of the change-managed repositoryand any new metadata objects that have been added.
Typically, users will not have the appropriate privilege to directly add or update metadata directly ina change-managed repository. You must check metadata objects out of and into the change-managed
repository.
To update an existing metadata object that is under metadata source control, use the Inventory tree orthe Custom tree to check out the object from the change-managed repository. The object will appearin the Project tree, where you can update the object's metadata.
After an object has been checked out by one person, it is locked so that it cannot be updated by
another person until the object has been checked back in. You do not have to check out a library in order to add metadata about a table in that library.
If two or more parent objects share a common object such as a table, a primary key, a note, or adocument, and you check out one of these parent objects, only you will be able to check out the
other parent objects that share the common object. (Other users will not be able to access the
common object that you have checked out, and the shared object is required in order to check outa parent object that uses that shared object.) For example, suppose that two jobs use the same source
table. If you check out Job 1, only you will be able to check out Job 2.
When you add a new metadata object, it goes directly into the Project repository. The object willappear in the Project tree, where you can update the object's default metadata.
The Fetch option is used to get a copy of a metadata object for testing purposes. The copied object is
not checked out, so the original object is not locked. The copied object can be modified, but it cannotbe checked in. Fetched items will remain in the Project repository until they are deleted.
When you are finished working with all objects in the Project repository (and the Project tree), usethe Check In feature to remove the objects from the Project repository and store them in the change-
managed repository. A check-in operation checks in all of the metadata objects that are in the Project
repository. You cannot check in some objects and leave other objects in the Project repository,
therefore it may be convenient to work with small sets of related objects in the Project repository.
To remove a metadata object from the Project repository, use the Delete option or the Undo CheckOut option. To remove a metadata object from both the Project repository and the change-managedrepository, use the Destroy option.
8/11/2019 SBIOVR
102/507
5-6 Chapter 5 Using SAS ETL Studio
7
SAS ETL Studio: Data Surveyor WizardsOptional Data Surveyor wizards can be licensed thatprovide access to the metadata in enterprise applications,
such as PeopleSoft
SAP R/3
Siebel
Oracle Applications.
8
SAS ETL Studio: Metadata CWM CompliantThe metadata maintained by SAS ETL Studio is CWM(Common Warehouse Metamodel) compliant andportable to other CWM-compliant applications. Likewise,
metadata from other CWM-compliant applications (that is,data modeling tools) can be imported easily into SAS ETLStudio.
For example, you could use a data modeling tool to create a model for a set of tables, save the model inCWM format, and then use the Metadata Importer wizard to import the model into a metadata repository.
In SAS ETL Studio, you could view the properties of each table and verify that the appropriate metadata
was imported. The tables could then be used in SAS ETL Studio jobs.
8/11/2019 SBIOVR
103/507
5.1 SAS ETL Studio Overview 5-7
9
SAS ETL Studio: Data QualitySAS ETL Studio is fully integrated with the data qualitysoftware from DataFlux Corporation. Both products now
use the same Quality Knowledge Base (QKB), whichcontains rules, routines, and schemes necessary tointegrate data quality into the ETL process.
The Process Library in SAS ETL Studio contains two data quality transformation templates: Create
Match Code and Apply Lookup Standardization. These templates enable you to increase the value
of your data through data analysis and data cleansing.
10
Extending SAS ETL Studio Functionality
The SAS ETL Studio functionality is extended by Javaplug-ins packaged with the product.
Further extensions can be implemented by
writing additional plug-ins(Java programming required)
using the Transformation Generator Wizard(no Java programming required).
For details about creating your own Java plug-ins, see the plug-in appendix to the SASETL Studio
User's Guide.
8/11/2019 SBIOVR
104/507
8/11/2019 SBIOVR
105/507
5.2 The SAS ETL Studio Interface 5-9
5.2 The SAS ETL Studio Interface
14
SAS ETL Studio: The InterfaceSAS ETL Studio is a Java client developed to control theETL process. The interface has several ease-of-usefeatures including
copy and paste in any text field
multiple windows can be open at one time (includingmultiple process flow diagrams)
Windows look and feel
wizard-driven interfaces.
15
Tools, Menus, and Online Help
SAS ETL Studio takes full advantage of toolbars and pull-down menus. The icons available on the toolbar dependon which window is active from within the interface.
Menus andTools
8/11/2019 SBIOVR
106/507
5-10 Chapter 5 Using SAS ETL Studio
16
The Shortcut BarOne of the most significant features of SAS ETL Studiois the new
process-drivenfunctionality.
Processes areavailable via a
Shortcut bar onthe far left sideof the mainSAS ETL Studiowindow.
Shortcut Bar
17
The Shortcut BarThe Shortcut bar is populated with icons for each task anETL user would typically perform, including:
Source DesignerSource Designer defines metadata about thedefines metadata about the
source(s) for a process.source(s) for a process.
Metadata ImporterMetadata Importer imports metadata from otherimports metadata from other
applications.applications.
Metadata ExporterMetadata Exporter exports metadata to be used byexports metadata to be used by
other applications.other applications.
Process DesignerProcess Designer defines metadata about the ETLdefines metadata about the ETL
processes.processes.
...
continued...
8/11/2019 SBIOVR
107/507
5.2 The SAS ETL Studio Interface 5-11
18
The Shortcut Bar
Target DesignerTarget Designer defines metadata about thedefines metadata about the
target table(s) to be created bytarget table(s) to be created by
the process.the process.
OptionsOptions provides numerous options forprovides numerous options for
the SAS ETL Studio user tothe SAS ETL Studio user to
customize the look and feel of thecustomize the look and feel of the
application.application.
...
19
Tree ViewThe SAS ETL Studio Tree View enables you to
view the metadataassociated with
the currentmetadatarepository
display differentviews or trees
of the currentrepository.
Tree View
8/11/2019 SBIOVR
108/507
5-12 Chapter 5 Using SAS ETL Studio
20
Tree ViewThere are several tabs available in the tree view area:
...
continued...
Inventory TreeInventory Tree lists the metadata objects in thelists the metadata objects in thedefault metadata repository (anddefault metadata repository (and
any dependant repositories),any dependant repositories),
organized byorganized by predeterminedpredetermined
groupings.groupings.
The items in the Inventory tree view are grouped according to a set of predefined groups that an ETL user
will typically define from within SAS ETL Studio (tables, OLAP cubes, jobs, and so on).
21
Tree View
...
continued...
Custom TreeCustom Tree lists the metadata objects in thelists the metadata objects in the
default metadata repositorydefault metadata repository
(and any dependant(and any dependant
repositories), organized byrepositories), organized by
useruser--defineddefined groupings ofgroupings of
objects.objects.
The items in the Custom tree view are grouped according to whatever groups the ETL user has defined
(Source Tables, Target Tables, Sales Cubes, and so on). These groups could be determined by object
types, intended uses for the objects, data marts, lines of business, and so on.
8/11/2019 SBIOVR
109/507
5.2 The SAS ETL Studio Interface 5-13
22
Tree View
Process Library TreeProcess Library Tree lists the available datalists the available datatransformations to be used intransformations to be used in
the ETL process.the ETL process.
...
Selecting one of the transformations creates a template in the process view, with drop zones into which
the user may drag-and-drop items.
Addititional tabs may be available in the tree view area depending on the selected options andsettings.
8/11/2019 SBIOVR
110/507
5-14 Chapter 5 Using SAS ETL Studio
23
Process Library TreeThe Process Library tree displays a collection oftransformation templates.
There are four collections (folders) of templates that areprovided with SAS ETL Studio:
Analysis
Data Transforms
Output
Publish.
The Analysis folder in the Process Library tree includes the following templates:
Correlations creates an output table containing correlation statistics
Correlations Report creates an HTML report containing summary correlation statistics
Distribution Analysis creates an output table containing a distribution analysis
Distribution Analysis Report creates an HTML report containing a distribution analysis
Frequency an output table containing frequency information
Frequency Report creates an HTML report containing frequency information
Summary Statistics creates an output table containing summary statistics
Summary Statistics Report creates an HTML report containing summary statistics
Summary Tables Report creates an HTML report containing summary tables
8/11/2019 SBIOVR
111/507
5.2 The SAS ETL Studio Interface 5-15
The Data Transforms folder in the Process Library tree includes the following templates:
Append creates a single target table by combining data from several source tables.
Apply Lookup
Standardizations
applies standardization schemes to the source data during the
transformation step.
Create Match Codes creates match codes for specified source columns.
Data Transfer moves data directly from one machine to another.
Data Validation cleanses data before it is added to a data warehouse or data mart.
Extract selects multiple sets of rows from a source and writes those rows to a
target.
Fact Table Lookup loads source data into a fact table and translates business keys intogenerated keys.
Key Effective Date enables change tracking in intersection tables.
Mining Results integrates a SAS Enterprise Miner model into a SAS ETL Studio datawarehouse. Typically used to create target tables from a SAS Enterprise
Miner model.
Return Code Check provides status-handling logic at a desired point in the process flowdiagram for a job.
SAS Rank ranks one or more numeric variables in the source and stores the ranks in
the target.
SAS Sort reads data from a source, sorts it, and writes the sorted data to a target.
SAS Splitter selects multiple sets of rows from one source and writes each set of rowsto a different target.
SCD Type 2 Locator loads source data into a dimension table, detects changes between source
and target rows, updates change tracking columns, and applies generatedkey values. This transformation implements slowly changing dimensions.
SQL Join selects multiple sets of rows from one or more sources and writes each set
of rows to a single target.
Standardize an output table containing data standardized to a particular number.
Surrogate Key
Generator
loads a target, adds generated whole number values to a surrogate key
column, and optionally sorts and saves the source based on the values inthe business key column(s).
Transpose creates an output table containing transposed data.
User-Written Code retrieves a user-written transformation.
8/11/2019 SBIOVR
112/507
8/11/2019 SBIOVR
113/507
5.2 The SAS ETL Studio Interface 5-17
24
Process Designer ViewThe Process Designer window is the workspace for
building ETL processes. The Process Designer viewappears as a final step in the Process Designer wizard.
Once the process is defined, the Process Designer viewis populated with icons that represent the chosenprocesses.
The Process Designer window can be used to
view SQL source code
review the SAS log (from submitting jobs)
view the resulting output from running a SAS job.
25
Process Designer and Overview Windows
Process
DesignerView
Overview
window
...
8/11/2019 SBIOVR
114/507
5-18 Chapter 5 Using SAS ETL Studio
26
Overview WindowThe Overview window shows you the complete processfrom the process view.
From within the Overview window, you can control whichpart of the process is displayed in the Process Viewwindow.
27
SAS ETL Studio WizardsThere are shortcuts which invoke wizards that aid theuser in performing various tasks with SAS ETL Studio.
Some of these wizards are Source Designer
Target Designer
New Job.
8/11/2019 SBIOVR
115/507
5.2 The SAS ETL Studio Interface 5-19
28
Source DesignerThe Source Designer is a wizard-driveninterface that enables you to define the
physical layout of existing tables using adata dictionary or metadata informationfrom the source system.
The result of running the Source Designer
successfully is a metadata registrationthat describes the data source.
Unlike the SAS Warehouse Administrator product, which requires that you create Operational Data
Definitions (ODDs) one table at a time, the Source Designer enables you to input the metadata from
multiple tables with one pass through the wizard.
If the table is stored in an ERP System, the corresponding Data Surveyor will be invoked. TheData
Surveyorcontains information about the metadata of the source system.
If the table is a flat file, you can use a wizard to define the record layout.
The Source Designer can be invoked by following these steps:
Click on the Source Designer icon from the Shortcut bar. Select ToolsSource Designer. Right-click on the Source Tablesgroup and select New ObjectSource Table.
8/11/2019 SBIOVR
116/507
5-20 Chapter 5 Using SAS ETL Studio
29
Target DesignerThe Target Designer is a wizard that
allows metadata to be entered for a target.
In designing the target table, you can
access any metadata about anysource tables and columns registeredin the metadata repository
override any metadata that wasimported from another source and addnew columns to the target table
create indexes on the target table
being created.
The target designer can also specify
the location of the target table table-specific options to be stored in the metadata.
The Target Designer can be invoked by following these steps:
Click on the Target Designer icon from the Shortcut bar. Select ToolsTarget Designer.
Right-click on the Target Tablesgroup and select New ObjectTarget Table.
8/11/2019 SBIOVR
117/507
5.2 The SAS ETL Studio Interface 5-21
30
Target DesignerThe person designing the target table has full control overthe type of table being built.
The types of targets that can be built include
database types that are supported by the
SAS/ACCESS products
SAS data sets (including both data files and data views)
SAS/SHARE data sets
SPDE tables.
SPDE Scalable Performance Data Engine
8/11/2019 SBIOVR
118/507
5-22 Chapter 5 Using SAS ETL Studio
31
New Job WizardThe New Job wizard enables you todefine the metadata necessary to run an
ETL process to load data into a target ortargets.
An empty job can be created, without importing metadata from any predefined target table, by clicking
without selecting the metadata for a table. Objects from the tree view can then be dragged
and dropped into the empty job.
You can invoke the New Job wizard by following these steps:
Click on the Process Designer icon from the Shortcut bar. Select ToolsProcess Designer.
Right-click on the Jobs group and select New ObjectJob.
8/11/2019 SBIOVR
119/507
5.2 The SAS ETL Studio Interface 5-23
32
Additional WizardsOther wizards available to provide assistance with varioustasks in SAS ETL Studio include
Metadata Importer Metadata Exporter
Cube Designer
Transformation Generator wizard.
You can also install optional data surveyor wizards, whichprovide access to the metadata in enterprise applications,such as PeopleSoft, SAP R/3, Siebel, and Oracle.
These wizards can be invoked from the Shortcut Bar or the Tools pull-down menu of SAS ETL Studio:
Metadata Importer enables you to import metadata from other applications that support the
Common Warehouse Metamodel (CWM) format. Optional bridges are
available for other formats.
Metadata Exporter enables you to export metadata to other applications that support the Common
Warehouse Metamodel (CWM) format. Optional bridges are available for other
formats.
Cube Designer is a target designer that enables you to create a cube, a data store that supports
Online Analytical Processing.
Transformation
Generator
enables you to create a user-written, SAS code transformation and make it
available in the Process Library tree.
8/11/2019 SBIOVR
120/507
5-24 Chapter 5 Using SAS ETL Studio
33
Options WindowThe Options window can be used to define standardsettings for the SAS ETL Studio interface.
There are several tabs in the Options window:
General
Process
Editor
Metadata Tree
SAS Server
Data Quality.
Using the Options window tabs, you can control the following:
General controls the overall display of SAS ETL Studio and the functionality of the wizards.
Process controls the look and feel of the process editor window.
Editor controls the look and feel of the SAS text editor for generated code.
Metadata Tree controls the tree view in the middle portion of SAS ETL Studio.
SAS Server specifies the SAS server to be used with SAS ETL Studio, and tests the connection to
the server.
Data Quality specifies options to control the locale, the types of files to be used by the data qualityplug-ins, and the location of the Quality Knowledge Base (QKB).
8/11/2019 SBIOVR
121/507
5.2 The SAS ETL Studio Interface 5-25
34
Course Case Study TasksRecall the case study tasks diagram discussed earlier.Each of these tasks involves either reading or writing (or
both) metadata.
Register
Source Tables
DefineData Libraries
CreateETL Jobs
DefineTarget Tables
Create
OLAP Cubes
View and AnalyzeData
CreateStored Processes
Create Reports
CreateInformation Maps
Use the Information
Delivery Portal
Metadata
35
SAS ETL Studio Case Study TasksSAS ETL Studio will concentrate on the following fourtasks:
Register
Source Tables
Define
Data Libraries
CreateETL Jobs
DefineTarget Tables
CreateOLAP Cubes
View and AnalyzeData
Create
Stored Processes
Create Reports
CreateInformation Maps
Use the InformationDelivery Portal
Metadata
8/11/2019 SBIOVR
122/507
5-26 Chapter 5 Using SAS ETL Studio
36
SAS ETL Studio Case Study
DefineData Libraries (+)
These tasks will be performed in sequence:
Define SourceTables Metadata
Define TargetTables Metadata
Define andRun Jobs
1.
2.
3.
4.
37
SAS ETL Studio Case Study Setup Tasks
DefineData Libraries (+)
Define SourceTables Metadata
Define TargetTables Metadata
Define andRun Jobs
1.
2.
3.
4.
Build Custom Tree Groupings
Libraries
JobsSource TablesTarget Tables
Define Additional Library Definitions
Target Tables Library
Source Tables Library
Demo
Exercises
Demo
Exercises
...
8/11/2019 SBIOVR
123/507
5.2 The SAS ETL Studio Interface 5-27
38
SAS ETL Studio Case Study Define Sources
DefineData Libraries (+)
Define SourceTables Metadata
Define TargetTables Metadata
Define andRun Jobs
1.
2.
3.
4.
The Source Designerdefinesmetadata for the source tables.
Orders
Order_Item
Product_List
Demo
Exercises
...
39
SAS ETL Studio Case Study Define Targets
DefineData Libraries (+)
Define SourceTables Metadata
Define TargetTables Metadata
Define andRun Jobs
1.
2.
3.
4.
The Target Designerdefinesmetadata for the target tables.
OrderFact
ProductDim
Demo*
Exercises
* Some derived columns for OrderFactare completed in the exercises.
...
8/11/2019 SBIOVR
124/507
5-28 Chapter 5 Using SAS ETL Studio
40
SAS ETL Studio Case Study Define Jobs
DefineData Libraries (+)
Define SourceTables Metadata
Define TargetTables Metadata
Define andRun Jobs
1.
2.
3.
4.The Process Designerdefinesmetadata for jobs that contain theprocess flow diagrams necessaryto load the target tables.
Demo
Exercises
Populate theOrderFact table
Populate theProductDim table
...
41
Creating the OrderFact TableThe OrderFact table will be created from the Orders andOrder_Item tables.
Target Table
Source Tables
...
8/11/2019 SBIOVR
125/507
5.2 The SAS ETL Studio Interface 5-29
42
Creating the OrderFact TableThe source tables, Orders and Order_Item, will becombined using the SQL Join transformation.
SQL Join
The SQL Join will beused to define computedcolumns.
...
43
Creating the OrderFact TableThe table that is the result of the SQL Join will then beloaded into the OrderFact table.
Loader
...
8/11/2019 SBIOVR
126/507
5-30 Chapter 5 Using SAS ETL Studio
44
Creating the ProductDim Table
The ProductDim table will be created from the
Product_List table.
Target Table
Source Table
...
45
Creating the ProductDim Table
The Extract transformation will be used so that acomputed column can be defined.
SAS Extract
...
8/11/2019 SBIOVR
127/507
8/11/2019 SBIOVR
128/507
5-32 Chapter 5 Using SAS ETL Studio
Create a Logical Grouping and Adding a Library Definition
This demonstration shows how to create a logical grouping in the Custom tree view to organize yourwarehouse metadata. It also shows how to create a library definition and store it in the new logical group.
1. Invoke SAS ETL Studio by selecting StartAll ProgramsSASSAS ETL Studio 9.1.
2. Select the metadata profile BIArchitecture, then select .
8/11/2019 SBIOVR
129/507
5.2 The SAS ETL Studio Interface 5-33
3. If prompted, enter the username and password provided by the instructor.
Select .
The SAS ETL Studio desktop opens:
Select to close the Tip Of The Day window.
8/11/2019 SBIOVR
130/507
5-34 Chapter 5 Using SAS ETL Studio
The first step in using SAS ETL Studio to build the Orion Star marketing data mart is to create groupsin the Custom tree view. These groups are simply logical groupings of the metadata that comprise the
repository. There is no corresponding physical file structure for the groups that are created. The groups
can be defined using whatever logical grouping is relevant to your organization.
4. Select the Custom tab in the tree view area of SAS ETL Studio.
5. Select to expand the Foundation repository.
The Foundation repository already has an Ungrouped group, which already contains several items.
These were created during the configuration of the SAS Intelligence Architecture. Do not delete
these files, as some will be used in future demonstrations and exercises.
6. Right-click Foundationand select New Groupfrom the pop-up menu. The default name of the newgroup is Untitled.
8/11/2019 SBIOVR
131/507
5.2 The SAS ETL Studio Interface 5-35
7. Rename the new group to Libraries.
a. Right-click on the Untitledgroup.
b. Select Renamefrom the pop-up menu.
c. Type Librariesas the new name for the group.
8/11/2019 SBIOVR
132/507
5-36 Chapter 5 Using SAS ETL Studio
Add a Library Definition
8. From the Custom tab in the tree view area of the SAS ETL Studio desktop, select FileNew Object.
9. In the New Object Wizard, select Librarytype, then .
8/11/2019 SBIOVR
133/507
5.2 The SAS ETL Studio Interface 5-37
10.Under SAS Libraries, choose SAS Base Engine Library, then select .
11.Type Target Tables Libraryin the Name field and select .
8/11/2019 SBIOVR
134/507
8/11/2019 SBIOVR
135/507
5.2 The SAS ETL Studio Interface 5-39
14.Select to close the Browse window. Verify the path selected above is now in the Selected
items list.
Select .
8/11/2019 SBIOVR
136/507
5-40 Chapter 5 Using SAS ETL Studio
15.Select SASMainas the server, then .
16.Review the information and select .
8/11/2019 SBIOVR
137/507
5.2 The SAS ETL Studio Interface 5-41
The library definition is added to the Ungrouped group.
8/11/2019 SBIOVR
138/507
5-42 Chapter 5 Using SAS ETL Studio
17.Move the new library to the Librariesgroup using one of the methods below:
Click on Target Tables Library, drag-and-drop it on top of the Librariesgroup.
Right-click on Target Tables Library, select Group, expand Foundation, select
Libraries, select .
8/11/2019 SBIOVR
139/507
5.2 The SAS ETL Studio Interface 5-43
Exercises
Create several logical grouping in the Custom tree view to organize your warehouse metadata.
If you did not follow along with the demonstration, complete the steps of the demonstration to define the
Libraries group and Target Tables Library.
1. Create Logical Groups
Create the following logical groups:
Source Tables
Target Tables
Jobs
Libraries (if you did not follow along with the demonstration).
2. Add Library Definitions
Use the New Library Wizard to create a metadata definition for the following libraries and move themto the appropriate group:
Library Name: Source Tables Library
Libref: Srcdata
Library Type: SAS Base Engine Library
Path: C:\Workshop\winsas\sbip\Orion_Star_15AUG03\ordetail
Server: SASMain
8/11/2019 SBIOVR
140/507
5-44 Chapter 5 Using SAS ETL Studio
Library Name: Target Tables Library
(if you did not follow along with the demonstration)
Libref: Target
Library Type: SAS Base Engine Library
Path: C:\Workshop\winsas\sbiovr\DataMarts
This path needs to be created.
Server: SASMain
8/11/2019 SBIOVR
141/507
5.2 The SAS ETL Studio Interface 5-45
50
Using the Source DesignerThe Source Designer is a wizard that generates metadatafor one or more selected tables, based on the physical
structure of the table(s)
The Source Designer can be used to specify metadata forany existing table, not just tables used as data sources forETL jobs.
The Source Designer supports a number of specific data formats that have been licensed for your
site. The specific format will often give better results than the generic format, which has not beenoptimized for particular kinds of data. Use the format that most closely matches your data
whenever possible.
51
Using the Source DesignerThe Source Designer is an easy to use wizard interface.
...
8/11/2019 SBIOVR
142/507
5-46 Chapter 5 Using SAS ETL Studio
52
SAS ETL Studio Case Study Define Sources
DefineData Libraries (+)
Define SourceTables Metadata
Define TargetTables Metadata
Define andRun Jobs
1.
2.
3.
4.
The Source Designerdefinesmetadata for the source tables.
Orders
Order_Item
Product_List
Demo
Exercises
8/11/2019 SBIOVR
143/507