70
SAP BusinessObjects Data Services Getting Started Guide SAP BusinessObjects Data Services XI 3.2 SP1 (12.2.1)

Data sevice architecture

Embed Size (px)

DESCRIPTION

SAP data Service Architecture

Citation preview

Page 1: Data sevice architecture

SAPBusinessObjectsData ServicesGetting Started Guide

SAP BusinessObjects Data Services XI 3.2 SP1 (12.2.1)

Page 2: Data sevice architecture

© 2009 SAP AG. All rights reserved.SAP, R/3, SAPNetWeaver, Duet, PartnerEdge,ByDesign, SAP Business ByDesign, and other SAP products and services

Copyright

mentioned herein as well as their respective logos are trademarks or registeredtrademarks of SAP AG in Germany and other countries. Business Objects and theBusiness Objects logo, BusinessObjects, Crystal Reports, Crystal Decisions, WebIntelligence, Xcelsius, and other Business Objects products and services mentionedherein as well as their respective logos are trademarks or registered trademarksof Business Objects S.A. in the United States and in other countries. BusinessObjects is an SAP company.All other product and service names mentioned arethe trademarks of their respective companies. Data contained in this documentserves informational purposes only. National product specifications may vary.Thesematerials are subject to change without notice. These materials are provided bySAP AG and its affiliated companies ("SAP Group") for informational purposesonly, without representation or warranty of any kind, and SAP Group shall not beliable for errors or omissions with respect to the materials. The only warranties forSAP Group products and services are those that are set forth in the expresswarranty statements accompanying such products and services, if any. Nothingherein should be construed as constituting an additional warranty.

2009-10-24

Page 3: Data sevice architecture

ContentsOverview of SAP BusinessObjects Data Services 5Chapter 1

SAP BusinessObjects Data Services and the SAP BusinessObjects solutionportfolio........................................................................................................6Software benefits.........................................................................................7

Unification with the platform...................................................................7Ease of use and high productivity..........................................................8High availability and performance..........................................................8

Associated software.....................................................................................8SAP BusinessObjects Metadata Management......................................9

Interfaces.....................................................................................................9

Architecture 11Chapter 2

Standard components................................................................................12Designer...............................................................................................14Repository............................................................................................14Job Server............................................................................................15Engine..................................................................................................15Access Server......................................................................................16Address Server.....................................................................................16Administrator........................................................................................16Metadata Reports applications.............................................................17Metadata Integrator..............................................................................19Service.................................................................................................20SNMP Agent.........................................................................................21Adapter SDK........................................................................................21

Optional components.................................................................................21

SAP BusinessObjects Data Services Getting Started Guide 3

Page 4: Data sevice architecture

Multi-user..............................................................................................21Management tools.....................................................................................22

License Manager..................................................................................22Repository Manager.............................................................................22Server Manager....................................................................................22

Operating system platforms.......................................................................23Distributed architecture..............................................................................23

Host names and port numbers.............................................................25

Glossary 27Appendix A

Index 69

4 SAP BusinessObjects Data Services Getting Started Guide

Contents

Page 5: Data sevice architecture

Overview of SAPBusinessObjects DataServices

1

Page 6: Data sevice architecture

About this section

This section introduces SAP BusinessObjects Data Services and explainsits place in the SAP BusinessObjects solution portfolio.Related Topics• SAP BusinessObjects Data Services and the SAP BusinessObjects solutionportfolio• Software benefits• Interfaces

SAP BusinessObjects Data Services andthe SAP BusinessObjects solutionportfolio

The SAP BusinessObjects solution portfolio delivers extreme insight throughspecialized end-user tools on a single, trusted business intelligence platform.This entire platform is supported by SAP BusinessObjects Data Services.On top of SAP BusinessObjects Data Services, the SAP BusinessObjectssolution portfolio layers the most reliable, scalable, flexible, and manageablebusiness intelligence (BI) platform which supports the industry's bestintegrated end-user interfaces: reporting, query and analysis, andperformance management dashboards, scorecards, and applications.

True data integration blends batch extraction, transformation, and loading(ETL) technology with real-time bi-directional data flow across multipleapplications for the extended enterprise.

By building a relational datastore and intelligently blending direct real-timeand batch data-access methods to access data from enterprise resourceplanning (ERP) systems and other sources, SAP has created a powerful,high-performance data integration product that allows you to fully leverageyour ERP and enterprise application infrastructure for multiple uses.

SAP provides a batch and real-time data integration system to drive today'snew generation of analytic and supply-chain management applications. Usingthe highly scalable data integration solution provided by SAP, your enterprisecan maintain a real-time, on-line dialogue with customers, suppliers,

6 SAP BusinessObjects Data Services Getting Started Guide

Overview of SAP BusinessObjects Data Services1 SAP BusinessObjects Data Services and the SAP BusinessObjects solution portfolio

Page 7: Data sevice architecture

employees, and partners, providing them with the critical information theyneed for transactions and business analysis.

Software benefitsUse SAP BusinessObjects Data Services to develop enterprise dataintegration for batch and real-time uses. With the software:

• You can create a single infrastructure for batch and real-time datamovement to enable faster and lower cost implementation.

• Your enterprise can manage data as a corporate asset independent ofany single system. Integrate data across many systems and reuse thatdata for many purposes.

• You have the option of using pre-packaged data solutions for fastdeployment and quick ROI. These solutions extract historical and dailydata from operational systems and cache this data in open relationaldatabases.

The software customizes and manages data access and uniquely combinesindustry-leading, patent-pending technologies for delivering data to analytic,supply-chain management, customer relationship management, and Webapplications.

Unification with the platform

SAP BusinessObjects Data Services provides several points of platformunification:

• Get end-to-end data lineage and impact analysis

• Create the semantic layer (universe) and manage change within the ETLdesign environment

SAP deeply integrates the entire ETL process with the business intelligenceplatform so you benefit from:

• Easy metadata management

• Simplified and unified administration

• Life cycle management

SAP BusinessObjects Data Services Getting Started Guide 7

1Overview of SAP BusinessObjects Data ServicesSoftware benefits

Page 8: Data sevice architecture

• Trusted information

Ease of use and high productivity

SAP BusinessObjects Data Services combines both batch and real-timedata movement andmanagement to provide a single data integration platformfor information management from any information source, for any informationuse.

Using the software, you can:

• Stage data in an operational datastore, data warehouse, or data mart.

• Update staged data in batch or real-time modes.

• Create a single graphical development environment for developing, testing,and deploying the entire data integration platform.

• Manage a single metadata repository to capture the relationships betweendifferent extraction and access methods and provide integrated lineageand impact analysis.

High availability and performance

The high-performance engine and proven data movement and managementcapabilities of SAP BusinessObjects Data Services include:

• Scalable, multi-instance data-movement for fast execution

• Load balancing

• Changed-data capture

• Parallel processing

Associated softwareChoose from other SAP BusinessObjects solution portfolio software optionsto further support and enhance the power of your SAP BusinessObjects DataServices software.

8 SAP BusinessObjects Data Services Getting Started Guide

Overview of SAP BusinessObjects Data Services1 Associated software

Page 9: Data sevice architecture

SAP BusinessObjects Metadata Management

SAP BusinessObjects Metadata Management provides an integrated viewof metadata and its multiple relationships for a complete Business Intelligenceproject spanning some or all of the SAP BusinessObjects solution portfolio.Use the software to:

• Viewmetadata about reports, documents, and data sources from a singlerepository.

• Analyze lineage to determine data sources of documents and reports.

• Analyze the impact of changing a source table, column, element, or fieldon existing documents and reports.

• Track different versions (changes) to each object over time.

• View operational metadata (such as the number of rows processed andCPU utilization) as historical data with a datetime.

• View metadata in different languages.

For more information on SAP BusinessObjects Metadata Management,contact your SAP sales representative.

InterfacesSAP BusinessObjects Data Services provides many types of interfacecomponents. Your version of the software may provide some or all of them.

You can use the Interface Development Kit to develop adapters that readfrom and/or write to other applications.

In addition to the interfaces listed above, the Nested Relational Data Model(NRDM) allows you to apply the full power of SQL transforms to manipulate,process, and enrich hierarchical business documents.

For a detailed list of supported environments and hardware requirements,see theSupported Platforms document available in the SAP BusinessObjectsSupport > Documentation > Supported Platforms/PARs section of the SAPService Marketplace: https://service.sap.com/bosap-support. This documentincludes specific version and patch-level requirements for databases,applications, web application servers, web browsers, and operating systems.

SAP BusinessObjects Data Services Getting Started Guide 9

1Overview of SAP BusinessObjects Data ServicesInterfaces

Page 10: Data sevice architecture

Related Topics• Designer Guide: Nested Data

10 SAP BusinessObjects Data Services Getting Started Guide

Overview of SAP BusinessObjects Data Services1 Interfaces

Page 11: Data sevice architecture

Architecture

2

Page 12: Data sevice architecture

This section describes SAP BusinessObjects Data Services componentsand their distribution on your network.

This section contains the following topics:

• Standard components

• Optional components

• Management tools

• Operating system platforms

• Distributed architecture

The architecture is layered to allow data integration to occur over a varietyof open, industry-standard APIs for optimal data and metadata management.Related Topics• Standard components• Optional components• Management tools• Operating system platforms• Distributed architecture

Standard componentsThe following diagram summarizes the relationships among SAPBusinessObjects Data Services components.

12 SAP BusinessObjects Data Services Getting Started Guide

Architecture2 Standard components

Page 13: Data sevice architecture

For a detailed list of supported environments and hardware requirements,see theSupported Platforms document available in the SAP BusinessObjectsSupport > Documentation > Supported Platforms/PARs section of the SAPService Marketplace: https://service.sap.com/bosap-support . This documentincludes specific version and patch-level requirements for databases,applications, web application servers, web browsers, and operating systems.Related Topics• Designer• Repository• Job Server• Engine• Access Server• Address Server• Administrator

SAP BusinessObjects Data Services Getting Started Guide 13

2ArchitectureStandard components

Page 14: Data sevice architecture

• Metadata Reports applications• Service• SNMP Agent• Adapter SDK

Designer

The Designer is a development tool with an easy-to-use graphical userinterface. It enables developers to define data management applications thatconsist of data mappings, transformations, and control logic.

Use the Designer to create applications containing work flows (job executiondefinitions) and data flows (data transformation definitions).

To use the Designer, create objects, then drag, drop, and configure them byselecting icons in flow diagrams, table layouts, and nested workspace pages.The objects in the Designer represent metadata. The Designer interfaceallows you to manage metadata stored in a repository. From the Designer,you can also trigger the Job Server to run your jobs for initial applicationtesting.Related Topics• Repository• Job Server

Repository

The SAP BusinessObjects Data Services repository is a set of tables thathold user-created and predefined system objects, source and target metadata,and transformation rules. Set up repositories on an open client/server platformto facilitate sharing metadata with other enterprise tools. Store each repositoryon an existing RDBMS.

Each repository is associated with one or more Job Servers which run thejobs you create. There are two types of repositories:

• A local repository is used by an application designer to store definitionsof objects (like projects, jobs, work flows, and data flows) and source/targetmetadata.

14 SAP BusinessObjects Data Services Getting Started Guide

Architecture2 Standard components

Page 15: Data sevice architecture

• A central repository is an optional component that can be used to supportmulti-user development. The central repository provides a shared objectlibrary allowing developers to check objects in and out of their localrepositories.

Job Server

The SAP BusinessObjects Data Services Job Server starts the datamovement engine that integrates data frommultiple heterogeneous sources,performs complex data transformations, and manages extractions andtransactions from ERP systems and other sources. The Job Server canmovedata in either batch or real-timemode and uses distributed query optimization,multi-threading, in-memory caching, in-memory data transformations, andparallel processing to deliver high data throughput and scalability.

While designing a job, you can run it from the Designer which tells the JobServer to run the job. The Job Server gets the job from its associatedrepository, then starts an engine to process the job. In your productionenvironment, the Job Server runs jobs triggered by a scheduler or by areal-time service managed by the Access Server. In production environments,you can balance job loads by creating a Job Server Group (multiple JobServers) which executes jobs according to overall system load.Related Topics• Engine• Access Server

Engine

When SAPBusinessObjects Data Services jobs are executed, the Job Serverstarts engine processes to perform data extraction, transformation, andmovement. The engine processes use parallel processing and in-memorydata transformations to deliver high data throughput and scalability.

SAP BusinessObjects Data Services Getting Started Guide 15

2ArchitectureStandard components

Page 16: Data sevice architecture

Access Server

The SAP BusinessObjects Data Services Access Server is a real-time,request-reply message broker that collects message requests, routes themto a real-time service, and delivers a message reply within a user-specifiedtime frame. The Access Server queues messages and sends them to thenext available real-time service across any number of computing resources.This approach provides automatic scalability because the Access Servercan initiate additional real-time services on additional computing resourcesif traffic for a given real-time service is high. You can configure multipleAccess Servers.

Address Server

The SAP BusinessObjects Data Services Address Server provides addressvalidation and correction for the Global Address Cleanse EMEA engine andGlobal Suggestion Lists. The Address Server must be started prior toprocessing data flows that contain the Global Suggestion List transform orthe Global Address Cleanse transform with the EMEA engine enabled.

Administrator

The Administrator provides browser-based administration of SAPBusinessObjects Data Services resources including:

• Scheduling, monitoring, and executing batch jobs

• Configuring, starting, and stopping real-time services

• Configuring Job Server, Access Server, and repository usage

• Configuring and managing adapters

• Managing users

• Publishing batch jobs and real-time services via Web services

16 SAP BusinessObjects Data Services Getting Started Guide

Architecture2 Standard components

Page 17: Data sevice architecture

Metadata Reports applications

The Metadata Reports applications provide browser-based analysis andreporting capabilities on metadata that is associated with:

• your SAP BusinessObjects Data Services jobs

• other SAP BusinessObjects solution portfolio applications associated withSAP BusinessObjects Data Services

Metadata Reports provide four applications for exploring your metadata:

• Impact and lineage analysis

• Operational dashboards

• Auto documentation

• Data validation

Impact and Lineage Analysis reports

Impact and Lineage Analysis reports include:

• Datastore Analysis — For each datastore connection, view overview,table, function, and hierarchy reports. SAP BusinessObjects Data Servicesusers can determine:

• What data sources populate their tables

• What target tables their tables populate

• Whether one or more of the following SAP BusinessObjects solutionportfolio reports uses data from their tables:

• Business Views

• Crystal Reports

• SAP BusinessObjects BW Universes Builder

• SAP BusinessObjects Web Intelligence documents

• SAP BusinessObjects Desktop Intelligence documents

SAP BusinessObjects Data Services Getting Started Guide 17

2ArchitectureStandard components

Page 18: Data sevice architecture

• Universe analysis — View Universe, class, and object lineage. Universeusers can determine what data sources populate their Universes andwhat reports use their Universes.

• Business View analysis — View the data sources for Business Views inthe Central Management Server (CMS). You can view business elementand business field lineage reports for each Business View. CrystalBusiness View users can determine what data sources populate theirBusiness Views and what reports use their views.

• Report analysis — View data sources for reports in the CentralManagement Server (CMS). You can view table and column lineagereports for each Crystal Report andWeb Intelligence Document managedby CMS. Report writers can determine what data sources populate theirreports.nic

• Dependency analysis — Search for specific objects in your repositoryand understand how those objects impact or are impacted by other SAPBusinessObjects Data Services or SAP BusinessObjects BW UniverseBuilder objects and reports. Metadata search results provide links backinto associated reports.

To view impact and lineage analysis for SAP BusinessObjects solutionportfolio applications, you must configure the Metadata Integrator.Related Topics• Installation Guide: Installing and Configuring the Metadata Integrator

Operational Dashboard reports

Operational dashboard reports provide graphical depictions of SAPBusinessObjects Data Services job execution statistics. This feedback allowsyou to view at a glance the status and performance of your job executionsfor one or more repositories over a given time period. You can then use thisinformation to streamline and monitor your job scheduling and managementfor maximizing overall efficiency and performance.

Auto Documentation reports

Auto documentation reports provide a convenient and comprehensive wayto create printed documentation for all of the objects you create in SAP

18 SAP BusinessObjects Data Services Getting Started Guide

Architecture2 Standard components

Page 19: Data sevice architecture

BusinessObjects Data Services. Auto documentation reports capture criticalinformation for understanding your jobs so you can see at a glance the entireETL process.

After creating a project, you can use Auto documentation reports to quicklycreate a PDF or Microsoft Word file that captures a selection of job, workflow, and/or data flow information including graphical representations andkey mapping details.

Data Validation dashboard

Data Validation dashboard reports provide graphical depictions that let youevaluate the reliability of your target data based on the validation rules youcreated in your SAPBusinessObjects Data Services batch jobs. This feedbackallows business users to quickly review, assess, and identify potentialinconsistencies or errors in source data.

Metadata Integrator

The Metadata Integrator allows SAP BusinessObjects Data Services toseamlessly share metadata with SAP BusinessObjects business intelligence(BI) solutions. Run the Metadata Integrator to collect metadata into the SAPBusinessObjects Data Services repository for Business Views and Universesused by Crystal Reports, SAP BusinessObjects Desktop Intelligencedocuments, and SAP BusinessObjects Web Intelligence documents.

SAP BusinessObjects Data Services Getting Started Guide 19

2ArchitectureStandard components

Page 20: Data sevice architecture

Service

The SAP BusinessObjects Data Services Service is installed when Job andAccess Servers are installed. The Service starts Job Servers and AccessServers when you restart your system. The Windows service name is DataServices Service. The UNIX equivalent is a daemon named AL_JobService.

20 SAP BusinessObjects Data Services Getting Started Guide

Architecture2 Standard components

Page 21: Data sevice architecture

SNMP Agent

SAP BusinessObjects Data Services error events can be communicatedusing applications supported by simple network management protocol(SNMP) for better error monitoring. Install an SAP BusinessObjects DataServices SNMP agent on any computer running a Job Server. The SNMPagent monitors and records information about the Job Servers and jobsrunning on the computer where the agent is installed. You can configurenetwork management software (NMS) applications to communicate with theSNMP agent. Thus, you can use your NMS application to monitor the statusof jobs.

Adapter SDK

The SAP BusinessObjects Data Services Adapter SDK provides a Javaplatform for rapid development of adapters to other applications andmiddleware products such as EAI systems. Adapters use industry-standardXML and Java technology to ease the learning curve. Adapters provide allnecessary styles of interaction including:

• reading, writing, and request-reply from SAP BusinessObjects DataServices to other systems

• request-reply from other systems to SAP BusinessObjects Data Services

Optional components

Multi-user

SAP BusinessObjects Data Services Multi-user is an advanced optionalcomponent that enables your development team to work together oninterdependent parts of an application through all phases of development.While each user works on applications in a unique local repository, the teamuses a central repository to store the master copy of the entire project. Thecentral repository preserves all versions of an application's objects, so youcan revert to a previous version if needed.

SAP BusinessObjects Data Services Getting Started Guide 21

2ArchitectureOptional components

Page 22: Data sevice architecture

Multi-user development includes other advanced features such as labelingand filtering to provide you with more flexibility and control in managingapplication objects.

For more details, see the Management Console: Administrator Guide andthe Advanced Development Guide.

Management toolsSAP BusinessObjects Data Services has several management tools to assistyou in managing your components.

License Manager

The License Manager displays the SAP BusinessObjects Data Servicescomponents for which you currently have a license.

Repository Manager

The Repository Manager allows you to create, upgrade, and check theversions of local and central repositories.

Server Manager

The Server Manager allows you to add, delete, or edit the properties of JobServers and Access Servers. It is automatically installed on each computeron which you install a Job Server or Access Server.

Use the Server Manager to define links between Job Servers and repositories.You can link multiple Job Servers on different machines to a single repository(for load balancing) or each Job Server to multiple repositories (with onedefault) to support individual repositories (separating test from production,for example).

You can also specify a Job Server as SNMP-enabled.

22 SAP BusinessObjects Data Services Getting Started Guide

Architecture2 Management tools

Page 23: Data sevice architecture

The Server Manager is also where you specify SMTP server settings for thesmtp_to email function..Related Topics• Designer Guide: Monitoring Jobs, SNMP support• Reference Guide: To define and enable the smtp_to function

Operating system platformsFor a detailed list of supported environments and hardware requirements,see theSupported Platforms document available in the SAP BusinessObjectsSupport > Documentation > Supported Platforms/PARs section of the SAPService Marketplace: https://service.sap.com/bosap-support . This documentincludes specific version and patch-level requirements for databases,applications, web application servers, web browsers, and operating systems.

Distributed architectureSAPBusinessObjects Data Services has a distributed architecture. An AccessServer can serve multiple Job Servers and repositories. The multi-userlicensed extension allowsmultiple Designers to work from a central repository.The following diagram illustrates both of these features.

SAP BusinessObjects Data Services Getting Started Guide 23

2ArchitectureOperating system platforms

Page 24: Data sevice architecture

You can distribute software components across multiple computers, subjectto the following rules:

• Engine processes run on the same computer as the Job Server thatspawns them

• Adapters require a local Job Server

Distribute components across a number of computers to best support thetraffic and connectivity requirements of your network. You can create aminimally distributed system, designed for developing and testing or a highlydistributed system designed to scale with the demands of a productionenvironment.

24 SAP BusinessObjects Data Services Getting Started Guide

Architecture2 Distributed architecture

Page 25: Data sevice architecture

Host names and port numbers

Communication between a Web application, the Access Server, the JobServer, and real-time services occurs through TCP/IP connections specifiedby IP addresses (or host names) and port numbers.

If your network does not use static addresses, use the name of the computeras the host name. If connecting to a computer that uses a static IP address,use that number as the host name for Access Server and Job Serverconfigurations.

To allow for a highly scalable system, each component maintains its own listof connections. You define these connections through the Server Manager,the Administrator, Repository Manager, and the Message Client library calls(from Web client).Related Topics• Installation Guide: Preparing to Install the software, Check port assignments

SAP BusinessObjects Data Services Getting Started Guide 25

2ArchitectureDistributed architecture

Page 26: Data sevice architecture

26 SAP BusinessObjects Data Services Getting Started Guide

Architecture2 Distributed architecture

Page 27: Data sevice architecture

Glossary

A

Page 28: Data sevice architecture

ABAPAdvanced Business Application Programming. A fourth-generationprogramming language developed by SAP in which SAPApplicationsare written.

ABAP data flowA data flow that extracts data from an SAP Applications source table.Data Services translates steps you define in an ABAP data flow intoABAP and then passes the ABAP program back to your SAPApplication system for execution. The resulting table or file resideson the SAP Application system to be used as a source in the parentdata flow.

ABAP programA program that executes database operations on an SAPApplications server. Data Services ABAP data flows generate ABAPprograms.

Access ServerThe Access Server dispatches requests to real-time services,ensuring optimal load balancing and complete life cyclemanagement.

AdapterAn external Data Services interface. There are two types of adapters:• Custom adapters — Adapters developed using the Adapter SDK

(Software Development Kit)• Prepackaged adapters — Adapters prebuilt and purchased from

SAP, such as the Data Services Salesforce.com adapter

Address CleanseTransforms that produce a correct and complete standardized formof an input address. The transform can also assign codes for postalautomation and append other useful address information.

address lineA line of data in an address that contains the primary and, possibly,secondary address. The primary address contains components suchas the primary range, primary name, directionals (post- and pre-),and the suffix. The secondary address normally contains componentssuch as the unit designator and the secondary range.

Address Server

28 SAP BusinessObjects Data Services Getting Started Guide

GlossaryA

Page 29: Data sevice architecture

A process that provides address validation and correction for theGlobal Address Cleanse transform's EMEA engine and GlobalSuggestion Lists transform.

AdministratorA browser-based system administration application on the DataServices Management Console. Use the Administrator to do thefollowing:• Execute, schedule and monitor batch jobs• Add connections to repositories• Configure the profiler• Define users for multi-user development (central repository)• Manage the retention of logs files• Monitor Access Server status and inbound/outbound messages• Configure Adapter instances (a prerequisite for creating adapter

datastores)• Configure SAP application client interfaces (to read IDocs)• Configure, start, stop and monitor real-time services• Configure Data Services jobs callable as webservices and

generate WSDL• Set up the SAP RFC Server (to load data into or read data from

an SAPNetWeaver BW system).

after-imageThe values in an UPDATE row after the row changes. You usebefore- and after-images of UPDATE rows for log-basedchanged-data capture (CDC) jobs which Data Services supports.

aggregate functionA function that summarizes data (sums, calculates an average,identifies a maximum value, and so on). Where possible, DataServices pushes down the execution of the aggregate function tothe underlying Relational Database Management System).

aggregated dataData that results when a process combines elements. This data canbe presented collectively or in summary form.

ALE (Application Link Enabling)An SAP Applications programming-related interface designed toallow reliable communication across a distributed environment.Implemented in Data Services with the iDoc interface.

SAP BusinessObjects Data Services Getting Started Guide 29

AGlossary

Page 30: Data sevice architecture

aliasAlternate form or name. Data Services uses aliases in multiple ways,including the following:• Aliases are alternate forms that could potentially be matched to

the word. For example, Robert is a personal name alias for Bob.Alias data is output in the Match_Std fields.

• In the Address Cleanse transforms, an alias is an alternative formof a primary address line. Aliases apply only to primary addresses(usually streets), not secondary addresses or last lines.

• You can also create multiple aliases for table owners in adatastore and then use datastore configurations to change thealias values. By using aliases instead of real owner names, youlimit the amount of time it takes to port jobs to differentenvironments.

AMASAustralia Post’s Address Matching Approval System (AMAS). Toreceive postal discounts in Australia, you are required to file anAMAS report.

applicationAnother term for a software program.

association matching

Amethod of matching that combines the results of two or more Matchtransforms by using the Associate transform. Association matchingis used to find duplicates based multiple different match criteria (forexample based on Name+Address and then SSN+DOB) and bringthem together.

A common use for association matching is to identify customers whohavemultiple residences. Examples of such customers could includestudents and snowbirds.

attributeA property created for a type of object.

BAPIBusiness Application Programming Interface. A standardized SAPApplications programming interface that allows non-SAP applicationsto access specific business processes and data.

Basis

30 SAP BusinessObjects Data Services Getting Started Guide

GlossaryA

Page 31: Data sevice architecture

The SAP infrastructure. Basis is the foundation for all SAP productsbased on ABAP.

batchExecutes one job or a series of jobs all at one time. After batchprocessing begins, it continues until it is done or until an error occurs.

batch jobThe unit of work that can be scheduled independently for executionby the Administrator. Jobs are special work flows that can bescheduled for execution, but cannot be called by other work flowsor jobs.

before-imageThe values in an UPDATE row before the row changes. You usebefore- and after-images of UPDATE rows for log-basedchanged-data capture (CDC) jobs which Data Services supports.

best recordContains the most complete, accurate, and up-to-date information.A best record is created by consolidating data elements frommatching records into a single record. For example, suppose youfound two records that match. One record has a phone number thatis different and more current than the other. You can move the morecurrent phone number into the other record to create your bestrecord.

A master record in a match group is also considered a “best” record,based on the best record priority assigned to the source that therecord was in.

best record priorityBest record priority is a way for you to designate data from aparticular source as having more importance than other data. Forexample, because your data warehouse meets your standards fordata, it might carry more weight in the matching process than woulda rented source.

The smaller the priority number, the higher the priority, and the morelikely that records from that source will rise to the top of their matchgroups to become master records. Assign a priority of 0 to your bestsource, and larger numbers to other sources.

The blank penalty can affect the value of the best record priority.

blank penalty

SAP BusinessObjects Data Services Getting Started Guide 31

AGlossary

Page 32: Data sevice architecture

In the Match transform, tells Data Services that records with blankfields should be considered less important (as driver or as Masterrecord) than records with completed fields (blank data = bad data).Blank penalties increase the value of the best record priority for thesource that the blank field exists in, thereby reducing the priority ofthe source. Lowering the priority of a source helps ensure that therecords in that source will not become the master record (or “best”record) of a match group.

BLOBA field whose data consists of Binary Large Objects—such as bitmapgraphics, images, OLE objects, metafiles, and so on.

blueprintA sample Data Quality job that can be used by Data Services withoutmodification.

Boolean expressionAn expression that defines a logical relationship between two ormore items. The expression is either TRUE or FALSE.

breadcrumbA visual path of your location in the application.

break groupPlaces records into groups that are more likely to match. Forexample, you might want to create a break group based on the firstthree digits of the postcode. This break group will ensure that recordswith a postcode of 546 are never even compared with records thathave a postcode of 611, saving valuable processing time for all butthe smallest jobs.

Break groups consist of driver and passenger records. Fieldscommonly used for creating break groups are postcodes, accountor Social Security numbers, or the first two positions of a street name.

break keyA user-defined field that is used to create break groups. Create abreak key if the data you want to break on is contained in multiplefields, such as the postcode and street name.

bulk loadingA software-basedmechanism that moves large amounts of data intoa database to achieve optimal performance. Bulk loading is faster

32 SAP BusinessObjects Data Services Getting Started Guide

GlossaryA

Page 33: Data sevice architecture

than traditional INSERT statements. This mechanism supportscompression, blocking, and buffering to optimize transfer times.

business componentA set of tables Siebel applications use to create a logical objectcalled a business object.

business rules1. Settings within your Data Quality transforms that explain how you

want to process your data. These include things like telling theGlobal Address Cleanse transform how to case output data, orsetting up match criteria for a matching process.

2. Business rules can also be used to group validation rules fromValidation transforms for display in the Data Validation reportsin the Management Console.

Business viewsBusiness views in Crystal Reports enable you to control thepresentation of your database to report designers and users.

case-sensitivePertaining to the differentiation between upper-case and lower-caseletters. A case-sensitive program differentiates between upper-caseand lower-case letters when evaluating a text string.

CASSA United States Postal Service (USPS) certification that requiressoftware vendors to go through a series of tests to prove that theirsoftware correctly codes addresses according to USPS requirements,and produces the required USPS reports. Long form: CodingAccuracy Support System

CDC checkpointACDC checkpoint enables Data Services to restrict CDC subscriptionreads. After you enable a checkpoint, the next time the CDC jobruns, it reads only the rows inserted into the CDC table since thelast checkpoint.

CDC datastoreA CDC datastore allows you to limit extracted data to changed dataonly. A CDC datastore connects a changed-data capture table on asource database to Data Services.

CDC subscription

SAP BusinessObjects Data Services Getting Started Guide 33

AGlossary

Page 34: Data sevice architecture

A CDC subscription is an option on a source CDC table. You candefine multiple subscriptions on the same CDC table to allow differentdata flows to extract data from the same table without corruptingdata extracted by other data flows. A subscription defines the startand end of your data set, and it is often used with the check-pointoption.

changed-data capture (CDC)The process of retrieving changes made to a production data source.This process consolidates units of work, ensures data is synchronizedwith the original source, and reduces load times by loading onlychanged data in a warehouse environment.

Citrix MetaFrame XPCitrix MetaFrame XP software provides an access infrastructure forenterprise applications. You can use this software to run DataServices on a server which publishes instances of the Designer andother Data Services interfaces to users on client computers.

classificationsIndicators to Data Cleanse of the types of situations that apply tothis word. For example, Hewlett is assigned the Firm_Name andName_Weak_Family_Name classifications, because it can be usedin both firm and personal names.

client/serverA distributed technology approach where the processing is dividedby function. The server performs shared functions (such asmanagingcommunications and providing database services), while the clientperforms individual user functions.

commandA directive given to a program to initiate an action.

Communication StructureIn SAP NetWeaver BW, a data structure that defines a set ofInfoObjects available from an InfoSource to put into InfoCubes.

compare bufferA part of memory reserved for processing break groups (one breakgroup at a time) in the Match or Associate transform. A larger buffertypically helps improve performance.

conditional

34 SAP BusinessObjects Data Services Getting Started Guide

GlossaryA

Page 35: Data sevice architecture

A single-use object, available in work flows, that allows you to branchthe execution logic based on the results of an expression. Theconditional takes the form of an if/then/else statement.

constantA data string that does not change from one record to the next.

content typeSpecifies the type of data in a field in your data source. This helpsyou map your fields when you set up downstream transforms.

contribution valueA value you assign to a match criteria that represents the importance(or weight) you place on that criteria’s data. For example, yourorganization may place a high degree of importance on the customernumber. For these types of criteria you would assign a highercontribution value to reflect a higher importance.

The contribution value is part of weighted scoring.

Crystal ReportsA reporting tool that allows users to create feature-rich reports andintegrate them into web and Windows applications.

Ctrl-clickAn action to select multiple values within an application. Thisaccomplished by pressing the Control key and using the mouse.

cube1. A multi-dimensional or OLAP database in which data is

summarized, consolidated, and stored in "dimensions" (eachrepresenting information such as customer or product line) and"measures" (for example sales, cost, or profit), enabling improvedprocessing time and storage space requirements over traditionaldata storage methods such as relational databases.

2. The combination of indexes (dimensions and measures) storedin SAP NetWeaver BW Accelerator.

custom ABAP programA custom ABAP program runs an ABAP program and generates adata set. With a custom ABAP program, you can run an existingABAP program as part of a job. Use a custom ABAP program as asource in a data flow or an ABAP data flow.

custom adapter

SAP BusinessObjects Data Services Getting Started Guide 35

AGlossary

Page 36: Data sevice architecture

An adapter developed using the Data Services Adapter DevelopmentKit.

custom functionA script you create to evaluate or make calculations on input valuesand produce a return value.

Data CleanseA transform that identifies and isolates specific parts of mixed data,and then standardizes the data based on information stored in theparsing dictionary, business rules defined in the rule file, andexpressions defined in the pattern file.

data extractionThe process of moving data from a database or application sourceto a database target (either from a legacy database to a data mart,or from one data mart to another).

data flowA reusable object containing steps to define the transformation ofdata from source to target. Data flows are called from inside a workflow or job. You can pass information into or out of data flows usingparameters.

data loadingThe process of populating a data warehouse. Data loading isprovided by DBMS-specific load processes, DBMS insert processes,and independent fast-load processes.

data mappingThe process of assigning a source data element to a target dataelement.

data martA highly-focused version of a data warehouse. Typically, created bya department or division of a company, data marts contain data fora specific subject area, such as finance or sales. Data Services canpopulate a data mart.

data movementThe aspect of the data integration process that includes extraction,data transformation, and loading (ETL). That which the applicationaccomplishes as a whole. Do not confuse with data transformation,which is what happens within one phase of a data flow.

data record

36 SAP BusinessObjects Data Services Getting Started Guide

GlossaryA

Page 37: Data sevice architecture

A row of data that is constructed at runtime. The data remains in theform of the data record throughout the Data Services job.

data salvageThe process of temporarily copying data from a passenger recordto the driver record after the two records are compared. The datathat’s copied is data that is found in the passenger record, but ismissing or incomplete (initials, for example) in the driver record. Datasalvaging prevents blank matching or initials matching frommatchingrecords that you may not want to match.

Data ServicesA software system that allows users to build and execute applicationswith which they can create and maintain data warehouses.

Data Services consists of several components:

Data Services engineThe core process that reads job information from the Data Servicesrepository and sets up run-time processes that execute the job. Therun-time processes extract, transform, and load relational andhierarchical data. The Job Server starts the Data Services engineto execute batch or real-time jobs.

Data Services interfaceA program that Data Services uses to access data sources. Specificinterfaces vary by installation. There are internal interfaces (thosenative to the installation) and external interfaces (those that youinstall separately). Internal interfaces allow Data Services to accessapplications like SAP Applications and SAP NetWeaver BW,messages, relational database systems, and legacy systems. Anexternal interface is also known as an adapter. It allows DataServices to access applications using information exchangetechnologies such as JMS (Java Messaging Services) orSalesforce.com.

Data Services repositoryThe database that contains information about a Data Servicesapplication. The repository contains information about definedreusable objects, the metadata for sources and targets, transformsand functions. The repository also contains the job history andruntime statistics information. When you invoke Data Services, youlog in to the repository containing the objects you want to use. Youcan use a local repository or a central (shared) repository.

SAP BusinessObjects Data Services Getting Started Guide 37

AGlossary

Page 38: Data sevice architecture

The Data Services profiler uses a profiler repository to store profilingdata. The Cleansing Packages repository stores reference data forthe data cleansing transform.

All repositories are created and maintained with the RepositoryManager.

Data Services serviceThe process that ensures that the Access Server and the Job Serverare running. You can configure the Data Services service to restartthe Access Server and Job Server whenever the computer wherethey are located restarts.

data setRows of data with a defined schema. A step in a data flow—suchas reading data from a source, joining data in a Query transform, ortransforming data though another transform—yields a data set. Youcan view individual data sets by placing a target table or file at thatpoint in the data flow.

data source name (DSN)Provides connectivity for a Windows user to a database through anOpen Database Connectivity (ODBC) driver. The DSNmay contain:database name, directory, database driver, user ID, password, andother information.

data transformationThe phase of the data movement process that occurs betweenextraction and loading. Do not confuse with data movement, whichis what the data flow accomplishes as a whole. Data transformationdescribes a process, while a transform is a tool (a step, icon, orobject) in Data Services that enacts the transformation (such asquery, merge, or data cleanse).

data transportA step in an ABAP data flow that defines a target to store the dataset extracted during the flow. You can locate the target file on theSAP Application server or in a location accessible to both the SAPApplication server and to Data Services across a network.

data typeThe format used to store a value. Data types can imply a defaultformat for displaying and entering the value. Data read from a sourceis converted to the appropriate Data Services data types; data loaded

38 SAP BusinessObjects Data Services Getting Started Guide

GlossaryA

Page 39: Data sevice architecture

to a target is converted from its Data Services data type to the typeappropriate for the target.

data validationDefining rules to which correct data should conform. In Data Services,you define these rules in the Validation transform. You can separatedata that passes the validation rules from failed data.

Data Validation dashboardA category of graphical reports in the Management Console toevaluate the reliability of your target data based on the validationrules you created in your Data Services batch jobs. This feedbackallows business users to quickly review, assess, and identify potentialinconsistencies or errors in source data.

data warehouseA Data Warehouse houses a standardized, consistent, clean andintegrated form of data sourced from various operational systemsin use in the organization, structured in a way to specifically addressthe reporting and analytic requirements. Data Services can populatea data warehouse.

databaseA collection of tables managed by a DBMS such as Microsoft SQLServer or Oracle.

database linkCommunication path from one database server to another. Thedatastores in a database link relationship are called linked datastores.Data Services uses linked datastores to enhance its performanceby pushing down operations to a target database using a targetdatastore.

DataConnectorDataConnector operator instances are used to read data filesgenerated by Data Services when performing bulk loading using theTeradata Warehouse Builder.

datastoreA logical channel connecting Data Services to a source or targetapplication. Different datastore types include database, application,web service, and adapters. The datastore definition typically includesthe name and location of the database as well as user authenticationinformation. Data Services uses a datastore definition to qualify a

SAP BusinessObjects Data Services Getting Started Guide 39

AGlossary

Page 40: Data sevice architecture

table name wherever a table is indicated in a diagram or expression.You can access the datastore definition through the object library.

datastore configurationDefines a connection to a particular database from a single datastore.

DBMS (database management system)A software system that builds and maintains database tables.

debug modeAllows you to diagnose errors while executing a job using theinteractive debugger features in the Designer.

degree of parallelism (DOP)

A property of a data flow that defines howmany times each transformdefined in the data flow replicates for use on a parallel subset ofdata.

For example, if you set the Degree of parallelism to 4, then whenthe job executes, Data Services replicates each transform in thedata flow four times. Each of these replicated transforms executesin parallel using a separate thread. The operating system willdistribute the threads among the available CPUs.

delimited flat fileA data file in which each column value is separated by a delimiter,such as a comma, semicolon, tab, space, and so on. Each row startsa new line.

delimiterData Services has three types of delimiters: column, row, and text(character string). To separate columns, a delimiter can be a tab,semicolon, comma, space, or any character sequence. To separaterows of data, a delimiter can be a {new line} or any other charactersequence. To denote the start and end of a character string, adelimiter can be single quotation marks ('), double quotation marks("), or {none}.

delivery point codeA two-digit number derived from the primary range (house number).This number is used in the generation of a DPBC barcode.

Delivery Point Validation (DPV)A technology that assists you in validating the accuracy of youraddress information with the USA Regulatory Address Cleanse

40 SAP BusinessObjects Data Services Getting Started Guide

GlossaryA

Page 41: Data sevice architecture

transform. With DPV, you can identify addresses that areundeliverable as addressed and determine whether or not an addressis a Commercial Mail Receiving Agency (CMRA).

DesignerA graphical user interface that allows you to design and test DataServices jobs.

destination record

A location where you place your updated or “best” data when creatinga best record. A destination record can be either a master record, asubordinate record, or both in a match group.

diacritical characterA character that contains an accent, dieresis (umlaut), tilde, cedilla,or other distinguishing marks (for example, ä or Ç). You can chooseto have standardized data with these types of characters. Theapplication uses the Latin-1 code page for assigning these accents.

diagramThe icons and connections between the icons that make up thedefinition of a job, work flow, or data flow. Diagrams appear in theDesigner workspace.

dictionaryRelational database that contains a lexicon of words and phrasesthat the data cleansing packages and the Data Cleanse transformuse to identify, parse, and standardize data.

directionalA component of the address line that indicates direction. Forexample, North in “211 N. 115th St.”

discrete fieldInput or output data that has separate fields for each piece ofinformation, such as addresses and names.

discrete formatInput source format in which pieces of data are parsed down tonearly the most distinct level. For example, a “first name” field wouldbe discrete, whereas a “name” field that could contain first, middle,or last name information would not be discrete.

domain value

SAP BusinessObjects Data Services Getting Started Guide 41

AGlossary

Page 42: Data sevice architecture

In PeopleSoft, the category name (or link) between a value and itsdescription.

downstreamA data flow object, such as a transform, that is placed after anotherdata flow object in a job.

DPBC (Delivery Point Barcode)A form of Postnet barcode, consisting of 62 bars and based on thecombination of ZIP Code, ZIP+4, DPBC, and a check digit.

drill downA method of exploring detailed data that was used in creating asummary level of data. Drill-down levels depend on the granularityof the data in the data warehouse.

driver record

A record that drives the comparison process. Driver records are partof a break group and are compared with passenger records todetermine matches.

Driver records are chosen based on the driver order you assign toa source. (In general, a source with your best data should be usedfirst.) After a driver record has been compared with all of thepassenger records, the next passenger record in the break groupbecomes the driver record.

If you do not reorder your break groups using Group Prioritization,the driver record is the first record in the break group.

DTDDocument type definition. A text file that describes the elements(tags) in an XML document and the relationship among them. Whenan XML document is used to describe a transaction, the DTDdescribes the data schema used in the transaction.

dual address

A dual address occurs when a record contains two address lines.Two combinations are typical:• PO box and street address:

1000 Main Street, Suite 51PO Box 2342

42 SAP BusinessObjects Data Services Getting Started Guide

GlossaryA

Page 43: Data sevice architecture

• Rural route or Highway Contract and street address:

RR 1 Box 34512784 Old Columbus Road

dual namesTwo names included on an address line, for example, John and JaneDoe.

Early Warning System (EWS)A solution for matching valid delivery points that have been createdbetween updates to the national ZIP+4 directory. EWS uses fourmonths of rolling data found in an intermediate directory that isupdated weekly with data from the USPS.

EDIElectronic Data Interchange. Electronic exchange of structured databetween businesses. This exchange is not dependent on hardware,software, or communication protocols.

elementA component found within XML Schemas and DTDs.

eLOT

Enhanced Line of Travel (eLOT) takes Line of Travel one step furtherin the presorting process. The original line of travel (LOT) narroweddown the mail carrier’s delivery route to the block face level (ZIP+4level) by discerning whether an address resided on the odd or evenside of a street or thoroughfare.

eLOT narrows the mail carrier’s delivery route walk sequence to thehouse (delivery point) level. This allows you to sort your mailings toa more precise level.

embedded data flowA data flow with an open begin or an open end point that can beused inside another dataflow. An embedded dataflow can be acombination of sources or targets and transforms, and is mainlyused to reduce the visual complexity of a diagram in a dataflow. Anembedded dataflow can be re-used in multiple other dataflows.

Enterprise applicationEnterprise applications enable enterprises to execute and optimizebusiness and IT strategies in domains like ERP (Enterprise Resource

SAP BusinessObjects Data Services Getting Started Guide 43

AGlossary

Page 44: Data sevice architecture

Planning), CRM (Customer Relationship Management) or SCM(Supply Chain Management). Enterprise applications usually storedata in a relational database optimized for operational use. SAPprovides these solutions through the SAP Business Suite. DataServices supports both SAP's own solution as well as third-partysolutions like Oracle e-Business Suite, Siebel, JD Edwards orPeopleSoft.

ERP system (Enterprise resource planning system). An enterprise application from which Data Services can extractdata. SAP offers this system as a solution part of the SAP BusinessSuite.

exceptionAn error that occurs while executing a job. You can catch individualor groups of exceptions using a try/catch block inside a work flow.Catching an exception allows you to automatically execute a solutionfor the error.

expressionA combination of variables, parameters, constants, and functionslinked by operation symbols and any required punctuation thatdescribe a rule for calculating a value. Expressions are used inconditionals, functions, scripts, transforms, and while conditions toroute information and change fields.

extract dateThe date that data was extracted.

extract frequencyThe interval at which data is extracted, such as daily, weekly,monthly, or quarterly. The frequency that data extracts are neededin the data warehouse is determined by the shortest frequencyrequested through an order, or by the frequency required to maintainconsistency of the other associated data types in the source datawarehouse.

fault code

A numeric value that is assigned to a record after the USARegulatoryAddress Cleanse transform validation process that signifies that theparticular record was not successfully validated. Each numeric valuerepresent a different type of fault.

file format

44 SAP BusinessObjects Data Services Getting Started Guide

GlossaryA

Page 45: Data sevice architecture

A description of how data is or should be organized in a file DataServices reads from or loads to. A file format can be specific to asingle file or generic for many files.

filterAn expression that limits the data returned.

fixed-width flat fileA data file in which each column of data is the same width.

flat fileA flat file is a file containing records, generally one record per line.Fields may have a fixed width with padding, or be delimited by tabs,commas (CSV), or other characters. There are no structuralrelationships. The data is “flat” like a sheet of paper, rather than tomore complex models such as a relational database.

FSA (Forward Sortation Area)The first three characters of a Canadian alphanumeric postal code.For example, K1A in the postal code for Canada Post’s Ottawaheadquarters, K1A 0B1.

functionA program that operates on values that are passed to it. DataServices functions are available through a function wizard in a script,conditional, or Query transform. Data Services also gives you accessto functions provided by the DBMS you are using. In addition, youcan define your own functions using the Data Services scriptinglanguage.

gatheringRecombines terms that belong together, such as alphanumeric termsthat you would look up together in the dictionary. For example, ifData Cleanse breaks 1st into "1" and "st", then gathering recombinesthem to 1st.

genderA code that indicates the likelihood of a record being a certaingender. This code is derived from the name and has five possiblevalues: strong male, strong female, weak male, weak female,ambiguous, and unassigned. For example, a record marked as“strong male” indicates a high likelihood that the person is male.

generated field

SAP BusinessObjects Data Services Getting Started Guide 45

AGlossary

Page 46: Data sevice architecture

A field that is generated on output by a transform. For example, apostcode field generated by the Global Address Cleanse transform.

GeoCensusA directory that contains latitude, longitude, census tract, and blockinformation. That information sets the stage for mapping,demographic marketing, and other applications of your address data.

global suggestion lists

Global suggestion lists offer a way to complete and populateaddresses with minimal data, or it can offer suggestions for possiblematches. This address-entry system is ideal in call centerenvironments or any transactional environment where data cleansingis necessary at the point of entry. It's also a research tool to managebad addresses from a previous batch process.

Global suggestion lists are available with the Global Suggestion Liststransform.

highest level objectThe object that is not a dependent of any object in the objecthierarchy.

host nameThe computer’s network name (or IP address). Used most often inData Services to specify a computer where the Web application, theAccess Server, the Job Server, and real-time services reside.

hybrid formatA format for records in which some fields are discrete, whereasothers are in a multiline format.

IDocIntermediate Document. An SAP-specific format. Used for EDI(Electronic Data Interchange) and ALE (Application Link Enabling).

IDoc typeIndicates the SAP format that is used to interpret the data of abusiness transaction. Consists of the following components:• A control record: Identical for each IDoc type.• Several data records: A single data record consists of a fixed key

part and a variable data part. The data part is interpreted usingsegments, which differ depending on the IDoc type selected.

46 SAP BusinessObjects Data Services Getting Started Guide

GlossaryA

Page 47: Data sevice architecture

• Several status records: Identical for each IDoc type. Describethe status states an IDoc has already passed through or thestatus an IDoc has attained.

impact and lineage analysisThe category of reports on the Management Console that showsthe relationship between source and target tables on Data Services,and with SAP BusinessObjects Enterprise objects such as universes,business views, and reports.

import

The process of acquiring information for the Data Services repository.Import the following kinds of information into Data Services:• The metadata for source and target databases• Descriptions and code for user-defined and DBMS functions and

transforms• ATL or XML files with definitions of Data Services objects that

were previously exported out of a another Data Servicesrepository.

InfoAreaIn SAP NetWeaver BW, an element for grouping meta-objects in theBW system. Each InfoProvider is assigned an InfoArea. The resultinghierarchy is displayed in the Data Warehousing Workbench.

In addition to their properties as an InfoProviders, InfoObjects canalso be assigned to different InfoAreas.

InfoCubeIn SAP NetWeaver BW, a type of InfoProvider.

An InfoCube describes a self-contained dataset (from the reportingview), for example, for a business-oriented area. This dataset canbe evaluated with the BEx query.

An InfoCube is a set of relational tables that are created inaccordance with the star schema: a large fact table in the center,with several dimension tables surrounding it.

InfoObjectIn SAP NetWeaver BW, Business evaluation objects (for example,customers or sales) are called InfoObjects.

SAP BusinessObjects Data Services Getting Started Guide 47

AGlossary

Page 48: Data sevice architecture

InfoObjects are subdivided into characteristics, key figures, units,time characteristics, and technical characteristics (such as requestnumbers).

InfoPackageIn SAPNetWeaver BW, describes which data in a DataSource shouldbe requested from a source system. The data can be preciselyselected using selection parameters (for example, only controllingarea 001 in period 10.1997).

An InfoPackage can request the following types of data• Transaction data• Attributes for master data• Hierarchies for master data• Master data texts

InfoPackages are also used to start Data Services jobs to load datainto SAP NetWeaver BW.

InfoSource

In SAP NetWeaver BW, a structure that consists of InfoObjects andis used as a non-persistent store to connect two transformations.

input fieldsOriginal fields in your input sources.

interactive debuggerA Designer feature that allows you to step through the data of a jobone row at a time using filters and breakpoints on a line. Likeexecuting a job, you can start the interactive debugger from theDebug menu when a job is active in the workspace. While in debugmode, all other Designer features are set to read-only.

interfaceData Services offers two types of interfaces:

An internal Data Services interface allows you to create datastoreconnections to natively supported applications.

An external Data Services interface (or adapter) allows Data Servicesto communicate with information exchange technologies such asthe Salesforce.com adapter.

intersource match

48 SAP BusinessObjects Data Services Getting Started Guide

GlossaryA

Page 49: Data sevice architecture

Match between records of different sources.

intrasource matchMatch between records within a source.

JDBCA Java API developed by SunMicrosystems that acts as an interfacebetween a developer’s Java code and a database. It provides amechanism for the developer to connect to a specified database,request information about the database, and then select informationfrom it. Long form: Java Database Connectivity

jobThe unit of work that can be scheduled independently for executionby the Administrator. Jobs are special work flows that can bescheduled for execution, but cannot be called by other work flowsor jobs.

Job ServerA process that receives requests from the Designer and theAdministrator to start and stop jobs. To start batch or real-time jobs,the Job Server triggers the Data Services engine. Engine processesrun on the same computer as the Job Server process that triggersthem.

join rankA value given to or calculated for all data sets in a data flow. DataServices uses the join rank to determine which source to read firstwhen assembling the data set in a join. Data Services uses thesource with the lower join rank as the inner source of the join anduses the source with the higher join rank as the outer source of thejoin.

key

A value used to identify a record in a database.

key figureIn SAP NetWeaver BW, an InfoObject that represents a numericfact.

lastlineThe lastline of an address contains components such as the locality,region, and postcode (and it may contain the country name).

license-controlled feature

SAP BusinessObjects Data Services Getting Started Guide 49

AGlossary

Page 50: Data sevice architecture

A Data Services feature that is enabled or disabled based on theproduct license. The product license controls which icons and settingsare available in Data Services as an internal Data Services interface.

line of travel (LOT)A sorting sequence in which ZIP+4 codes are arranged in the orderthat they are served by the mail carrier. LOT sequencing is requiredfor some bulk mailing discounts.

linked datastoresThe datastores in a database link relationship. A database link storesinformation about how to connect to a remote data source, such asits host name, database name, user name, password, and databasetype. Data Services uses linked datastores to enhance itsperformance by pushing down operations to a target database usinga target datastore.

Local Delivery Unit (LDU)The last three characters of a Canadian alphanumeric postal code.For example, 0B1 in the postal code for Canada Post’s Ottawaheadquarters, K1A 0B1.

localeA set of parameters that define the user's language, country andany special variant preferences that the user wants to see in theiruser interface. A locale identifier consists of a codepage, a languageidentifier and a region identifier.

localityA part of the address line of a record. Locality most often refers tothe city or town. In some countries, such as the United Kingdom,locality can extend to include district.

Locatable Address Conversion System (LACS)A database of addresses that have been permanently converted,usually due to 911 emergency system implementation. The changesoften consist of conversion from rural-style addressing tostandardized, city-style addressing, or renumbering of existingcity-style addresses.

lookup tableContains data that other tables can reference with lookup functionsthat return one or more output columns.

mail piece unit

50 SAP BusinessObjects Data Services Getting Started Guide

GlossaryA

Page 51: Data sevice architecture

Typically referred to as a version identifier for printers, it representsthe unique characteristics of a portion of a mailing. Every segmentwithin a Mail.dat must have at least one mail piece unit.

mapped fieldA field in a specific transform, for which it has been defined whichfield it should read from upstream transforms.

master recordThe first record in a match group. You can control which record isthe master record by using the Group Prioritization operation in theMatch transform.

match criteriaA group of options that determine the rules for matching on particulardata.

match groupA group of records found to be matching with each other. A matchgroup consists of a master record and subordinate records.

match level

A Match level designates the level in "hierarchically" type matching.One Match set can have one or more match levels. Duplicates thatare found at one level are passed to the next level, where they arecompared based on that level’s keys, and so on. For example, youcould use multiple match levels if you wanted to detect duplicatesat the household (residence), family, and individual level.

The order of the match levels is important because duplicates arefound at each level, and only the results are made available for thenext level. Usually, you will define your “broadest” match levels first,followed by more specific match levels.

match set

A group of criteria used to perform matching on your data.

A typical setup might have only select data reaching each match setfor comparison. For example, you might want to exclude blank SSNs(Social Security Numbers), certain foreign addresses, and so onfrom reaching a particular match set. A match set also allows formultiple match sets to be considered for association in a combinedmatch set.

matching record

SAP BusinessObjects Data Services Getting Started Guide 51

AGlossary

Page 52: Data sevice architecture

A group of records found to be matches based on the criteria andbusiness rules you choose. The records do not necessarily have thesame data.

memory datastoreA datastore connection/container for memory tables.

memory tableInternal Data Services table used to store a data set in memory whilea job runs. Use instead of staging tables to improve performance ofa real-time job built with multiple data flows. Use a memory table tomove a data set between data flows.

messageRepresents hierarchical data (such as a header with line items) fordocument-oriented transactions (such as a purchase order).

metadataIn Data Services, information acquired and maintained to describetables in source and target databases. This information includes thenames of tables and their columns, and the data types of thecolumns.

In general, metadata typically includes a description of data models,a description of the layouts used in database design, the definitionof the system of record, the mapping of data from the system ofrecord to other places in the environment, and specific databasedesign definitions.

multi-sourceRecords that appear on two or more sources. For example, let’s sayyou’re bringing together customer sources from several directmarketers or publishers. Your best prospects may be the peoplewhose names appear on two or more sources, indicating they maybe most receptive to your offer.

multilineThe multiline format is a database record format in which addressdata is not consistently located in the same arrangement in allrecords. That is, data items “float” among fields. For example, aninput source may have fields named Line1, Line2, Line3, and Line4that contain various categories of name and address data, as wellas non-address data.

nested data

52 SAP BusinessObjects Data Services Getting Started Guide

GlossaryA

Page 53: Data sevice architecture

Data in one table that is related to a single row of another table. Anested table appears in Data Services as a column in a parent table.Columns in the nested table can themselves contain tables.

normal sourceA source of records that the application should consider to be good,eligible records in a matching or association process.

North American Numbering Plan (NANP)

Telephone numbering plan shared by 19 North American countries.These countries include the United States and territories, Canada,Bermuda, Anguilla, Antigua & Barbuda, the Bahamas, Barbados,the British Virgin Islands, the Cayman Islands, Dominica, theDominican Republic, Grenada, Jamaica, Montserrat, St. Kitts andNevis, St. Lucia, St. Vincent and the Grenadines, Trinidad andTobago, and Turks & Caicos.

nullThe absence of a value within a database field for a given record. Itdoes not mean zero because zero is a value.

objectAny item that you create in the Designer. Data Services distinguishestwo classes of objects: reusable objects that are complete and canbe reused in your projects (such as data flows) and single-use objectsthat only appear as components of other objects (such as a try/catchblock). This distinction affects how you create and retrieve each typeof object.

object definitionThe options that describe the operation of an object. To view andmodify an object definition, open the object so that its definitionappears in the workspace.

object dependentAn object associated beneath the highest level object in thehierarchy.

object libraryA tool in the Designer that gives you access to reusable objects.

object versionAn instance of an object. Each time a you add or check in an objectto the central repository, Data Services creates a new version of the

SAP BusinessObjects Data Services Getting Started Guide 53

AGlossary

Page 54: Data sevice architecture

object. The latest version of an object is the last or most recentversion created.

ODBC (Open Database Connectivity)A standard developed by the Microsoft Cooperation. It is an interfacethat gives applications the ability to retrieve data in data managementsystems using SQL for accessing the data. Such an interface allowsa developer to develop, compile, and ship applications withouttargeting specific database management systems.

ODS (Operational data store)An OLAP-designed relational database that an enterprise hasdesignated as the operational database of record (for example, afinance department might use an ODS to close its books).

OLAP (Online Analytical Processing)An approach to quickly answer multi-dimensional analytical queries.Databases configured for OLAP use amultidimensional data model,allowing for complex analytical and ad-hoc queries with a rapidexecution time. OLAP systems are used in a query environment,such as for a business intelligence application.

OLTPOnline transaction processing. A relational database design optimizedfor operational use. OLTP systems are used in an operationalenvironment, such as for an enterprise application.

open hub destinationAn object within the open hub service that contains all informationabout a target system for data in an InfoProvider. The open hubservice enables you to share data from an SAP NetWeaver BWsystem to non-SAP data marts, analytical applications, and otherapplications such as Data Services. It ensures controlled distributionand the consistency of data across several systems.

operation codeA flag associated with a row in a data set that indicates the statusof the data in the row. The operation codes are INSERT, UPDATE,DELETE, and NORMAL.

operational dashboardA category of reports on the Management Console to see at a glancethe status and performance of job and data flow executions over agiven time period.

54 SAP BusinessObjects Data Services Getting Started Guide

GlossaryA

Page 55: Data sevice architecture

optionBusiness rules that can be set for a Data Quality transform thatspecify how you want to process your data. Each Data Qualitytransform has a different set of available options. Options and theirvalues are displayed in the Option Editor.

Option EditorA tab in a Data Quality transform editor through which you canchange the value for each option within the transform.

Option ExplorerA pane in the Associate, Match, and User-Defined transform editors.The Option Explorer shows a list of the option groups within atransform.

option groupContain a set of options that allow you to set different business rulesfor a transform. These are displayed in the Option Explorer.

other sourceIn a Match transform, a source of records that should be treated astransparent, such as seed sources. They are not counted indetermining how to characterize a match group—for example,multi-source or single-source. For example, some mailers use aseed source of potential buyers who report back to the mailer whenthey receive a mail piece so that the mailer can measure delivery.

parameterA value passed to a work flow or data flow when that flow is called.

partitionTo divide table data into sets based on a criteria such as a range orlist of values in each row. You can configure Data Services to readand write partitioned table data in parallel threads. Designing jobswith partitioned table data can improve job performance if a JobServer's computer memory and number of CPUs supports the job'sparallel-processing configuration settings.

passenger recordThe records that are compared against driver records in a breakgroup. After a driver record has been compared with every passengerrecord in a break group, a passenger record can become the newdriver record in the break group, or it can be found to be a match

SAP BusinessObjects Data Services Getting Started Guide 55

AGlossary

Page 56: Data sevice architecture

with a driver record. At this point it is taken out of the comparisonprocess.

pattern fileUser-defined patterns are stored in a pattern file. The pattern file isa plain text file and can be edited in any text editing program. Thepattern file is used by the Data Cleanse transform.

pick listA type of list returned by the Global Suggestion Lists transform thatis used to narrow down an address by starting with minimalinformation. A pick list returns possibilities in a similar manner to asuggestion list. You can pick an entry from this list to continueprocessing.

PMB (Private mail box)Private mail boxes are like post-office boxes but they are hosted byprivate companies. The USA Regulatory Address Cleanse and theGlobal Address Cleanse transforms can recognize certain forms ofPMB data when it appears in an address line.

postal addressA delivery address that is a rural route or box number.

postal codeA system of letters and/or digits used for sorting mail. Examplesinclude the ZIP Code used in the United States and the alphanumericFSA LDU system used in Canada.

postcode moveA valid postcode that has been split or moved, so only a portion ofthe area that had been covered by the one postcode now has twoor more postcodes, including the original one, for the same area.

Postcode2The secondary part of a postal code. For example, in the UnitedStates, a postcode is composed of two parts (54601-4051). The firstfive digits are followed by a hyphen and a four-digit code. Thefour-digit code is the Postcode2 for a US postcode.

prepackaged adapterAn adapter prebuilt and purchased from SAP, such as the DataServices Salesforce.com adapter.

primary entry

56 SAP BusinessObjects Data Services Getting Started Guide

GlossaryA

Page 57: Data sevice architecture

A word or phrase in the dictionary that the data cleansing packagesand Data Cleanse transform use to identify, parse, and standardizedata.

primary keyA column that is guaranteed to contain unique values, and whosevalues identify all of the rows in a table.

projectThe collection of jobs available in the Designer at a given time. Aproject provides a way to organize the objects you create.

propertyDetailed descriptive information about objects that you display onthe Designer. It includes information such as when it was created.

Query transformA data transformation object that you can use to map columns froma source to a target schema, add new columns to the target schema,determine the data to extract, and perform operations on the data.Similar to an SQL SELECT statement, a query creates a data setthat satisfies the conditions you specify.

Rapid MartRapid Mart packages provide prebuilt data mart solutions forenterprise applications, such as SAP, PeopleSoft, Oracle, and Siebel.These powerful solutions combine domain knowledge and dataintegration best practices in prebuilt data models, transformationlogic, and data extraction. Rapid Marts packages are add-ons toData Services.

real-time jobA group of objects (data flows, work flows, conditionals, scripts, andso forth) that execute on-demand as a "request-response" system.You design real-time jobs in the Designer, then configure them asreal-time services and associate them with an Access Server in theAdministrator, where they are started, managed and monitored.When a real-time service receives a request from a caller, itprocesses the request and returns a reply.

reference fileA file of address data used by Data Services to match, assign,standardize, and verify addresses. Reference files are also referredto as postal directories. These files have a .dir extension.

SAP BusinessObjects Data Services Getting Started Guide 57

AGlossary

Page 58: Data sevice architecture

relational dataA data set in which data in each column contains a scalar value.Data Services can process relational data; it can also process nesteddata.

repositorySee Data Services repository.

request/acknowledge operationThis operation is used to execute a remote HTTP service in theRequest Acknowledge mode. In other words, it makes the requestto the remote machine where the HTTP Adapter server is runningand does not wait for the reply; instead, it sends an acknowledgementif the operation is successful.

request/reply

This operation is used to execute a remote HTTP service in theRequest Reply mode. In other words, it makes the request to theremote machine where the HTTP server is running and waits for thereply.

reusable objectAn object (such as a data flow, datastore, or job) that can be defined,stored, and reused independent of other objects. Any object that isvisible in the object library.

RFC (Remote Function Call) serverThe Data Services RFC server allows third-party programs, includingSAP Applications and SAP NetWeaver BW, to schedule and initiateData Services jobs and return the results to Data Services.

RFC server InterfaceThe node on the Administrator application of the Data ServicesManagement Console where you configure SAP connections to loaddata into or read data from an SAP NetWeaver BW system. DataServices uses the RFC server interface to to schedule SAP jobs,read from SAP open hub destinations, load data into SAPNetWeaverBW, and to view Data Services logs from SAP NetWeaver BW.

rule file

For the Data Cleanse transform, the rule file controls how theapplication parses groups of output type subcomponents for name,firm, phone, SSN, and other non-address data.

58 SAP BusinessObjects Data Services Getting Started Guide

GlossaryA

Page 59: Data sevice architecture

For example, if you input “Mr. and Mrs. John Smith,” the applicationcould parse it into the individual components “Mr.,” “and,” “Mrs.,”“John,” “Smith.” This is very useful, but generally, you would alsowant to parse the whole group of related data “Mr. and Mrs. JohnSmith.” To parse data in this way, you must create rules.

rule matchingMatches the token classifications against defined rules.

sample sizeThe number of rows to display in the View Data feature.

sampling rateThe number of rows processed after which Data Services writesinformation to the monitor log file and updates job events.

sampling rowsThe frequency to select a sample row to profile, starting with the firstrow of the specified number of sampling rows. For example, if youset Profiling size to 1000000 and set Sampling rows to 100, theProfiler profiles rows number 1, 101, 201, and so forth until 1000000rows are profiled.

SAP ApplicationsAn ERP system. Formerly known as SAP R/3 or SAP ERP.

SAP BusinessObjects EnterpriseA business intelligence platform that powers the management andsecure deployment of specialized end-user tools for reporting, queryand analysis, and performancemanagement on a scalable and openservices-oriented architecture.

SAP BusinessObjects InfoViewA web-based interface that end users access to view, schedule, andkeep track of published reports. InfoView consolidates thepresentation of a company's business intelligence information andallows it to be accessed in a way that is secure, focused, andpersonalized to users inside and outside an organization.

SAP BusinessObjects Rapid MartSAP BusinessObjects Rapid Mart packages provide prebuilt datamart solutions for enterprise applications, such as SAP, PeopleSoft,Oracle, and Siebel. These powerful solutions combine domainknowledge and data integration best practices in prebuilt data

SAP BusinessObjects Data Services Getting Started Guide 59

AGlossary

Page 60: Data sevice architecture

models, transformation logic, and data extraction. Rapid Martspackages are add-ons to Data Services.

SAP BusinessObjects Web IntelligenceA web-based query and analysis tool that enables users to track,understand, and manage corporate data using a simple browser astheir interface, while maintaining tight security over data access.Long form:

SAP NetWeaver Business Warehouse (SAP NetWeaver BW)SAP NetWeaver Business Warehouse. Formerly known as SAPBusiness Information Warehouse.

scriptA step in a job or work flow that allows you to calculate values topass to other parts of the job or work flow. The script can callfunctions, execute if-then-else statements, and assign values tovariables. Write a script in the Data Services scripting language.

secondary informationAssists Data Cleanse in determining how to process the word whenit is used in different ways. Secondary information can include howData Cleanse will standardize the output data for the word oralternate forms that could potentially be matched to the word .

segmentFormat with which the data records of IDocs are interpreted.

SERPCanada Post Corporation’s Software Evaluation and RecognitionProgram. Data Quality is certified under this program, allowing youto receive postage discounts for mailings to and within Canada.

server groupA defined collection of Job Servers on different computers. A servergroup automatically measures resource availability on each JobServer in the group and distributes batch jobs or part of a job to theJob Server with the lightest load at run time. Use the Server Groupsnode in the Administrator’s navigation tree to group Job Servers thatare associated with the same repository into a server group.

service requestAny message sent from a Web client that requires processing by areal-time job.

similarity score

60 SAP BusinessObjects Data Services Getting Started Guide

GlossaryA

Page 61: Data sevice architecture

A percentage that indicates how much two fields or values areconsidered alike. This percentage is calculated by the applicationafter the comparison process. For example, Ron and Rob areconsidered 67% alike because two of the three characters are alike.

Similarity scores are used in a number of situations— not just in theMatch transform. For example, they can be used to determine whichsuggestions to return for suggestion lists.

The similarity score is not always a direct result of a one-to-onecomparison; it can be altered by some options, such as those definedin the Match transform, for example.

single-use objectA step in a work flow or data flow that cannot be saved independentlyof the flow. Create single-use objects (such as a try/catch block,script, or conditional) from the tool palette.

smart editorA flexible editing tool in Data Services used for creating scripts,expressions, and custom functions without having to type the namesof existing elements like column, function, and variable names.

SNMP (System Network Management Protocol)A protocol that helps network administrators manage network routinghardware. The protocol can manage a variety of hardware andsoftware devices. Data Services supports monitoring through SNMP.

snowbirdA casual term to describe someone who has multiple residences.This term is derived from individuals who reside in a cooler-climateregion during the summer, and relocate to a home in awarmer-climate region during the winter.

SOAP (Simple Object Access Protocol)An XML-based message protocol used to encode the information ina web service request and response messages before sending themover a network or Internet.

source1. An object (table, file, or legacy system) from which Data Services

reads data.2. For the Match transform, the grouping of records on the basis of

some data characteristic that you can identify. A source mightbe all records from one input file, or all records that contain a

SAP BusinessObjects Data Services Getting Started Guide 61

AGlossary

Page 62: Data sevice architecture

particular value in a particular field. Sources are abstract andarbitrary—there is no physical boundary line between sources.Source membership can cut across data sources as well asdistinguish among records within a data source, based on howyou define the source.

source group

A group of sources that you can use to prepare a second set ofmatch statistics, combining the statistics for two or more regularsources. For example, suppose you define five sources—two housesources and three rented sources. You would get match statisticsfor each individual source. But suppose that you also wanted asummary for the house sources and a summary for the rentedsources. You could create two source groups—one for the housesources and one for the rented sources.

Source groups affect only the way that match statistics are reported.They do not affect matching or record priority.

source recordThe location where the data you want to use to update or createyour best record with resides. A source record can be the master orsubordinate record of a match group.

SQL (Structured Query Language)A query language for accessing relational, ODBC, DRDA (DistributedRelational Database Architecture), or non-relational databasesystems.

SQL query toolAn end-user tool that accepts SQL to be processed against one ormore relational databases.

standards

Define how Data Cleanse will standardize capitalization or otheroutput formatting on data.

star schemaA database design you can use to format data in a data mart. Thisdesign is based on a single fact table to which any number ofdimensional tables may be joined. This type of database designsupports multi-dimensional database analysis.

step

62 SAP BusinessObjects Data Services Getting Started Guide

GlossaryA

Page 63: Data sevice architecture

An object that is part of the definition of a work flow or data flow.Each step is represented by an icon in the diagram of the flow andis connected to other steps to indicate the flow of data through thedata flow or the order of execution in the work flow.

street addressA delivery address that is the street name and house number.

subordinate recordRecords that are part of a match group, and are found to be matcheswith (and subordinate to) a master record. Subordinate records cancontain data that may be used to update a master record and, thus,create a best record.

substitution parameterA text string "alias" that you can use within your job and transforms.You define a substitution parameter and its value in a substitutionparameter configuration. Then, at runtime, that parameter is replacedwith its value anywhere it is used in your job.

substitution parameter configurationThe definition of the substitution parameters used throughout yourjob in a particular run-time environment . If you change the run-timeenvironment, you can change the substitution parameterconfiguration before you execute the job.

suggestion listsNormally, when an address cleansing transform looks up an addressin the postal directories, it finds one matching record. Sometimes,due to incomplete information, there may be two or more records(or suggestions) in the postal directories that could possibly be thecorrect record. Suggestion lists provide you with a list of “matching”addresses, so that you can choose which is the best address.

suppression source

A source that contains records of information that should be excludedfrom other output destinations. The records in the suppression sourceare used for matching in other sources. The records that match thesuppression source could then be removed from further processing.

For example, suppression sources may be your own bad-accountfile or no-mail sources provided by the government ordirect-marketing association (DMA) to prevent wasted mailings andoffending consumers.

SAP BusinessObjects Data Services Getting Started Guide 63

AGlossary

Page 64: Data sevice architecture

system configurationGroups together a set of datastore configurations and a substitutionparameter configuration. Data store configurations define datastoreconnections. A substitution parameter configuration can beassociated with one or more system configurations. For example,you might create one system configuration for your local system anda different system configuration for another system. When executinga job, you can specify which system configuration to use.

tableA database table that Data Services reads data from or loads datainto. The path and mechanisms for reading and loading data andapportioning the data among rows and columns are defined in thedatastore that the table is associated with. Writing a data set to adatabase table means sending a combination of rows withappropriate operation code to the database table.

targetAn object in which Data Services loads extracted and transformeddata in a data flow. Data Services loads rows flagged as INSERT,UPDATE, or DELETE.

TCP/IPTransmission Control Protocol/Internet Protocol. The basiccommunication protocol of the internet, and often intranets andextranets. A computer having direct access to the internet containsa copy of the TCP/IP program. TCP/IP makes it possible forcomputers to communicate with each other.

Tdpid(Teradata Director Program ID) The server name Data Servicesuses when loading with the bulk loader option. Data Services usestdpid as a Teradata Warehouse Builder operator attribute.

territoryThe locale value for a geographical location (usually the country)where a locale language is used. The paring of a language with aterritory determines factors such as date format, time format, decimalseparator, currency format, and so on.

threadThe instance of the program running on behalf of some process.Data Services typically creates one thread per data flow object. Ifyou are using parallel objects in data flows, the thread count will

64 SAP BusinessObjects Data Services Getting Started Guide

GlossaryA

Page 65: Data sevice architecture

increase to approximately one thread for each source or target tablepartition. If you set the Degree of parallelism (DOP) option for yourdata flow to a value greater than one, the thread count per transformwill increase. The operating system will distribute the threads amongthe available CPUs.

tokenizationAssigns specific meanings to each of the pieces that result fromword breaking. Data Cleanse looks up each individual input word inthe dictionary. A list of tokens is created using the classificationsassociated with each word in the dictionary.

tooltipA small pop-up window with descriptive text.

transfer ruleIn SAP NetWeaver BW, transfer rules help you determine how thefields for the transfer structure are assigned to the InfoObjects ofthe communication structure.

transfer structureIn SAP NetWeaver BW, a structure in which data is transferred fromthe source system into BW. It displays a selection of fields for anextract structure of the source system. To an ETL tool like DataServices, a transfer structure looks like a table.

transformA step in a data flow that acts on a data set. Data Services transformsare available through the object library in three cateogories: DataIntegrator, Data quality, and Platform.

transparent network substrate (TNS)The Oracle networking technology that provides a single applicationinterface to all industry-standard networking protocols. It is storedin the tnsnames.ora network configuration file. Use a TNS to connectto your Oracle database or the Data Services Repository (stored inan Oracle database).

try/catch blockA combination of a try object and one or more catch objects thatdefine alternate execution paths in case an error occurs during theexecution of a job. You can tune try/catch blocks to trap specificerrors and to provide general alerts or messages if an error occurs.

Unicode

SAP BusinessObjects Data Services Getting Started Guide 65

AGlossary

Page 66: Data sevice architecture

A standard that was designed to create a universal character set. Itaccomplishes this by providing a unique number for every characterin every language.

The Unicode Standard describes more than 50,000 characters,including all the characters of the common character sets in usewhen Unicode was established around 1990, as well as many thathave been added since then. Unicode is an open character set,meaning it can continue to incorporate characters as needed.

Unicode can handle letters, punctuation, and technicalsymbols—regardless of platform, program, writing system, orlanguage.

unique identifier

In a Data Quality transform, an ID that is unique to a record or groupof matching records. It is sequential, static, and will not change whenrecords are updated or re-processed through the application.

unique recordRecords that do not have any matching or subordinate records and,therefore, do not belong to any match group after the matchingprocess is complete.

universeIn SAP BusinessObjects Enterprise, an abstraction of a data sourcethat presents data to users in non-technical terms.

upstreamA data flow object, such as a transform, that is placed before anotherdata flow object in a job.

variable

A symbolic placeholder for a value. Data Services lets you definelocal variables and global variables.• Local variables pertain to the work flow or custom function in

which they are defined. You can pass the value into another workflow or data flow using a parameter.

• Global variables pertain to the job in which they are defined. Withglobal variables, there is no need to define parameters betweenobjects in the job. Global variables can also be selected atexecution time. This eliminates the need to open the Designerto set global variable values.

66 SAP BusinessObjects Data Services Getting Started Guide

GlossaryA

Page 67: Data sevice architecture

web serverA machine or application that serves web pages over the Internetor intranet. A web server hosts pages, scripts, programs, andmultimedia files and then serves them using HTTP, which sendsfiles to a client web browser.

web servicesA standard platform for integrating applications. Web Services allowdifferent programs, constructed in different languages, on differentplatforms to communicate with each other.

weighted scoringA method of comparison that provides you with a greater degree ofcontrol in the matching process. This method allows you to usecontribution values to place more or less importance on variousmatch criteria.

word breakingBreaks the input line down into smaller, more usable pieces. Bydefault, Data Cleanse breaks an input line on white space,punctuation, and alphanumeric transitions. Terms such as 20GB,4G, 1st, and U2 each break into two tokens at the alphanumerictransition. For example, "20GB" breaks into "20" and "GB" tokens.

work flow

A reusable object containing steps to define the order of jobexecution. Work flows call data flows, but cannot manipulate datathemselves.

You call work flows from inside other work flows or jobs. You passinformation into or out of work flows using parameters. You reusework flows by dragging existing work flows from the object library.

workspaceThe window inside the Designer in which you define, display, andmodify objects. The workspace for a data flow contains an area tobuild a diagram representing the data flow definition. The workspacefor a transform contains an editor for modifying the transform options.

WSDLWeb Services Definition Language.Web services are self-contained,modular business process applications based on open Internetstandards.

XML

SAP BusinessObjects Data Services Getting Started Guide 67

AGlossary

Page 68: Data sevice architecture

Extensible Markup Language. This markup language is like HTML(Hypertext Markup Language) in that it specifies a standard withwhich you can define your own markup languages with their ownsets of tags. XML allows you to define various tags with variousrules, such as tags that represent business rules, tags that representdata description, or tags that represent data relationships.

XML SchemaThe XML format used by Data Services to support messageprocessing that includesWeb Services. XML Schemas describe thedata structure of an XML file or message. Data flows can read andwrite data to messages or files based on a specified XML Schemaformat. You can use the same XML Schema to describe multipleXML sources or targets. XML Schema properties include: Name,Description, Imported from, Root element name, and Namespace.

Z4ChangeThe Z4Change directory lists all the ZIP and ZIP+4 Codes in thecountry. A record in this file is tagged if it has changed within thelast 12 months. The change might be a postal-code change (ZIP,ZIP+4, or CART), or even a change in the standardized form of theaddress-line or city name.

ZCFThe ZIP-City File directory that is used by the USA RegulatoryAddress Cleanse transform when processing data from the U.S.

ZIP CodeZIP is an acronym that stands for "Zone Improvement Plan." This isa 3-, 5-, or 9-digit number that represents a geographic region of theUnited States. The ZIP Code is important in determining entryeligibility and presort containerization. Note that this code is differentfrom a facility code.

ZIP+4A nine-digit number, consisting of the ordinary ZIP Code and afour-digit, add-on code.

zoneThe ZIP-City File directory that is used by the USA RegulatoryAddress Cleanse transform when processing data from the U.S.

68 SAP BusinessObjects Data Services Getting Started Guide

GlossaryA

Page 69: Data sevice architecture

IndexAAccess Server

description 16Adapter SDK 21Address Server 16Administrator

description 16Auto Documentation reports 18

Ccentral repository 14components

description 12

DDesigner

description 14distributed architecture 23distributing components across network 23

Eengine 15

Hhost names using IP address 25

IImpact and Lineage Analysis reports 17

IP addresseshost name, using for 25specifying connection 25

JJob Server

description 15

LLicense Manager 22local repository 14

Mmanagement tools 22Metadata Integrator

description 19metadata, reporting tool 17

Nnetwork, models of distribution 23

Ooperating systems supported 23Operational Dashboard reports 18

Pports

requirement for 25

SAP BusinessObjects Data Services Getting Started Guide 69

Page 70: Data sevice architecture

Rrepository

central 14description 14local 14

Repository Manager 22

Sscalability 25Server Manager 22

standard components 12

TTCP/IP

connections required 25connections, defining 25

Uutilities 22

70 SAP BusinessObjects Data Services Getting Started Guide

Index