Click here to load reader

Oracle®EnterpriseDataQualityCustomerDataServicesPack · PDF file TableofContents TableofContents 3 Chapter1:Introduction 4 Chapter2:CleaningServices 5 2.1AddressClean 5 2.2IndividualClean

  • View

  • Download

Embed Size (px)

Text of Oracle®EnterpriseDataQualityCustomerDataServicesPack · PDF file TableofContents...

  • Oracle ® Enterprise Data Quality Customer Data Services Pack

    Business Services Guide

    Version 9.0

    April 2013

  • Copyright © 2006, 2013, Oracle and/or its affiliates. All rights reserved.

    Oracle ® Enterprise Data Quality Customer Data Services Pack, version 9.0

    Copyright © 2006, 2013, Oracle and/or its affiliates. All rights reserved.

    This software and related documentation are provided under a license agreement containing restrictions on use and disclosure and are protected by intellectual property laws. Except as expressly permitted in your license agreement or allowed by law, you may not use, copy, reproduce, translate, broadcast, modify, license, transmit, distribute, exhibit, perform, publish, or display any part, in any form, or by any means. Reverse engineering, disassembly, or decompilation of this software, unless required by law for interoperability, is prohibited.

    The information contained herein is subject to change without notice and is not warranted to be error-free. If you find any errors, please report them to us in writing.

    If this is software or related documentation that is delivered to the U.S. Government or anyone licensing it on behalf of the U.S. Government, the following notice is applicable:

    U.S. GOVERNMENT RIGHTS Programs, software, databases, and related documentation and technical data delivered to U.S. Government customers are "commercial computer software" or "commercial technical data" pursuant to the applicable Federal Acquisition Regulation and agency-specific supplemental regulations. As such, the use, duplication, disclosure, modification, and adaptation shall be subject to the restrictions and license terms set forth in the applicable Government contract, and, to the extent applicable by the terms of the Government contract, the additional rights set forth in FAR 52.227-19, Commercial Computer Software License (December 2007). Oracle America, Inc., 500 Oracle Parkway, Redwood City, CA 94065.

    This software or hardware is developed for general use in a variety of information management applications. It is not developed or intended for use in any inherently dangerous applications, including applications that may create a risk of personal injury. If you use this software or hardware in dangerous applications, then you shall be responsible to take all appropriate fail-safe, backup, redundancy, and other measures to ensure its safe use. Oracle Corporation and its affiliates disclaim any liability for any damages caused by use of this software or hardware in dangerous applications.

    Oracle and Java are registered trademarks of Oracle and/or its affiliates. Other names may be trademarks of their respective owners.

    Intel and Intel Xeon are trademarks or registered trademarks of Intel Corporation. All SPARC trademarks are used under license and are trademarks or registered trademarks of SPARC International, Inc. AMD, Opteron, the AMD logo, and the AMD Opteron logo are trademarks or registered trademarks of Advanced Micro Devices. UNIX is a registered trademark of The Open Group.

    This software or hardware and documentation may provide access to or information on content, products, and services from third parties. Oracle Corporation and its affiliates are not responsible for and expressly disclaim all warranties of any kind with respect to third-party content, products, and services. Oracle Corporation and its affiliates will not be responsible for any loss, costs, or damages incurred due to your access to or use of third-party content, products, or services.

    - 2 -

  • Table of Contents

    Table of Contents 3

    Chapter 1: Introduction 4

    Chapter 2: Cleaning Services 5

    2.1 Address Clean 5

    2.2 Individual Clean 9

    2.3 Entity Clean 10

    Chapter 3: Clustering Services 13

    3.1 Use 13

    3.2 Interface 13

    3.3 Parameters 14

    Chapter 4: Matching Services 15

    4.1 Use 15

    4.2 Matching using Multiple Identifier Values 16

    4.3 Interfaces 17

    4.4 Parameters 17

    Chapter 5: Data Interfaces 19

    5.1 Candidate Interfaces 19

    5.1.1 Individual Candidates 19

    5.1.2 Entity Candidates 22

    5.1.3 Address Candidates 24

    5.2 Matches Interface 24

    Chapter 6: Real-Time Integration 26

    - 3 -

  • Chapter 1: Introduction Oracle EDQ-CDS can be used as a standalone addition to EDQ, or it can be called by third- party applications.

    The provided business services are ready for integration with Siebel (CRM or UCM) but may also be called by other applications if they are configured to do so.

    Out-of-the-box, CDS provides pre-configured data quality services which can be modified or enhanced by editing the underlying EDQ processes that implement their functionality. There are three types of service, and each of the three types of service can be used with Individual, Entity, and Address records:

    l Cleaning

    l Clustering (for use in Matching integrations)

    l Matching

    NOTE: It is possible to further extend the functionality of the EDQ-CDS pack by installing the Data Quality Health Check project, which allows users to perform batch data quality checking of customer data. See the EDQ-CDS Data Quality Health Check Guide for further information.

    - 4 -

  • Chapter 2: Cleaning Services There are three cleaning services provided in EDQ-CDS: Address Clean, Individual Clean and Entity Clean. The Individual and Entity Clean services are provided as placeholders, which are pre-integrated with Siebel, and easily integrated with other applications, but which need to be modified towards specific requirements.

    2.1 Address Clean The EDQ-CDS Address Clean web service provides the following functionality:

    1. Verification of an input address (returning a verification code and description)

    2. Geocoding of an address (returning latitude and longitude co-ordinates, with addi- tional metadata)

    3. Correction, standardization and completion of input addresses (provided the address was verified to a sufficient, configurable, level)

    The Address Clean service appears in the EDQ-CDS project as the AddressClean web service, and its functionality provided by the Real-time Address Clean job, which uses the Clean - Address and Clean - Address Post Process processes.

    The service is N:N; that is, single and multiple record input and output is possible, but only one record is returned for each record submitted. Each input address is verified and may be corrected, enhanced and geocoded, depending on the options that the job is run with, and the input parameters.

    Note that Siebel's Data Quality interface always calls the service with a single record per request, whether running in real-time or batch.


    The Address Clean web service is normally used for real-time verification and cleansing of addresses as they are entered and updated in an application, such as Siebel.

    In the case of Siebel, the web service is also used in batch. When a batch address cleansing job is run, the web service will be used on all of the in-scope records in the batch job.

    For other applications, it is recommended to add configuration to EDQ to map data from and to the Address Clean data interface in order to run the service in batch mode.


    The following table provides a guide to the interface attributes of the AddressClean web service.

    Attribute Name Data Type

    Use Notes

    addressid String In/Out Unique identifier for the address. address1 String In/Out Address line 1 address2 String In/Out Address line 2

    - 5 -

  • Attribute Name Data Type

    Use Notes

    address3 String In/Out Address line 3 address4 String In/Out Address line 4 dependentlocality String In/Out A smaller population center data element,

    dependent on the contents of the city field. For example, a Neighborhood in Turkey. For many countries, this attribute is not used.

    doubledependentlocality String In/Out The smallest population center data element, dependent on both the contents of the city and dependentlocality fields. For example, a village in the UK. For many countries, this attribute is not used.

    city String In/Out The locality, town or city of the address. subadminarea String In/Out The smallest geographic data element within

    a country. For example, a county in the USA. adminarea String In/Out The most common geographic data element

    within a country. For example, a State in the USA or a Province in Canada.

    postalcode String In/Out Postal or zip code for the address, if relevant for the country.

    country String In/Out On input, a country name or ISO 2 char code. On output, the full country name (even if the input is the country ISO code).

    defaultcountrycode String Input only

    Default ISO 2 char country code to use if country not populated. This overrides the default value used when running the job.

    case (parameter) String Input only

    Transforms output according to the setting: U (Upper)- Transforms all text to Upper case. L (Lower)- Transforms all text to Lower case. M (Mixed) - Transform all text to Mixed case, except postalcode and adminarea. These field values are left as returned from the AV processor if the address was verified. If the address was not verified, the postalcode is left as entered, and the adminarea is converted to Mixed case. O (Original) - Text is not transformed.

    mode (parameter) String Input only

    Mode in which the request is to be run. Currently, only V (Verify) is supported.


Search related