SAP BusinessObjects Data Quality Management, version for Informatica PowerCenter User's Guide

Data Quality Management for Informatica (12.0.0) August 2009

Notices
Copyright information

2009 SAP, AG. All rights reserved. SAP Business Objects owns the following U.S. patents, which may cover products that are offered and licensed by Business Objects: 5,295,243; 5,339,390; 5,555,403; 5,590,250; 5,619,632; 5,632,009; 5,857,205; 5,880,742; 5,883,635; 6,085,202; 6,108,698; 6,247,008; 6,289,352; 6,300,957; 6,377,259; 6,490,593; 6,578,027; 6,581,068; 6,628,312; 6,654,761; 6,768,986; 6,772,409; 6,831,668; 6,882,998; 6,892,189; 6,901,555; 7,089,238; 7,107,266; 7,139,766; 7,178,099; 7,181,435; 7,181,440; 7,194,465; 7,222,130; 7,299,419; 7,320,122 and 7,356,779. Business Objects and its logos, BusinessObjects, Business Objects Crystal Vision, Business Process On Demand, BusinessQuery, Cartesis, Crystal Analysis, Crystal Applications, Crystal Decisions, Crystal Enterprise, Crystal Insider, Crystal Reports, Crystal Vision, Desktop Intelligence, Inxight and its logos, LinguistX, Star Tree, Table Lens, ThingFinder, Timewall, Let There Be Light, Metify, NSite, Rapid Marts, RapidMarts, the Spectrum Design, Web Intelligence, Workmail and Xcelsius are trademarks or registered trademarks in the United States and/or other countries of Business Objects and/or affiliated companies. SAP is the trademark or registered trademark of SAP AG in Germany and in several other countries. All other names mentioned herein may be trademarks of their respective owners.

Contents

Preface
    About this guide
    Assumed audience and the content of this guide
    Access the latest documentation
    Datatype support
    Product naming conventions

Chapter 1: Overview
    What is Data Quality Management for Informatica?
        Data quality functions
        Name cleansing
        Address cleansing
        Matching function
        Flexible configuration

Chapter 2: Installation
    System requirements
        Requirements for Data Quality Management for Informatica
    Installation
        About installation directories
    Install Data Quality Management for Informatica (Windows and UNIX)
        The steps
        Windows 2003 installation
    Step 1: Install Data Services
        Data Services installation notes
        Install sample dataflows to Data Services
        DPV, LACSLink, and URAC transforms
    Step 2: Install the Data Quality Management for Informatica plug-in and samples
        Install server plug-in to your Informatica Server(s)
        Install client plug-in to your Informatica client(s) and copy the Data Services workflow samples
        The Data Services sample workflows
    Step 3: Register the DQM_Plugin.xml file
        Register to the repository
    Step 4: Verify the Data Quality Management for Informatica toolbar
    Step 5: Import and set up the sample mappings\workflows
        Import the sample mappings
        Set up the sample mappings
    Step 6: Specify the Java class path
    Step 7: Verify the installation
    Install Data Quality Management for Informatica on UNIX
        Installation directories
        Permissions
        Add to environment variables
        Install C++ runtime upgrade (Linux and HP-UX)
    If you also use a stand-alone SAP BusinessObjects (Firstlogic) product
        What causes the conflict?
        Different user profiles
        Scripts on UNIX
        Batch files on Windows

Chapter 3: Add and use Data Quality Management for Informatica transformations
    Data Quality Management for Informatica transformations
        Reusable versus nonreusable transformations
        Create a new transformation
        Transformations provided with this product

Chapter 4: Troubleshoot Data Quality Management for Informatica
    Set the tracing level
    Logging
    Connection information
    Data Quality Management for Informatica transformation/mapping issues
        The sample mappings are failing in the Workflow Manager
        Data isn't being processed when sent through a Data Quality Management for Informatica transformation
        Process terminating on signal/exception
    Sample active batch match workflow (wf_ABM) fails

Glossary

Index

Preface

About this guide

This guide provides a general overview of SAP BusinessObjects Data Quality Management, version for Informatica PowerCenter as well as specific information for installing and integrating this product into your Informatica environment. Some sections in this guide may not apply to your version of Data Quality Management for Informatica, depending on which product options you purchased.

Assumed audience and the content of this guide

This guide assumes that the people responsible for integrating Data Quality Management for Informatica are experienced users of the various Informatica applications including the PowerCenter Repository Manager, Designer, and Workflow Manager. It does not explain basic operation of the Informatica applications. For questions about the Informatica applications, consult the Informatica documentation or Informatica technical support.

This guide also does not explain the operation of SAP BusinessObjects Data Services. For specific questions about Data Services, consult the Data Services documentation.

Access the latest documentation

In addition to the documentation installed on your computer during installation of this product, you can access all the supported SAP Business Objects documentation from the SAP Business Objects Customer Support site.
1. Go to http://help.sap.com.
2. Select the Business Objects tab.
3. In the SAP Business Objects Overview section, select All Products.
4. On the Business Objects Product Guides screen, navigate to the document that you want to view.
You can view the PDFs online or save them to your computer.

Conventions

This document follows these conventions:


Convention: Description

Bold: Highlights file names, paths, and text that you should type exactly as shown. For example, Type cd\dirs.
Italics: Both emphasizes text and indicates text for which you should substitute your own data or values. For example, Type a name for your file, and the .txt extension (testfile.txt).
> in menu commands: Indicates commands that you choose from menus in the following format: Menu Name > Command Name. For example, Choose File > New.

Alerts you to important information and potential problems.
Points out special cases that you should know about.
Draws your attention to tips that may be useful to you.

About versions

Informatica versions

This product is engineered for Informatica PowerCenter 8.6.1. It is not compatible with previous versions of PowerCenter.

SAP BusinessObjects technology versions

This product uses SAP BusinessObjects Data Services XI Release 3.2 (12.2.0.1) with the Data Quality Management Premium license to power its data quality processes. This product is not compatible with any version prior to 12.2.0.1.

Datatype support

This product supports only the string datatype.

Product naming conventions

For brevity, this guide often refers to SAP BusinessObjects Data Quality Management, version for Informatica PowerCenter as Data Quality Management for Informatica, and to SAP BusinessObjects Data Services XI Release 3.2 (12.2.0.1) as Data Services.

Chapter 1: Overview

Before you use SAP BusinessObjects Data Quality Management, version for Informatica PowerCenter, you should understand some of the concepts behind data quality and address cleansing. A basic familiarity with these concepts will help you to understand what Data Quality Management for Informatica does and why your organization benefits from data quality.

What is Data Quality Management for Informatica?


Data Quality Management for Informatica is designed to ensure information quality within Informatica PowerCenter installations. With the capabilities provided by Data Services, it can provide batch verification and correction of name and address data, and can include sophisticated processes to identify matching records.

Data quality functions

Data Quality Management for Informatica brings name and address integrity to your Informatica environment through preset transformations that your users can easily integrate into standard Informatica extract, transform, and load (ETL) mappings.

Name cleansing

The name cleansing capability of this product uses extensive name dictionaries to find name and firm (business name) information and, if necessary, add or fix that information from your source data. For example, based on the name dictionary information, you can add to your source data other name-related components, such as gender, titles, and so on.

Address cleansing

Data Quality Management for Informatica performs standardization and correction of U.S. and international addresses. You can purchase different international engines for Data Services. The engines provide different levels of address checking for certain countries. For example, if you own the Canadian engine, Canadian addresses are verified down to the street address level. With the international engine, some countries' addresses are only verified to the city level. International address cleansing may not be available in your version of Data Quality Management for Informatica, depending on which product options you purchased.

Matching function

The matching function of Data Quality Management for Informatica brings sophisticated record-matching technology to your Informatica environment. Use this matching capability in concert with address cleansing and name cleansing to improve the quality of your data by identifying matching or duplicate records.

Flexible configuration

You can integrate the data cleansing and matching functions with relatively minor configuration. In addition, your Informatica users can further fine-tune the results of Data Quality Management for Informatica's operations by making changes to the Data Services jobs.

Chapter 2: Installation

Before you can integrate SAP BusinessObjects Data Quality Management, version for Informatica PowerCenter into your application, you must install Data Services and install the Data Quality Management for Informatica plug-in to your Informatica server and client machines. Be sure to follow the instructions appropriate for your operating system.

System requirements
This section explains the requirements for installing the Data Quality Management for Informatica plug-in to operate with your Informatica PowerCenter environment. System requirements for installing Data Services can be found in the Data Services documentation.

Requirements for Data Quality Management for Informatica

Before you install the Data Quality Management for Informatica plug-in, make sure that your environment meets the following software and hardware requirements.

Informatica versions

This version of Data Quality Management for Informatica is engineered for PowerCenter 8.6.1. It is not compatible with previous versions of Informatica.

Operating systems

The following operating systems are supported in this release:
- Win 2003 Server (32-bit + 64-bit), Win XP (32-bit + 64-bit), Windows 2008 Server, and Windows Vista (SP1)
- AIX 5.2 (64-bit), AIX 5.3 (64-bit), and AIX 6.1 (64-bit)
- HP Itanium 11i V2 11.23 (64-bit) and HP Itanium V3 11.31 (64-bit)
- Red Hat Linux 4.0 (32-bit and 64-bit) and Red Hat Linux 5.0 (64-bit)
- Solaris 9 (64-bit) and Solaris 10 (64-bit)
- SuSE Linux 9.0 (64-bit) and SuSE Linux 10.0 (64-bit)

The installation routine you run is specifically designed for your operating system.

Installation
Before you can integrate Data Quality Management for Informatica into your application, you must first install Data Services. After you've installed Data Services, you can then install the Data Quality Management for Informatica plug-in to your Informatica server and client machines. Follow the instructions appropriate for your operating system:
- "Install Data Quality Management for Informatica (Windows and UNIX)"
- "Install Data Quality Management for Informatica on UNIX"

About installation directories

Use these two variables when you enter paths for the Informatica server and the client: $PMRootDir and Client_Install_Dir.

$PMRootDir

In these installation steps, procedures may refer to the Informatica server variable $PMRootDir. This User's Guide uses $PMRootDir as an abbreviation for the installation directory (home) of your PowerCenter Server program. For example, assume that your PowerCenter Server is installed in informatica_server. When the instructions refer to Enter $PMRootDir\BusinessObjects\, you enter informatica_server\Business Objects (Windows) or /informatica_server/businessobjects (UNIX). If you are entering a path within Informatica or the Data Services interface, you can simply enter the $PMRootDir variable rather than the whole path.

Client_Install_Dir

Client_Install_Dir is an abbreviation we use to refer to the installation directory (home) of your PowerCenter client. The client is only installed on Windows.
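
The $PMRootDir substitution described above can be illustrated with a short Python sketch. It is not part of the product: the function name expand_pm_root and the example directories are hypothetical, and the sketch only shows how a path written with the placeholder maps to a concrete Windows or UNIX path.

```python
from pathlib import PurePosixPath, PureWindowsPath


def expand_pm_root(template: str, pm_root_dir: str, windows: bool = True) -> str:
    """Replace the $PMRootDir placeholder and use the separators of the target OS.

    Hypothetical helper for illustration only; it is not shipped with the product.
    """
    # Accept both "dir\\file" and "dir/file" spellings in the template.
    parts = [p.strip() for p in template.replace("\\", "/").split("/") if p.strip()]
    # Swap the placeholder for the real PowerCenter Server home directory.
    parts = [pm_root_dir if p == "$PMRootDir" else p for p in parts]
    path_cls = PureWindowsPath if windows else PurePosixPath
    return str(path_cls(*parts))


if __name__ == "__main__":
    # Example directories are assumptions, not product defaults.
    print(expand_pm_root("$PMRootDir\\BusinessObjects\\", "C:\\informatica_server"))
    print(expand_pm_root("$PMRootDir/businessobjects", "/informatica_server", windows=False))
```

The pure path classes are used so that the Windows form can be produced on a UNIX machine and the UNIX form on Windows, without touching the file system.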

Install Data Quality Management for Informatica (Windows and UNIX)


These installation instructions are applicable to Windows and UNIX. However, there are additional steps to perform if you are a UNIX user. Therefore, UNIX users should read "Install Data Quality Management for Informatica on UNIX" before proceeding with the steps in this section.

Though this section applies to both Windows and UNIX installations, the instructions favor Windows terminology. For example, path names contain the backward slash (dir\file), rather than the forward slash (dir/file) and show the GUI used for a Windows installation. UNIX users should substitute appropriate file names and UNIX conventions when following these instructions.

You must install and set up Data Services before you install the Data Quality Management for Informatica plug-in.

The steps

There are several steps involved in installing and setting up Data Quality Management for Informatica in your Informatica environment:
- Step 1: Install Data Services
- Step 2: Install the Data Quality Management for Informatica plug-in and samples
- Add to environment variables (UNIX only)
- For the user who launches the PowerCenter Client (UNIX only)
- Step 3: Register the DQM_Plugin.xml file
- Step 4: Verify the Data Quality Management for Informatica toolbar
- Step 5: Import and set up the sample mappings\workflows
- Step 6: Specify the Java class path
- Step 7: Verify the installation

Windows 2003 installation

For Windows 2003, you must install both Informatica PowerCenter and the Data Quality Management for Informatica plug-in with either the same user account or with accounts that have the same permissions. In addition, the PowerCenter Server service must be started using an account that has permission to all files in both the PowerCenter installation and the Data Quality Management for Informatica plug-in installation. This ensures that all users can write to the files and folders necessary to run Data Quality Management for Informatica.

Step 1: Install Data Services


You must install Data Services by following the instructions in the Data Services documentation. This section contains information and additional procedures specific to installing Data Services to use with this product.

Data Services installation notes

You can find the instructions for installing, configuring, and running the Data Services portion of this product in the Data Services documentation. However, using Data Services as a component of this product presents some special circumstances not covered in the Data Services documentation. This section describes some additional issues unique to installing Data Services as part of Data Quality Management for Informatica.

Install Data Services first

You must install Data Services before installing the Data Quality Management for Informatica plug-in. After you have installed and configured Data Services, you can proceed to "Step 2: Install the Data Quality Management for Informatica plug-in and samples".

Install Data Services on a different machine than Informatica

We recommend that you install Data Services on a different machine than the one that contains your installation of Informatica PowerCenter. Information about the system requirements for the machine on which you install Data Services is included with the Data Services installation instructions. If you install Data Services on a different machine than the one that contains your installation of Informatica PowerCenter, you must run the installation routine for the client plug-in on the machine that contains Data Services, in addition to the machines that contain your Informatica PowerCenter clients.

Data Services previously installed

If you are already a Data Services customer (prior to the purchase and installation of this product) and therefore have already installed a version of Data Services, you must upgrade to SAP BusinessObjects Data Services XI Release 3.2 (12.2.0). See the Data Services documentation for information about upgrading Data Services.

Key information about your Data Services installation

When you install Data Services, make note of the following items about your Data Services configuration:
- Access server host
- Access server port
- Web service URL
This information is required during each installation of the client plug-in.
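
Because the access server host, access server port, and Web service URL are requested again during every client plug-in installation, it can help to record them once and sanity-check them. The following Python sketch is one way to do that. The class name, field names, and example values are all hypothetical, and the checks are generic (non-empty host, valid port range, http or https URL) rather than anything the installer itself performs.

```python
from dataclasses import dataclass
from typing import List
from urllib.parse import urlparse


@dataclass(frozen=True)
class DataServicesConnection:
    """The three values to note during the Data Services installation (hypothetical record)."""

    access_server_host: str
    access_server_port: int
    web_service_url: str

    def problems(self) -> List[str]:
        """Return readable problems; an empty list means the values look usable."""
        found = []
        if not self.access_server_host.strip():
            found.append("Access server host is empty")
        if not 1 <= self.access_server_port <= 65535:
            found.append("Access server port must be between 1 and 65535")
        parsed = urlparse(self.web_service_url)
        if parsed.scheme not in ("http", "https") or not parsed.netloc:
            found.append("Web service URL must be an http(s) URL with a host")
        return found


if __name__ == "__main__":
    # Placeholder values only; use the ones from your own installation.
    noted = DataServicesConnection("ds-host.example.com", 4000, "http://ds-host.example.com:8080/example")
    print(noted.problems() or "Values look usable")
```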

Install sample dataflows to Data Services

The installation package for the client plug-in contains sample Data Services dataflows. To use these sample dataflows, you must run the client plug-in installation routine on your Data Services machine, even if there is no Informatica PowerCenter client on the machine. The installation routine detects your Data Services configuration and automatically imports the sample dataflows file.

DPV, LACSLink, and URAC transforms

When you create a new Data Services USA Regulatory Address Cleanse (URAC) transform, the default configuration of the transform is set with the assumption that your installation of Data Quality Management for Informatica uses the Delivery Point Validation (DPV) and Locatable Address Conversion System Link (LACSLink) features. If you did not purchase one or both of these features, you must change the default settings of each URAC transform that you create. The required change affects three options. This table shows the option, the default value of the option, and the value you should use if you are not using DPV or LACSLink.

Option Group Name/Option                      Default Value      Value if not using DPV or LACSLink

ASSIGNMENT_OPTION/ENABLE_DPV                  YES or ChangeMe    NO
ASSIGNMENT_OPTION/ENABLE_LACSLINK             YES or ChangeMe    NO
NON_CERTIFIED_OPTION/DISABLE_CERTIFICATION    NO                 YES
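
The option changes in the table above can also be written down as data, which makes it easy to list what still needs to be edited in a given URAC transform. This is an illustrative Python sketch only: the dictionary and function names are hypothetical, the default "YES or ChangeMe" is recorded simply as "YES", and the options themselves are changed in Data Services, not through this code.

```python
from typing import Dict

# Defaults of a newly created URAC transform, taken from the table above.
# "YES or ChangeMe" is simplified to "YES" for this illustration.
URAC_DEFAULTS: Dict[str, str] = {
    "ASSIGNMENT_OPTION/ENABLE_DPV": "YES",
    "ASSIGNMENT_OPTION/ENABLE_LACSLINK": "YES",
    "NON_CERTIFIED_OPTION/DISABLE_CERTIFICATION": "NO",
}

# Values listed in the table for installations without DPV or LACSLink.
URAC_WITHOUT_DPV_LACSLINK: Dict[str, str] = {
    "ASSIGNMENT_OPTION/ENABLE_DPV": "NO",
    "ASSIGNMENT_OPTION/ENABLE_LACSLINK": "NO",
    "NON_CERTIFIED_OPTION/DISABLE_CERTIFICATION": "YES",
}


def options_to_change(current: Dict[str, str], target: Dict[str, str]) -> Dict[str, str]:
    """Return only the options whose current value differs from the target value."""
    return {name: value for name, value in target.items() if current.get(name) != value}


if __name__ == "__main__":
    for option, value in options_to_change(URAC_DEFAULTS, URAC_WITHOUT_DPV_LACSLINK).items():
        print(f"Set {option} to {value}")
```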

Step 2: Install the Data Quality Management for Informatica plug-in and samples
The next installation step is to install the Data Quality Management for Informatica plug-in to your Informatica Server and client machine(s), and to install the sample workflows to your Data Services configuration.
Install server plug-in to your Informatica Server(s)

You must install the server plug-in on each instance of the Informatica Server that you intend to use with this product.

To install the server plug-in to your Informatica Server:
1. From the installation package, run the file Server_setup.exe. The Welcome screen appears.
2. Click Next. The License Agreement window appears.
3. Accept the license agreement and click Next. The Ready to Install window appears.
4. Click Install to install the plug-in.
5. When the installation is complete, you are prompted to restart your computer. You don't have to restart it now, but you must restart your computer before you can run Data Quality Management for Informatica.

UNIX install: These instructions are for Windows installations of the server plug-in. To install the plug-in on UNIX, run the executable and follow the instructions provided. Also, if you are performing a UNIX install, proceed to "Add to environment variables" and "For the user who launches the PowerCenter Client" before continuing with "Step 3: Register the DQM_Plugin.xml file".

Install client plug-in to your Informatica client(s) and copy the Data Services workflow samples

You must install the client plug-in to each of your Informatica client systems. When run on the machine that contains your Data Services installation, this installation routine also imports the sample Data Services workflows to the configuration. For the purposes of using this product, an Informatica client is defined as any system that runs the Informatica PowerCenter Designer.

On the machine containing an Informatica client or your Data Services installation:
1. From the installation package, run setup.exe. The Welcome screen appears.
2. Click Next. The License Agreement window appears.
3. Accept the license agreement and click Next. The PowerCenter Location window appears. If the installation routine cannot detect an installation of the Informatica PowerCenter client on this machine, it skips this window and proceeds to the part of this procedure that installs the Data Services dataflows.
4. Choose the default or click Change to browse to a destination folder where you want this product's files to be installed, then click Next. The first Data Services Connection screen appears.
5. Enter the Data Services Access Server host name and port number in the appropriate fields, then click Next. The next Data Services Connection screen appears.
6. Enter the Data Services Web Services URL. The default value uses the Data Services Access Server IP address you gave in the previous step. Click Next. The Data Services Repository Connection Parameters screen appears.
7. In the Repository Connection Parameters screen, enter the connection details. Click Next. The RDBMS Selection screen appears.
8. In the RDBMS Selection window, choose the database type you intend to use for the creation of staging tables, and click Next. The window that follows this step varies, depending on which database type you chose. Each database type requires different connection data.
   In the Connection Parameters window that appears for your database type, enter the database connection information and click Next. The Ready to Install window appears.
9. In the Ready to Install the Program window, click Install. The windows update for each step in the installation routine.
10. Click Install to install the plug-in or the workflows.
11. When the installation is complete, you are prompted to restart your computer. You must restart your computer before you can run this product.
The Data Services sample workflows

The following table describes the Data Services workflows for this product.

Workflow: Job_Realtime_DQ_Informatica_datacleanse
Description: This real-time job is used for the name cleansing. It standardizes name fields such as First Name, Middle Name, Last Name, and Job title.

Workflow: Job_Realtime_DQ_Informatica_addresscleanse
Description: This real-time job is used for the address cleansing. It cleanses address fields such as Address, City, Post Code, State, and Country of different countries. All country engines are enabled in this job.

Workflow: Job_Realtime_DQ_Informatica_transaction_match
Description: This job is used for the transactional matching. It takes the first record as a master record and the subsequent records as candidate records. Before doing the matching, it performs name and address cleansing. In the output, it gives the cleansed record and the match score of candidate records with the master record.

Workflow: Job_Batch_DQ_Informatica_batch_match
Description: This batch job is used for the batch matching. It creates the different match groups and, according to those match groups, it creates the master record and candidate record. It outputs the record in cleansed format. It also provides the group number and the match score for every subordinate record.
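
To make the master and candidate idea behind transactional matching concrete, here is a small Python sketch. It is only a conceptual stand-in: the real cleansing and matching are performed by Data Services, while this example uses a trivial normalization step and the general-purpose difflib.SequenceMatcher from the Python standard library as a substitute similarity measure. The function names and the sample records are hypothetical.

```python
from difflib import SequenceMatcher
from typing import Dict, List


def normalize(record: Dict[str, str]) -> str:
    """Stand-in for cleansing: join the fields, lowercase, collapse whitespace."""
    return " ".join(" ".join(record.values()).lower().split())


def score_candidates(records: List[Dict[str, str]]) -> List[int]:
    """Score every record after the first against the first (master) record, 0 to 100."""
    if not records:
        return []
    master = normalize(records[0])
    return [
        round(100 * SequenceMatcher(None, master, normalize(candidate)).ratio())
        for candidate in records[1:]
    ]


if __name__ == "__main__":
    sample = [
        {"name": "John Smith", "city": "La Crosse"},    # master record
        {"name": "JOHN  SMITH", "city": "la crosse"},   # candidate, same after normalization
        {"name": "Maria Garcia", "city": "Madrid"},     # candidate, unrelated
    ]
    print(score_candidates(sample))
```

The first record is always treated as the master, mirroring the description of the transactional match job; the scores produced by SequenceMatcher are not comparable to the match scores that Data Services returns.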

Step 3: Register the DQM_Plugin.xml file


Before you can use Data Quality Management for Informatica's custom transformation, you must register it as a plug-in in the repository, as you would with any plug-in for Informatica PowerCenter. For more information about registering plug-ins, see your Informatica documentation.

Register to the repository

To register the plug-in to the repository on Informatica:
1. In the Informatica PowerCenter Administration Console, place the repository in exclusive mode. See the Informatica documentation for details on this step.
2. In the plug-ins tab for this node, click Register Repository Server plug-in. The Choose file window appears.
3. Browse to the location of the DQM_Plugin.xml plug-in. Select the plug-in, and click Open. The Console displays a screen to register the plug-in.
4. Enter the Repository Username and Password. Check the box for Update existing plug-in registration if you have previously registered this plug-in. Click OK.
5. The Console indicates that it is importing the plug-in. Upon completion, place the repository back into normal mode.

Step 4: Verify the Data Quality Management for Informatica toolbar


After you install the client software, the Data Quality Management for Informatica toolbar should appear in your PowerCenter Designer application. There are four toolbar items for Data Quality Management for Informatica:
- Active data cleanse transform
- Passive data cleanse transform
- Active transactional match transform
- Active batch match transform

If the toolbar does not appear, perform the following steps:
1. In the PowerCenter Designer, open the appropriate repository.
2. Select Tools > Customize. The Customize window opens.
3. Select the Transformations and the Advanced Transformations options.
4. Click OK.

Step 5: Import and set up the sample mappings\workflows


The sample mappings\workflows are installed as an XML object. This object includes sources, targets, and transformations. You must import and set up the sample mappings\workflows for use with Data Quality Management for Informatica.
Import the sample mappings

To import the sample mappings\workflows, follow the normal steps (listed below) for importing an object.
1. In PowerCenter Repository Manager, choose Folder > Create. The Create Folder window opens.
2. In the Name box, enter DQMForInformaticaSamples.
3. Set the appropriate owners, groups, and permissions for the folder and click OK.
4. Choose Repository > Import Objects. The Import Wizard window opens.
5. Click Browse to locate and select the DQM_FOR_INFORMATICA_SAMPLE_WORKFLOWS.XML file and click OK. By default, the sample mappings are installed in c:\Program Files\Business Objects\Data Quality Mgmt for Informatica\SampleWorkflows.
6. Click Next. The wizard's Select Objects to Import window opens.
7. Click Add All to import all sample sources, targets, transformations, mappings, sessions, and workflows. Click Next. The wizard's Resolve Conflicts in Folder window opens.
8. Resolve any conflicts, as necessary. When finished, click Close.
9. Click Import.
Set up the sample mappings

You set up the sample mappings by configuring the DQM Properties in the Edit Transformations window. This product provides five sample mappings:
- m_ADC
- m_PDC
- m_ATM
- m_ABM_SRC_TO_STG
- m_ABM_PROCESS

You must repeat this procedure for each sample mapping.

To set up the sample mappings:
1. In the Mapping Designer, open one of the sample mappings (such as m_ADC).
2. Right-click the transformation and select Edit. The Edit Transformations window opens.
   Tip: The first time that you open the Edit Transformations window, select the corner of the window and drag it so that you can view all of the tabs.
3. Click the DQM Properties tab.
4. You must provide information for the active fields. A description of any highlighted or selected option appears in the Description box.
5. Set the properties as necessary. See the table in the following section to understand which properties are required or optional.
6. Click Verify ports to check that the fields in the specified Data Services dataflow correspond to the input and output ports on the Informatica transform. Data Services must be running for the verification to be successful.
7. Click Apply and then OK.
8. Repeat this process for each of the sample mappings.
Option descriptions

This table describes each of the properties available for the Data Quality Management for Informatica custom transformations. Descriptions also appear in the Description box in the Edit Transformation window.
Option: Description

Real time service: Name of the Data Services real-time service with which this transformation runs. You must ensure that this service is published as a Web Service.
Break group key field: Port to be used as the break group key during match operations.
Number of threads: The number of threads used to send and receive data with Data Services.
Polling interval: The number of seconds to wait before each fetch of the status of the batch job.
Number of records in a bundle: The number of records sent and received with Data Services per call.
Processing timeout: Number of seconds to wait for batch match to complete before the process times out.
Maximum records in a break group: The maximum number of records sent and received with Data Services per break group.
Connection...: Produces a window that allows you to enter the connection information used to connect to Data Services. For more information, see Connection information on page 42.
Logging...: Produces a window that allows you to enable and configure this product's logging for events affecting the Informatica client server. For more information, see Logging on page 41.
Verify ports: Click to check that the input and output port schema within the Informatica transformation match the input and output field schema of the Data Services real-time service. This process compares the name and the length of each input and output port of the Informatica transformation and the Data Services job.
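
The Verify ports check is described as comparing the name and the length of each port with the corresponding Data Services field. The following is a minimal sketch of that kind of comparison, not the product's actual implementation. The `Port` record, the `find_mismatches` helper, and the assumption that ports are paired by position are all hypothetical and introduced only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Port:
    """Hypothetical stand-in for a port or field: a name plus a length."""
    name: str
    length: int


def find_mismatches(informatica_ports, data_services_fields):
    """Return readable differences between two ordered port lists.

    Only name and length are compared, and entries are paired by
    position (an assumption made for this sketch).
    """
    problems = []
    if len(informatica_ports) != len(data_services_fields):
        problems.append(
            f"port count differs: {len(informatica_ports)} "
            f"vs {len(data_services_fields)}"
        )
    pairs = zip(informatica_ports, data_services_fields)
    for position, (port, field) in enumerate(pairs, start=1):
        if port.name != field.name:
            problems.append(
                f"position {position}: name {port.name!r} != {field.name!r}"
            )
        if port.length != field.length:
            problems.append(
                f"position {position}: length {port.length} != {field.length}"
            )
    return problems
```

An empty result means the two schemas agree on names and lengths. Anything else lists what would need to be corrected before the transformation is run.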


Step 6: Specify the Java class path


This product uses several Java Archive (JAR) files. Within your Informatica Server(s), you must set the Java class path for these JAR files in the Informatica Workflow Manager.

For Windows, the installer copies these JAR files to $PMRootDir\Business Objects. For UNIX, the installer copies these JAR files to $PMRootDir/businessobjects.

To specify the Java class path:
1. From the Informatica Server Task Developer workspace, double-click the session. The Edit Task window opens.
2. Click the Properties tab.

3. In the Java Classpath attribute, enter the following values:

For Informatica Server on Windows:
$PMRootDir\Business Objects\DQM_Link.jar;$PMRootDir\Business Objects\UniversalConnector.jar;$PMRootDir\Business Objects\axiom-api-1.2.5.jar;$PMRootDir\Business Objects\axiom-impl-1.2.5.jar;$PMRootDir\Business Objects\axis2-kernel-1.3.jar;$PMRootDir\Business Objects\commons-codec-1.3.jar;$PMRootDir\Business Objects\commons-httpclient-3.0.1.jar;$PMRootDir\Business Objects\commons-logging-1.1.jar;$PMRootDir\Business Objects\rtsClient.jar;$PMRootDir\Business Objects\stax-api-1.0.1.jar;$PMRootDir\Business Objects\UCDataPlayer.jar;$PMRootDir\Business Objects\wsdl4j-1.6.2.jar;$PMRootDir\Business Objects\wstx-asl-3.2.1.jar;$PMRootDir\Business Objects\xalan-2.7.0.jar;$PMRootDir\Business Objects\XmlSchema-1.3.2.jar

For Informatica Server on UNIX:
$PMRootDir/businessobjects/DQM_Link.jar;$PMRootDir/businessobjects/UniversalConnector.jar;$PMRootDir/businessobjects/axiom-api-1.2.5.jar;$PMRootDir/businessobjects/axiom-impl-1.2.5.jar;$PMRootDir/businessobjects/axis2-kernel-1.3.jar;$PMRootDir/businessobjects/commons-codec-1.3.jar;$PMRootDir/businessobjects/commons-httpclient-3.0.1.jar;$PMRootDir/businessobjects/commons-logging-1.1.jar;$PMRootDir/businessobjects/rtsClient.jar;$PMRootDir/businessobjects/stax-api-1.0.1.jar;$PMRootDir/businessobjects/UCDataPlayer.jar;$PMRootDir/businessobjects/wsdl4j-1.6.2.jar;$PMRootDir/businessobjects/wstx-asl-3.2.1.jar;$PMRootDir/businessobjects/xalan-2.7.0.jar;$PMRootDir/businessobjects/XmlSchema-1.3.2.jar

Most PDF viewers offer the capability to copy text.
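
Because the class path value is long and easy to mistype, it can help to generate it rather than retype it. The following is a minimal sketch, assuming the JAR names and installation directories given in this step. The `build_classpath` helper is hypothetical, and the JAR names should be confirmed against the files the installer actually copied.

```python
# JAR file names as listed in this step; verify them against the
# installation directory before relying on the generated value.
JAR_NAMES = [
    "DQM_Link.jar",
    "UniversalConnector.jar",
    "axiom-api-1.2.5.jar",
    "axiom-impl-1.2.5.jar",
    "axis2-kernel-1.3.jar",
    "commons-codec-1.3.jar",
    "commons-httpclient-3.0.1.jar",
    "commons-logging-1.1.jar",
    "rtsClient.jar",
    "stax-api-1.0.1.jar",
    "UCDataPlayer.jar",
    "wsdl4j-1.6.2.jar",
    "wstx-asl-3.2.1.jar",
    "xalan-2.7.0.jar",
    "XmlSchema-1.3.2.jar",
]


def build_classpath(jar_dir: str, dir_separator: str) -> str:
    """Join every JAR under jar_dir into one semicolon-separated value."""
    return ";".join(f"{jar_dir}{dir_separator}{name}" for name in JAR_NAMES)


# Directory names follow the installer defaults stated in this guide.
windows_value = build_classpath("$PMRootDir\\Business Objects", "\\")
unix_value = build_classpath("$PMRootDir/businessobjects", "/")

print(windows_value)
print(unix_value)
```

The sketch joins entries with semicolons for both platforms because that is how this guide lists the values. The printed strings can be pasted into the Java Classpath attribute.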


Step 7: Verify the installation


After installing and configuring the software, you should set up and run the sample workflows provided with this product to verify that the installation process was successfully completed. First, you must shut down and restart your computer if you haven't already done so during the installation procedure.

This product provides the following sample workflows:
- wf_ADC
- wf_PDC
- wf_ATM
- wf_ABM

To verify the installation by running the sample mappings:
1. In PowerCenter Workflow Manager, connect your server to your repository.
2. Run one of the workflows (for example, wf_ADC) using your PowerCenter Workflow Manager. If the session completes successfully, Data Quality Management for Informatica was properly installed for that module. If the session does not complete properly, verify that you have followed all of the installation steps.
3. Repeat this process for each sample workflow.


Install Data Quality Management for Informatica on UNIX


The installation procedures for UNIX are similar to the procedures for Windows. For installation steps that apply to both Windows and UNIX, see Install Data Quality Management for Informatica (Windows and UNIX) on page 12. Read this section for any additional installation steps, or for information that relates directly to your UNIX installation.
Installation directories

See About installation directories on page 11 to learn about directory variables that you can use when entering the default location for Informatica Server and PowerCenter.

Permissions

The owner of your Informatica Server process should perform the installation if possible. Otherwise, you must go back and set appropriate permissions on the Data Quality Management for Informatica files and directories after the installation.

Important: Before running the Informatica Server process (PMServer), you should change your default permissions for your Informatica user to rw-rw-r--. When you run the server process, certain files are overwritten and the server process owner becomes the owner of those files. Using rw-rw-r-- permissions ensures that people in the group will still have write access to these files.
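
On most UNIX systems, the default permissions for newly created files are determined by the user's umask. The sketch below shows the arithmetic behind the rw-rw-r-- recommendation, assuming that files are requested with the usual mode 0o666. The `default_file_mode` helper is hypothetical and exists only to illustrate the calculation.

```python
import stat


def default_file_mode(umask: int) -> str:
    """Permission string for a new regular file created under this umask.

    Assumes the creating program asks for mode 0o666, which is the
    common default; the umask then clears the bits it contains.
    """
    mode = 0o666 & ~umask
    return stat.filemode(stat.S_IFREG | mode)


print(default_file_mode(0o002))  # -rw-rw-r--  (group keeps write access)
print(default_file_mode(0o022))  # -rw-r--r--  (group loses write access)
```

Under these assumptions, a umask of 002 for the Informatica user yields the rw-rw-r-- permissions recommended here.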

After you perform Step 4: Verify the Data Quality Management for Informatica toolbar on page 18, you must add to your environment variables and install the Linux C++ runtime upgrade.
Add to environment variables

Add or edit the following environment variables in your .login or .profile file (in the Bourne shell), or in the .login or .cshrc file (in the C shell). Note: If you also use a stand-alone (batch) Business Objects or Firstlogic application on the same machine, you may encounter conflicts in some situations. To avoid this conflict, we recommend that you set up different user profiles: one for Data Quality Management for Informatica and one for the stand-alone application.

For the user who launches the PowerCenter Client

In the PATH environment variable, the user who launches the PowerCenter client must add the path <INFA_HOME>/java/jre/bin/client.


Install C++ runtime upgrade (Linux and HP-UX)

If you use Red Hat Linux, you must install the GNU Standard C++ Library, version 3.2.3 (for libstdc++.so.5 support). We have found a technical issue requiring this new C++ runtime support. To learn more, go to http://www.redhat.com/apps/download and search for the keyword libstdc++. Be sure to review Red Hat's specific instructions in the System Environment Libraries section related to dependencies. If you need further assistance, contact Red Hat.

If you use HP-UX, you must install the HP-UX aC++ runtime environment version A.03.26 or later. To learn more and download this runtime environment, go to http://docs.hp.com/en/5187-2783/ch15s03.html.

Continue with Step 5: Import and set up the sample mappings\workflows on page 19 to complete your installation.


If you also use a stand-alone SAP BusinessObjects (Firstlogic) product


Some users of Data Quality Management for Informatica also use a stand-alone (batch) SAP BusinessObjects (Firstlogic) application. If Data Quality Management for Informatica and the stand-alone application are on the same machine, you may encounter conflicts in some situations.
What causes the conflict?

Your PATH and PW_PATH environment variables point to locations of your Business Objects products. If you try to run Data Quality Management for Informatica but your environment variable points to the stand-alone application location first, Data Quality Management for Informatica will try to use the software installed for the stand-alone application, and vice versa.

Different user profiles

To avoid this conflict, Business Objects recommends that you set up different user profiles: one for Data Quality Management for Informatica and one for the stand-alone application. The environment variables in the Data Quality Management for Informatica user profile should point to the location of the Data Quality Management for Informatica software. For UNIX, this location is specified in Add to environment variables on page 26. For Windows, the environment variables are automatically set up during installation. The environment variables in the stand-alone application user profile should point to the location of the stand-alone application.
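
To see which installation a given profile would pick up, you can check which of the two product directories appears first in PATH. The following is a minimal sketch of that check. The directory names are hypothetical examples (only /postware/ace appears elsewhere in this guide), and `first_matching_dir` is an illustrative helper, not part of the product.

```python
def first_matching_dir(path_value, candidates, separator=":"):
    """Return the first PATH entry that is one of candidates, else None."""
    wanted = set(candidates)
    for entry in path_value.split(separator):
        if entry in wanted:
            return entry
    return None


# Hypothetical install locations, for illustration only.
DQM_DIR = "/opt/dqm_informatica/bin"
STANDALONE_DIR = "/postware/ace"

# A PATH in which the stand-alone application is listed first.
example_path = ":".join(["/usr/bin", STANDALONE_DIR, DQM_DIR])
winner = first_matching_dir(example_path, [DQM_DIR, STANDALONE_DIR])
print(winner)
```

In this example the stand-alone directory is found first, which is the situation that causes the conflict described above. On Windows, pass `separator=";"`.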

Scripts on UNIX

If you run Data Quality Management for Informatica on a UNIX system, you can run Data Quality Management for Informatica and the stand-alone application with the same user profile. To do this, your login script must initially set the environment variables as appropriate for Data Quality Management for Informatica. You must also create a script (or enter the appropriate command) that resets your environment variables to the settings required by the stand-alone application.
1. Log in with the profile set up for Data Quality Management for Informatica.
2. Start the Informatica Server process.
3. Run the script (or enter the appropriate commands) to set your environment variables for your stand-alone application. For example:
   setenv PATH /postware/ace
   setenv PW_PATH $PATH
4. Run your stand-alone application, as needed.
5. Repeat these steps each time you log on or when you want to restart the Informatica Server process.


Batch files on Windows

On Windows, you can use batch files to run the stand-alone application and Data Quality Management for Informatica from the same system. Follow these steps:
1. Create a batch file that sets the proper environment variables for your stand-alone application and launches the stand-alone application. The following example shows a batch file for launching an ACE job file.
   @echo off
   rem These two commands insert the appropriate paths to the ACE-related environment variables before the other paths.
   set path=C:\pw\adm;C:\pw\ace;%path%
   set pw_path=C:\pw\adm;C:\pw\ace;%pw_path%
   rem This line launches the ACE job-file program.
   pwace C:\pw\acejob.ace
2. Create additional batch files for each stand-alone application you want to run.

When you use this procedure, the application you run from the batch file uses the environment you set up in the batch file. All other applications you run use the normal environment variables set up for the user who is logged in. A command-prompt window remains open until you close the application that you launched from the batch file.

You may want to create a shortcut to the batch files and place the shortcut on your desktop or in your Start menu. You can then double-click the shortcut to run the batch file and launch the stand-alone application.


Chapter 3: Add and use Data Quality Management for Informatica transformations

After you've installed the SAP BusinessObjects Data Quality Management, version for Informatica PowerCenter software, there are still other steps necessary before you can use and run Informatica processes that use this product. Read this chapter to get an idea of which type of transformation, active or passive, is right for your task. Then refer to the appropriate sections of this chapter for information about setting up and using a transformation.


Data Quality Management for Informatica transformations


Before incorporating Data Quality Management for Informatica transformations into your Informatica mappings, you must decide which type of transformation to use. This section explains the reusability of transformations, and shows how to create transformations based on the four transformation types provided with this product.
Reusable versus nonreusable transformations

If you create a transformation in the Transformation Developer, you can reuse it by using instances of that transformation in mappings. If you make changes in the main transformation, the changes are inherited by each instance of that transformation. You cannot make any changes within an instance of the transformation.

When you create a transformation within a mapping using the Mapping Designer, it is not reusable in other mappings, but it is modifiable. You can make nonreusable transformations reusable by selecting the Reusable check box in the MetaData Extensions tab of the Transformation Developer. When you do this, Data Quality Management for Informatica places a reusable version of the transformation in the Transformations folder and makes your original transformation a non-modifiable instance.

If you want to fine-tune an instance of one of Data Quality Management for Informatica's transformations for the mapping process into which it is incorporated, you should create the transformation within the mapping.

Create a new transformation

You can create a transformation to use with this product that is blank, or one based on a real-time service or a batch job. When you create a transformation, you are provided the Create transformation window to identify your Data Services connection and choose the type of transformation.

Option descriptions

This table provides brief descriptions for each of the options on the Create transformation window.

Option: Description

Access server host: The name of your Data Services host machine. Data Quality Management for Informatica automatically completes this field based on your Data Services setup.
Access server port: The number of your Data Quality Access Server's port. Data Quality Management for Informatica automatically completes this field based on your Data Quality Server setup.
Web service URL: The encoded URL for the Data Services Web Service pointing to the WSDL.
Real time service: The name of the Data Services real-time service with which this transformation runs. You must ensure that this service is also published as a Web Service.
Create transform based on real time service or Create transform based on batch job: Click to create a new transformation based on the sample workflows provided with this product.
Create blank transformation: Click to create a new transformation that does not have ports or settings.

Transformations provided with this product

This product provides the following transformations:
- Passive data cleanse transformation
- Active data cleanse transformation
- Active transformational match transformation
- Active batch match transformation

Descriptions of these transformations and the procedure for creating them are in the following sections.

Passive data cleanse transformation

For flexibility in mappings and ease of setup, Data Quality Management for Informatica's passive data cleanse transformation is the best choice. Consider using this type of transformation if you are not using the match capability of Data Services and if you are new to Data Quality Management for Informatica. This transformation outputs one row for every input row.

To create a passive data cleanse transformation:
1. Open the Informatica PowerCenter Designer and the appropriate repository.
2. Open the folder in which you want to work.
3. Choose Tools > Transformation Developer.
4. On the Data Quality Management for Informatica toolbar, click the DQM passive data cleanse transformation icon. If the toolbar is not displayed, see Step 4: Verify the Data Quality Management for Informatica toolbar on page 18 (for Windows) or Step 4: Verify the Data Quality Management for Informatica toolbar on page 18 (for UNIX).
5. Click the area of the Transformation Developer workspace or the Mapping Designer where you want to place the transformation. The Create Transformation window opens.
6. Make any necessary changes to the available fields for the Data Services Access Server host name, port number, Web Services URL, and real time service. (Make sure that this real time service is published as a Web Service within Data Services.)
7. To create a blank transformation, click Create blank transformation. (You must later manually create input and output ports that match the input and output columns on the Data Services job in order, datatype, and length.)
   or
   To create a transformation based on an existing dataflow, click Create transform based on real time service. (The transformation has its input and output ports defined by the corresponding Data Services job.)

The transformation is created and placed on your Transformation Developer workspace. You must create an Informatica mapping to use this transformation. For details about this transform, examine the m_PDC sample mapping that ships with this product.
Active data cleanse transformation

The active data cleanse transform provides the best performance in data cleanse operations. Consider using this type of transformation if you are not using the match capability of Data Services, but are able to configure a transformation more complicated than the passive data cleanse transformation.

To create an active data cleanse transformation:
1. Open the Informatica PowerCenter Designer and the appropriate repository.
2. Open the folder in which you want to work.
3. Choose Tools > Transformation Developer.
4. On the Data Quality Management for Informatica toolbar, click the DQM active data cleanse transformation icon. If the toolbar is not displayed, see Step 4: Verify the Data Quality Management for Informatica toolbar on page 18 (for Windows) or Step 4: Verify the Data Quality Management for Informatica toolbar on page 18 (for UNIX).
5. Click the area of the Transformation Developer workspace or the Mapping Designer where you want to place the transformation. The Create Transformation window opens.
6. Make any necessary changes to the available fields for the Data Services Access Server host name, port number, Web Services URL, and real time service. (Make sure that this real time service is published as a Web Service within Data Services.)
7. To create a blank transformation, click Create blank transformation. (You must later manually create input and output ports that match the input and output columns on the Data Services job in order, datatype, and length.)
   or
   To create a transformation based on an existing dataflow, click Create transform based on real time service. (The transformation has its input and output ports defined by the corresponding Data Services job.)

The transformation is created and placed on your Transformation Developer workspace. You must create an Informatica mapping to use this transformation. For details about this transform, examine the m_ADC sample mapping that ships with this product.

Active transformational match transformation

The active transformational match transform provides real-time match capability.

To create an active transformational match transformation:
1. Open the Informatica PowerCenter Designer and the appropriate repository.
2. Open the folder in which you want to work.
3. Choose Tools > Transformation Developer.
4. On the Data Quality Management for Informatica toolbar, click the DQM active transformational match transformation icon. If the toolbar is not displayed, see Step 4: Verify the Data Quality Management for Informatica toolbar on page 18 (for Windows) or Step 4: Verify the Data Quality Management for Informatica toolbar on page 18 (for UNIX).
5. Click the area of the Transformation Developer workspace or the Mapping Designer where you want to place the transformation. The Create Transformation window opens.
6. Make any necessary changes to the available fields for the Data Services Access Server host name, port number, Web Services URL, and real time service. (Make sure that this real time service is published as a Web Service within Data Services.)
7. To create a blank transformation, click Create blank transformation. (You must later manually create input and output ports that match the input and output columns on the Data Services job in order, datatype, and length.)
   or
   To create a transformation based on an existing dataflow, click Create transform based on real time service. (The transformation has its input and output ports defined by the corresponding Data Services job.)

The transformation is created and placed on your Transformation Developer workspace. You must create an Informatica mapping to use this transformation. For details about this transform, examine the m_ATM sample mapping that ships with this product.

To use this transformation, place an Informatica sorter transformation just before it in the mapping. The sorter port that you designate as the primary sort key feeds the break group key field. In the Break group key field on the DQM Properties tab, specify the name of this transformation's port that is mapped to the sorter's primary sort port.
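
The following sketch illustrates why the sorter must precede the match transformation. It assumes, for illustration only, that a break group is formed from consecutive rows that share the same break key value. The records and the `zip` field are made-up sample data, and `break_groups` is a hypothetical helper, not part of the product.

```python
from itertools import groupby
from operator import itemgetter

# Made-up sample rows; "zip" plays the role of the break group key port.
records = [
    {"zip": "54601", "name": "J. Smith"},
    {"zip": "10001", "name": "A. Jones"},
    {"zip": "54601", "name": "John Smith"},
]


def break_groups(rows, key_field):
    """Sort rows on the break key, then collect consecutive equal keys."""
    ordered = sorted(rows, key=itemgetter(key_field))
    return [
        (key, list(group))
        for key, group in groupby(ordered, key=itemgetter(key_field))
    ]


sorted_groups = break_groups(records, "zip")
unsorted_group_count = len(list(groupby(records, key=itemgetter("zip"))))

print([key for key, _ in sorted_groups])  # ['10001', '54601']
print(unsorted_group_count)               # 3
```

With sorted input, both 54601 rows land in the same group and can be compared with each other. Without the sort, the same rows would be split across separate groups.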
Active batch match transformation

The active batch match transform provides batch match capability.

To create an active batch match transformation:
1. Open the Informatica PowerCenter Designer and the appropriate repository.
2. Open the folder in which you want to work.
3. Choose Tools > Transformation Developer.
4. On the Data Quality Management for Informatica toolbar, click the DQM active batch match transformation icon. If the toolbar is not displayed, see Step 4: Verify the Data Quality Management for Informatica toolbar on page 18 (for Windows) or Step 4: Verify the Data Quality Management for Informatica toolbar on page 18 (for UNIX).
5. Click the area of the Transformation Developer workspace or the Mapping Designer where you want to place the transformation. The Create Transformation window opens.
6. Make any necessary changes to the available fields for the Data Services Access Server host name, port number, Web Services URL, and batch job name. (Make sure that this batch job is published as a Web Service within Data Services.)
7. To create a blank transformation, click Create blank transformation. (You must later manually create input and output ports that match the input and output columns on the Data Services job. You can create them with any name. For simplicity, use I_Input as the input port and O_Output as the output port. The datatype for both ports must be String.)
   or
   To create a transformation based on an existing dataflow, click Create transform based on real time service. (The transformation has its input and output ports defined by the corresponding Data Services job.)

The transformation is created and placed on your Transformation Developer workspace. You must create an Informatica mapping to use this transformation. For details about this transform, examine the m_ABM sample mapping that ships with this product.

This transformation enables this product to execute the associated batch job. Make sure that the RDBMS table used in the Data Services batch job can be accessed from Informatica. In Informatica, both the input and output tables of the Data Services batch job are treated as staging tables.

The following is the ETL design for using the active batch match transformation.
1. Create a mapping in Informatica that loads all source data to the Input Stage table. This input stage table is the same RDBMS table used in the Data Services batch job as the input table. Refer to the m_ABM_SRC_TO_STG sample mapping shipped with this product for detailed information.
2. Create a second mapping that has a dummy source file. This dummy source file should have only one record of one column, with a value such as Start. (This value is independent, so you can specify any other value.)
3. In this second mapping, you should use the active batch match transformation immediately after the dummy source. After the active batch match transformation, you can use an Informatica router transformation that leads to a SQL transformation in case of success. In this SQL transformation, access the output staging table. This output staging table is the same RDBMS table used in the Data Services batch job as the output table. Once all the matched records are returned to Informatica, you can send them directly to the target or the next stage for further processing. For more details, refer to the m_ABM_PROCESS sample mapping provided with this product.
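
While the batch job runs, the transformation waits for it to finish. The Polling interval and Processing timeout properties in the DQM Properties option table control that wait. The following sketch shows one way such a wait loop could behave, not how the product implements it. The `get_status` callable and its status strings are hypothetical stand-ins for however the job status is really fetched.

```python
import time


def wait_for_batch_job(get_status, polling_interval, processing_timeout,
                       sleep=time.sleep, clock=time.monotonic):
    """Wait polling_interval seconds before each status fetch.

    get_status is a hypothetical callable that returns "running",
    "succeeded", or "failed". Returns the final status, or "timed out"
    once processing_timeout seconds have passed.
    """
    deadline = clock() + processing_timeout
    while clock() < deadline:
        sleep(polling_interval)
        status = get_status()
        if status != "running":
            return status
    return "timed out"
```

A result of "succeeded" corresponds to the success branch of the router transformation in the design above. The `sleep` and `clock` parameters exist only so the loop can be exercised without real delays.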


If you want to execute the active batch match workflow (wf_ABM), then make these changes:
1. Open the wf_ABM workflow.
2. Go to the Task Developer and edit the s_ABM_SRC_TO_STG session.
3. Go to Mapping > Targets and specify the Relational Connection. This setting should be the same as in the Data Services configuration. For example, if you are using Oracle as the Data Services configuration and DQM_Win32 as the user, then this relational connection should be mapped with this information. Make sure that you have created the staging tables for this configuration. You can execute the batch job Job_DQ_Informatica_CreateStageTables imported from this product. This batch job creates both the input and output staging tables for that configuration.
4. Click OK and save.
5. Edit the s_ABM_PROCESS session. Click Mapping > Transformations > TR_ABM_STG_TO_TGT_SQL and specify the Relational Connection. This connection should be the same as in the Data Services configuration.


Chapter 4: Troubleshoot Data Quality Management for Informatica

If you can't get something to work properly within SAP BusinessObjects Data Quality Management, version for Informatica PowerCenter, this section may help you correct the problem. This section deals mostly with issues within Data Quality Management for Informatica itself. At times, issues may actually be caused by your Informatica applications. Be sure to consult the Informatica PowerCenter documentation as well.


Set the tracing level


In addition to setting the DQM properties, you may also want to use the Edit Transformations window to control the level of information recorded in the Informatica log files when you run a mapping. You control this through the Tracing Level setting on the Properties tab.
1. From the Transformation Developer workspace, double-click the transformation and click the Properties tab of the Edit Transformations window.
2. In the Tracing Level property, set the appropriate level.

In addition to the normal logging produced by any of the other settings, the Tracing Level Verbose Data creates two files, <Transformation Name>_Reader.log and <Transformation Name>_Writer.log, in the Informatica Server's bin directory.


Logging
The DQM Properties tab of the Edit Transformations screen gives you access to the logging capabilities of this product. When you click the Logging... button, the following screen appears.

In this screen you can configure the following settings:

Option: Description

Enable logging: Controls whether logging is enabled.
Client component log directory: Specifies the location of the client log files.
Server component log directory: Specifies the location of the server log files.
Enable playback: Controls use of data playback files, a more detailed version of logging. This setting should remain unchecked unless used in an engagement with Customer Support.


Connection information
The DQM Properties tab of the Edit Transformations screen gives you access to the connection settings of this product. When you click the Connection... button, the following screen appears.

In this screen you can configure the following settings:

Option: Description

Access server host: The name of your Data Services host machine.
Access server port: The number of your Data Quality Access Server's port.
Web service URL: The encoded URL for the Data Services Web Service pointing to the WSDL.
Time out: The number of seconds to try to connect to the Data Services Access server before failure.
Retry attempts: The number of times to retry connection to the Data Services Access server after failure.
Delay between attempts: The number of seconds in between retry attempts.
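
The Retry attempts and Delay between attempts settings interact as a simple retry loop. The following sketch shows that interaction under stated assumptions and is not the product's implementation. The `connect` callable is a hypothetical stand-in for the connection to the Data Services Access server, and it is assumed to raise `ConnectionError` when its own time-out expires.

```python
import time


def connect_with_retries(connect, retry_attempts, delay_between_attempts,
                         sleep=time.sleep):
    """Try connect() once, then retry up to retry_attempts more times.

    connect is a hypothetical callable that raises ConnectionError on
    failure; enforcing the per-attempt time-out is left to it.
    """
    last_error = None
    for attempt in range(1 + retry_attempts):
        if attempt > 0:
            # Pause only between attempts, never before the first one.
            sleep(delay_between_attempts)
        try:
            return connect()
        except ConnectionError as error:
            last_error = error
    raise last_error
```

With Retry attempts set to 3 and Delay between attempts set to 2, a connection that succeeds on the third try costs two pauses of 2 seconds in addition to the time spent in the failed attempts.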


Data Quality Management for Informatica transformation/mapping issues


The sample mappings are failing in the Workflow Manager

If you try to run the sample mappings through the Workflow Manager but they fail to complete, there are a few things to check.

Did you install in the default location?

If you installed the Data Quality Management for Informatica plug-in in a directory other than the default, you must edit some of the session properties and transformation properties. Edit the following items so that they point to the location where you installed the plug-in:
- Location of the source in the Source Options.
- Location of the target in the Target Options.

Follow these steps to get to these properties:
1. In the Workflow Manager, double-click the workflow in the Workflow Designer tool. The Edit Tasks window opens.
2. Select the Mapping tab.
3. Open the Source or Target folders to edit the locations.

Do the Data Quality Management for Informatica plug-in's files and directories have appropriate permissions?

Double-check that you have properly set the permissions on the Data Quality Management for Informatica plug-in files for the user who runs the session. For example, the session owner needs write access to the directory where the sample job files are located.

Is the Java class path set correctly?

Setting the Java class path is a new installation step in this version of Data Quality Management for Informatica. To ensure that you have set the Java class path correctly, see Step 6: Specify the Java class path on page 23.

Data isn't being processed when sent through a Data Quality Management for Informatica transformation

If your mappings are running, but data isn't being cleansed or consolidated, Data Quality Management for Informatica may not be properly installed. If you have other transformations that are working properly, you know that Data Quality Management for Informatica is installed properly. If you're unsure, try running one of the mappings in the sample mapping folder. See Appendix A: Sample mappings on page 44 for more information about running a mapping. If the samples don't work, verify that you've performed all of the installation procedures correctly as documented in Chapter 2: Installation on page 9.

Process terminating on signal/exception

You might receive this error if you improperly edit some of the properties in a custom transformation. To fix this error, delete the custom transformation from your mapping and add a new custom transformation. Do not modify the Procedure Name property or the ProcIndex properties from their initial values.
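The permissions check described earlier in this section (write access to the sample job directories for the session owner) can be scripted. The following Python sketch is an illustration only: the function name check_write_access is hypothetical, and the directories to pass in depend on where you installed the plug-in. Run it as the user who runs the session.

```python
import os


def check_write_access(paths):
    """Return the paths that the current user cannot write to.

    A path that does not exist is also reported, because os.access
    returns False for it.
    """
    return [p for p in paths if not os.access(p, os.W_OK)]
```

An empty result means that every listed path exists and is writable by the current user. Any path that is returned should be reviewed before you rerun the sample mappings.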


Sample active batch match workflow (wf_ABM) fails


If the active batch match workflow (wf_ABM) fails upon execution, follow these steps:
1. Open the wf_ABM workflow.
2. Go to the Task Developer and edit the s_ABM_SRC_TO_STG session.
3. Go to Mapping > Targets and specify the Relational Connection. This setting should be the same as in the Data Services configuration. For example, if you are using Oracle as the Data Services configuration and DQM_Win32 as the user, then this relational connection should be mapped with this information. Make sure that you have created the staging table for this configuration. To do so, you can execute the batch job Job_DQ_Informatica_CreateStageTables imported from this product. This batch job creates both the input and output staging tables for that configuration.
4. Click OK and save.
5. Edit the s_ABM_PROCESS session.
6. Click Mapping > Transformations > TR_ABM_STG_TO_TGT_SQL and specify the Relational Connection. This connection should be the same as in the Data Services configuration.


Glossary

active transformation
A transformation in which Informatica PowerCenter can export records in succession without waiting for a return record.

custom transformation
An Informatica PowerCenter mapping element, which receives every row of data through input ports, performs some operation on that data set, and returns a set of output rows through output ports. These are implemented with two types of transformations, active and passive.

Data Services
Data Services is the Business Objects technology that performs data quality operations on the back end of Data Quality Management for Informatica.

dictionaries
Files that contain a lexicon of words and phrases that BusinessObjects Data Quality XI for Informatica uses to identify, parse, and standardize address data. These files have a .dct extension.

directories
Files of national address data compiled by the USPS and used by Data Quality to match, assign, standardize, and verify addresses. These files have a .dir extension.

discrete fields
Input or output data that has separate fields for each piece of information, such as addresses and names.

metadata extensions
Key/value pairs defined within a transformation. These provide flexibility, allowing you to store different data types of information including a blob of information.

multiline fields
Input or output data that has certain address and name data combined in one field.

passive transformation
A transformation that outputs one row for every input row.

transformation
A repository object that generates, modifies, or passes data. BusinessObjects Data Quality XI for Informatica provides both active and passive custom transformations for Informatica PowerCenter.


Index

A
active batch match transformation, 35
active data cleanse transformation, 34
active transformation, 45
active transformational match transformation, 34
address cleansing
    international, 8
audience, 5

B
Business Objects (Firstlogic) software
    stand-alone products, 28

C
C++ runtime, 27
Create Transformation window, 33, 34, 35, 36
custom transformations
    definition, 45
    overview, 32

D
data cleansing, 8
Data Quality Management for Informatica
    installing, 9
    transformations, 32
    troubleshoot, 39
    using with Business Objects (Firstlogic) stand-alone products, 28
Data Quality Management for Informatica transformation toolbar, 33
Data Quality Management for Informatica transformations, 32
Data Services
    installation, 13
dataflow
    Data Services sample, 13
datatypes
    supported, 6
default installation location, 43
Delivery Point Validation, See DPV
dictionaries, 45
directories
    definition, 45
discrete fields
    definition, 45
DPV, 14
DQM Properties tab, 20

E
ETL, 8
extract, transform, and load, See ETL

F
failed sample mapping, 43

I
installation
    Data Services, 13
    UNIX, 26
international address cleansing transformation, 8

L
LACSLink, 14
log file
    tracing level, 40

M
mappings
    troubleshoot, 43
matching function, 8
metadata extensions, 45
multiline fields, 45

N
name cleansing, 8
    transformations, 8
non-reusable vs reusable transformations, 32

O
overview, 7
    Data Quality XI for Informatica, 8

P
passive data cleanse transformations, 33
passive transformation, 45
Preface, 5
Properties tab, 40

R
reusable vs non-reusable transformations, 32

S
sample mappings
    failed, 43
stand-alone Business Objects (Firstlogic) products, 28
string datatype, 6

T
toolbar, 33
tracing level, 40
    property, 40
transformation, 45
    active batch match, 35
    active data cleanse, 34
    active transformational match, 34
    create, 32
    creating and using, 31
    name cleansing, 8
    overview, 32
    passive data cleanse, 33
    reusable vs non-reusable, 32
    toolbar, 33
    troubleshoot, 43
transforms
    URAC, 14
troubleshoot, 39

U
UNIX
    setting up user profiles, 28
user profiles, 28

V
versions, 6

W
Windows
    setting up user profiles, 29
