How can we eliminate duplicate rows from a flat file?
Use a Sorter transformation. When you configure the Sorter transformation to treat output rows as distinct, it makes all ports part of the sort key and therefore discards the duplicate rows it compares during the sort operation.

Which is better among incremental load, normal load and bulk load?
If the database supports bulk loading from Informatica, bulk load is recommended for the initial load of the tables. After that, choose between the normal and incremental loading strategies according to the requirement. Of the two, normal load is generally preferred.

Why are dimension tables denormalized in nature?
Because a data warehouse must maintain historical data. For example, an employee's details, such as where he worked previously and where he works now, should all be kept in one table. A primary key on the employee ID would not allow a second record with the same ID, so a surrogate key (for example, one generated from an Oracle sequence) is used instead. The dimension then holds several records for the same employee number. They are not exact duplicates, because each one records a different point in the employee's history, and this is why dimensions are denormalized.

How do you retrieve records from a reject file? Explain with syntax or an example.
Every time you run the session, a reject file is created and the rejected records are written to it. You can correct the records in the reject file and load them to the target directly from that file using the reject loader.

How do you get the first 100 rows from a flat file into the target?
1. Use the test load option if you only need the rows for testing.
2. Put a counter or Sequence Generator in the mapping and filter on it.

What is data cleansing?
It is the polishing of data. For example, one subsystem may store gender as M and F while another stores it as MALE and FEMALE, so the data must be cleaned before it is added to the data warehouse. Addresses are another typical example: each subsystem may maintain the customer address differently, so an address cleansing tool may be needed to bring customer addresses into a clean and consistent form.

What is a transformation?
A transformation is a repository object that passes data to the next stage (the next transformation or the target), with or without modifying the data.

How do you create a single lookup transformation using multiple tables?
If you want a single set of lookup values to be used for multiple target tables, this can be done with an unconnected Lookup, which can supply the values to any target table wherever the business rule requires them.

Can we eliminate duplicate rows by using Filter and Router transformations?
Duplicate rows are eliminated by checking the Distinct option in the transformation properties (as in the Sorter). If the source is relational, a SQL query can also be used to enforce uniqueness.
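As a rough illustration of the Sorter answer above, the following Python sketch sorts rows using every column as part of the key and drops a row when it equals the one before it. It is only an analogy for how distinct output falls out of a full-key sort, not Informatica's implementation; the function name `sort_distinct` and the sample rows are made up for this example.

```python
def sort_distinct(rows):
    """Sort rows on all columns, then drop rows equal to their predecessor."""
    out = []
    for row in sorted(rows):  # tuples compare column by column, so every column is in the key
        if not out or out[-1] != row:  # duplicates are adjacent after sorting
            out.append(row)
    return out


# Hypothetical flat-file rows: (employee id, name)
rows = [("102", "Bob"), ("101", "Ann"), ("102", "Bob"), ("101", "Anne")]
distinct_rows = sort_distinct(rows)
```

Rows that differ in any column, such as ("101", "Ann") and ("101", "Anne"), are both kept, because the whole row is the key.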
If a session fails after loading 10,000 records into the target, how can you load from the 10,001st record when you run the session next time in Informatica 6.1?
Running the session in recovery mode will work, but the target load type must be Normal. If it is Bulk, recovery will not work as expected. Alternatively, set the number of initial rows to skip in the source options of the session properties so that reading starts at record 10,001.

What is the maximum number of transformations that can be used in a mapping?
There is no fixed maximum. A mapping can use any number of transformations, depending on what the project requires.

How many types of dimensions are available in Informatica?
One major classification used in real-world modeling is the slowly changing dimension (SCD):
Type 1 SCD: an updated row replaces the previously existing row, so historical data is lost.
Type 2 SCD: a new row is added for the updated data, so both current and past records are kept, which agrees with the data warehousing goal of maintaining historical data.
Type 3 SCD: new columns are added.
Type 2 is the most commonly used.
Another kind of dimension is the conformed dimension: a dimension that has the same meaning across different star schemas. Example: the time dimension means the same thing wherever it is used.

What are variable ports? List two situations when they can be used.
There are three main kinds of port: input, output and variable. An input port carries data into the transformation. An output port passes data to the next transformation. A variable port holds an intermediate value inside the transformation, for example to break a complex calculation into steps or to carry a value from one row to the next.

What is the Lookup transformation?
Use a Lookup transformation in a mapping to look up data in a relational table, view or synonym. The Informatica Server queries the lookup table based on the lookup ports in the transformation. It compares the Lookup transformation port values to the lookup table column values based on the lookup condition.

How can you improve the performance of the Aggregator transformation?
Aggregator performance can be improved in several ways: use sorted input, limit the connected ports, and filter before aggregating.
My source table has 1,000 records. How can I load records 501 to 1,000 into the target table?
In DB2, FETCH FIRST 500 ROWS ONLY limits a query to the first 500 rows. In Informatica, number the rows with a Sequence Generator and use a Filter to pass only the rows numbered above 500. Alternatively, override the SQL query in Workflow Manager, for example (Oracle syntax):
select * from tab_name where rownum<=1000 minus select * from tab_name where rownum<=500;
Note that MINUS also removes duplicate rows from the result.

How is the Union transformation an active transformation?
Active transformation: a transformation that changes the number of rows reaching the target.
Source (100 rows) > Active Transformation > Target (< or > 100 rows)
Passive transformation: a transformation that does not change the number of rows reaching the target.
Source (100 rows) > Passive Transformation > Target (100 rows)
Union transformation: a Union transformation combines the data from two or more sources. Assume Table-1 contains 10 rows and Table-2 contains 20 rows. Combining them gives a total of 30 rows in the target, so it is definitely an active transformation.

Why do we use Lookup transformations?
Use a Lookup transformation in your mapping to look up data in a relational table, view or synonym. Import a lookup definition from any relational database to which both the Informatica Client and Server can connect. You can use multiple Lookup transformations in a mapping.

What are UTPs?
A UTP (unit test plan) is written and run by the developer to check that the mappings are built according to the given business rules.
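The Sequence Generator plus Filter approach to the 501 to 1,000 question above can be sketched in Python as follows. This is an analogy under stated assumptions, not Informatica code: `rows_in_range` is a hypothetical helper, and `enumerate` stands in for the Sequence Generator's running number.

```python
def rows_in_range(rows, first, last):
    """Yield only the rows whose running sequence number is in [first, last]."""
    for seq, row in enumerate(rows, start=1):  # seq plays the role of the sequence number
        if seq > last:
            break  # nothing after `last` is needed
        if seq >= first:  # the filter condition
            yield row


# Hypothetical source of 1,000 records
source = [f"rec{i}" for i in range(1, 1001)]
loaded = list(rows_in_range(source, 501, 1000))
```

Unlike the ROWNUM/MINUS query, this keeps duplicate rows, because it selects by position and never compares row contents.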
Unconnected lookup: if there is a match for the lookup condition, the PowerCenter Server returns the result of the lookup condition into the return port. It passes one output value to another transformation: the lookup/output/return port passes the value to the transformation calling the :LKP expression. It does not support user-defined default values.
Connected lookup: it passes multiple output values to another transformation. Link the lookup/output ports to that transformation.
What are connected and unconnected transformations?
An unconnected transformation is not connected to other transformations in the mapping. A connected transformation is connected to other transformations in the mapping.

How do we validate all the mappings in the repository at once?
You cannot validate all the mappings in one go, but you can validate all the mappings in a folder in one go and repeat the process for each folder. To do this, log on to the Repository Manager. Open the folder, then the mapping subfolder, select all or some of the mappings (using the Shift or Control key; Ctrl+A does not work), then right-click and validate.

Why did you use Update Strategy in your application?
Update Strategy is the most important of all the Informatica transformations.

What is partitioning? Where can we use it? What are its advantages? Is it necessary?
In Informatica, performance can be tuned at five levels: source, target, mapping, session and network. Partitioning is how performance is tuned at the session level, and there are four types of partitioning.
What is the use of incremental aggregation? Explain briefly with an example.
Incremental aggregation is set in the session properties. Suppose the source has 500 records and 300 more arrive later. Without incremental aggregation, the calculations already done on the 500 records are repeated over all 800 records (500 + 300). With incremental aggregation, only the 300 new records are processed, which improves performance.

What is the Source Qualifier transformation?
When you add a relational or flat file source definition to a mapping, you need to connect it to a Source Qualifier transformation. The Source Qualifier represents the rows that the Informatica Server reads when it executes a session.

What are the types of groups in a Router transformation?
A Router transformation has the following types of groups:
Input
Output
Input group: the Designer copies property information from the input ports of the input group to create a set of output ports for each output group.
Output groups: there are two types of output groups:
You cannot modify or delete output ports or their properties.

What is the difference between the Joiner transformation and the Source Qualifier transformation?
You can join heterogeneous data sources in a Joiner transformation, which cannot be done in a Source Qualifier transformation. You need matching keys to join two relational sources in a Source Qualifier, whereas you do not need matching keys to join two sources in a Joiner. In a Source Qualifier the two relational sources must come from the same data source, whereas a Joiner can also join relational sources that come from different data sources.

How can you improve session performance in the Aggregator transformation?
You can use the following guidelines to optimize the performance of an Aggregator transformation.
Use sorted input to decrease the use of aggregate caches. Sorted input reduces the amount of data cached during the session and improves session performance. Use this option with the Sorter transformation to pass sorted data to the Aggregator transformation.
Limit connected input/output or output ports. Limit the number of connected input/output or output ports to reduce the amount of data the Aggregator transformation stores in the data cache.
Filter before aggregating. If you use a Filter transformation in the mapping, place it before the Aggregator transformation to reduce unnecessary aggregation.

How many types of tasks are there in Workflow Manager? What are they?
1) session
2) command
3) email
4) event-wait
5) event-raise
6) assignment
7) control
8) decision
9) timer
10) worklet
3) and 9) are self-explanatory. 1) runs mappings. 2) runs OS commands or scripts. 4) and 5) raise user-defined or pre-defined events and wait for the event to be raised. 6) assigns values to workflow variables. 10) runs worklets.

How do we load from a PL/SQL script into an Informatica mapping?
You can use a Stored Procedure transformation and specify the PL/SQL procedure name there. When the session containing this transformation runs, the PL/SQL procedure is executed.
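To make the sorted-input guideline above concrete, here is a minimal Python sketch. It assumes rows are (group key, amount) pairs; `aggregate_sorted` is a hypothetical name, and `sorted` stands in for the Sorter transformation. Because equal keys arrive together, only one group has to be held at a time, which is the idea behind the smaller aggregate cache.

```python
from itertools import groupby
from operator import itemgetter


def aggregate_sorted(rows):
    """Sum amounts per key, assuming rows arrive already sorted by key."""
    for key, group in groupby(rows, key=itemgetter(0)):
        # groupby only looks at consecutive rows, so a group is complete
        # as soon as the key changes; earlier groups need not be cached.
        yield key, sum(amount for _, amount in group)


rows = [("east", 10), ("west", 5), ("east", 7), ("west", 1)]
sorted_rows = sorted(rows, key=itemgetter(0))  # stands in for the Sorter
totals = list(aggregate_sorted(sorted_rows))
```

If the rows were not sorted first, `groupby` would emit the same key more than once, which mirrors why the sorted-input option must be paired with input that really is sorted.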
How do you move a mapping from one database to another?
There are two ways of doing this.
1. Open the mapping you want to migrate. From the File menu, select Export Objects and give a name; an XML file is generated. Connect to the repository you want to migrate to, then select Import Objects from the File menu and choose the XML file.
2. Connect to both repositories. In the source folder, select the mapping name in the object navigator and choose Copy from the Edit menu. Then go to the target folder and choose Paste from the Edit menu. Be sure the target folder is open.

How do you join two tables without using the Joiner transformation?
How do you check the source for the latest records that are to be loaded into the target?
a) Create a lookup on the target table from the Source Qualifier, based on the primary key.
b) Use an expression to evaluate the primary key returned by the target lookup. For a new source record, the lookup's primary key port for the target table returns null. Trap this with DECODE and proceed.

What logic would you implement to load data into one fact table from n dimension tables?
First create the fact table and the dimension tables. Then load data into the individual dimensions using sources and transformations (Aggregator, Sequence Generator, Lookup) in the Mapping Designer. Finally, for the fact table, connect the surrogate keys to the foreign keys and the columns from the dimensions to the fact.

Without using Update Strategy and session options, how can we update the target table?
If the database is Teradata, it can be done with a TPump or MultiLoad external loader. The update override in the target properties is used mainly for updating the target table based on a non-key column, for example updating by ENAME, which is not a key column in the EMP table. If you use an Update Strategy or session-level properties instead, the target must have a primary key.

What are the differences between Informatica PowerCenter versions 6.2 and 7.1, and between versions 6.2 and 5.1?
The main difference between Informatica 5.1 and 6.1 is that 6.1 introduced the Repository Server and replaced the Server Manager (5.1) with Workflow Manager and Workflow Monitor. In version 7.x you can look up on a flat file and write to an XML target.

How do you enter the same record twice in the target table?
Declare the target table twice in the mapping and connect the output to both target instances. Alternatively, use this syntax, where table1 is the name of the table:
insert into table1 select * from table1

Informatica Live Interview Questions
Explain grouped cross tab.
Explain reference cursor.
What are parallel queries and query hints?
How do you select duplicate rows using Informatica?
What are the types of lookup caches?
Persistent cache: you can save the lookup cache files and reuse them the next time the Informatica Server processes a Lookup transformation configured to use the cache.
Recache from database: if the persistent cache is not synchronized with the lookup table, you can configure the Lookup transformation to rebuild the lookup cache.
Static cache: you can configure a static, or read-only, cache for any lookup table. By default the Informatica Server creates a static cache. It caches the lookup table and looks up values in the cache for each row that comes into the transformation. When the lookup condition is true, the Informatica Server does not update the cache while it processes the Lookup transformation.
Dynamic cache: if you want to cache the target table and insert new rows into both the cache and the target, you can create a Lookup transformation that uses a dynamic cache. The Informatica Server dynamically inserts data into the target table.
Shared cache: you can share the lookup cache between multiple transformations. You can share an unnamed cache between transformations in the same mapping.

What are semi-additive measures and fully additive measures?
There are three types of facts: 1. additive, 2. semi-additive, 3. non-additive.
Additive: the measure can be summed across all the dimensions linked to the fact table.
Semi-additive: the measure can be summed across some of the dimensions but not all of them. An account balance, for example, can be summed across accounts but not across time.
Non-additive: the measure cannot be summed across any dimension; the result comes directly from the measures of the fact table itself. Example: a loan percentage, which is calculated from the loan measure in the fact table without reference to any dimension.
How do you create a header and footer in the target using Informatica?
You can create a header and a trailer in the target file using an Aggregator transformation. Take the number of records as a count in the Aggregator. Create three separate files in a single pipeline: one is the header, another is the trailer coming from the Aggregator, and the third is the main file. Concatenate the header, the main file and the trailer in a post-session command using a shell script.

What is meant by EDW?
It is a big, centralized data warehouse, or the old style of warehouse: a single enterprise data warehouse (EDW) with no associated data marts or operational data store (ODS) systems.

What will happen if you are using an Update Strategy transformation and your session is configured for insert?
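The header and trailer answer above amounts to three pieces concatenated in order, with the trailer carrying the record count. A minimal Python sketch of that final layout follows. The `HDR` and `TRL` markers, the comma delimiter and the function name `build_target_file` are assumptions made for illustration; the source does not say what the header and trailer contain beyond the count.

```python
def build_target_file(rows, header="HDR"):
    """Return file text: header line, one line per row, trailer with the row count."""
    body = [",".join(row) for row in rows]  # the main file
    trailer = f"TRL,{len(body)}"  # the count an Aggregator would supply
    return "\n".join([header, *body, trailer]) + "\n"


# Hypothetical detail rows
rows = [("1", "a"), ("2", "b")]
file_text = build_target_file(rows)
```

In the Informatica version the three parts are written as separate files and joined by a post-session shell command; here they are joined in memory only to show the resulting layout.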
What are the methods for creating reusable transformations?
What is the difference between a dimension table and a fact table, and what are the different dimension tables and fact tables?
A fact table contains measurable data, with fewer columns and many rows. It contains a primary key. The different types of fact are additive, non-additive and semi-additive.
A dimension table contains textual descriptions of the data, with many columns and fewer rows. It also contains a primary key.

What is the use of the Update Strategy transformation?
It flags source records as INSERT, DELETE, UPDATE or REJECT for the target database. The default flag is insert. It is a must for incremental data loading.

What is meant by a complex mapping?
A complex mapping is one that involves more logic and more business rules. For example, in a bank project I was involved in building a data warehouse. The bank had many customers who relocated after taking out loans, and it was difficult to maintain both their previous and current addresses, so I used SCD Type 2. This is a simple example of a complex mapping.

What is the difference between Informatica 7.1 and Ab Initio?
There are many differences between Informatica and Ab Initio. Ab Initio uses three kinds of parallelism, whereas Informatica uses one. Ab Initio has no scheduling option, so jobs are scheduled manually or with a PL/SQL script, whereas Informatica has four scheduling options. Ab Initio includes the Co>Operating System.
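The SCD Type 2 approach mentioned in the complex-mapping answer above (keeping both the previous and the current address) can be sketched in Python. This is one common way to model it, not the only one: the column names, the use of `end_date = None` to mark the current row, and the function `apply_scd2` are all assumptions for this example.

```python
from datetime import date


def apply_scd2(dimension, customer_id, new_address, today):
    """Record an address change as a Type 2 update: expire the old row, add a new one."""
    for row in dimension:
        is_current = row["customer_id"] == customer_id and row["end_date"] is None
        if is_current:
            if row["address"] == new_address:
                return  # unchanged: keep the current row as it is
            row["end_date"] = today  # expire the old version but keep it as history
    # A new surrogate key lets the natural key (customer_id) repeat.
    next_key = max((r["surrogate_key"] for r in dimension), default=0) + 1
    dimension.append({
        "surrogate_key": next_key,
        "customer_id": customer_id,
        "address": new_address,
        "start_date": today,
        "end_date": None,  # None marks the current version
    })


dimension = [{
    "surrogate_key": 1, "customer_id": "C100", "address": "12 Old Road",
    "start_date": date(2005, 1, 1), "end_date": None,
}]
apply_scd2(dimension, "C100", "7 New Street", date(2006, 11, 27))
```

After the call the dimension holds two rows for customer C100: the expired row with the old address and a current row with the new one, each with its own surrogate key.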
Two relational tables are connected to a Source Qualifier transformation. What possible errors can be thrown?
There are only two possibilities:
1. Both tables should have a primary key/foreign key relationship.
2. Both tables should be available in the same schema or the same database.
How do you import an Oracle sequence into Informatica?
Create a procedure, declare the sequence inside the procedure, and call the procedure in Informatica with the help of a Stored Procedure transformation.

What is an IQD file?
IQD stands for Impromptu Query Definition. The file is mainly used with the Cognos Impromptu tool: after creating an IMR (report), we save it as an IQD file, which is then used when creating a cube in PowerPlay Transformer. In the data source type we select Impromptu Query Definition.

Can Informatica load heterogeneous targets from heterogeneous sources?
Yes, it can. For example, flat file and relational sources are joined in the mapping, and later flat file and relational targets are loaded.

In real projects, which is better, star schema or snowflake schema?
In real projects the star schema is usually implemented because it takes less time. Each dimension table in a star schema has a surrogate key, and this surrogate key is used as a foreign key in the fact table.

Where is the cache stored in Informatica?
The cache is stored in the Informatica Server memory, and overflow data is stored on disk in files that are deleted automatically after the session completes successfully. If you want to keep that data, you have to use a persistent cache.

In a Joiner transformation, you should specify the source with fewer rows as the master source. Why?
In a Joiner transformation, the Informatica Server reads all the records from the master source and builds the index and data caches from the master rows. After building the caches, the Joiner reads records from the detail source and performs the join. A smaller master source therefore means smaller caches.

How do we remove the staging area?
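The reasoning in the Joiner answer above, that the master source is cached in full while the detail source is streamed, can be illustrated with a small Python sketch of a normal (inner) join. It is an analogy only: the function name `joiner`, the dictionary rows and the sample data are hypothetical, and Informatica's index and data caches are reduced here to a single in-memory dictionary.

```python
from collections import defaultdict


def joiner(master_rows, detail_rows, key):
    """Normal join: cache every master row by key, then stream the detail rows."""
    cache = defaultdict(list)
    for m in master_rows:  # the whole master source is held in memory
        cache[m[key]].append(m)
    for d in detail_rows:  # detail rows are read one at a time and never cached
        for m in cache.get(d[key], []):
            yield {**m, **d}


# Hypothetical sources: the smaller one (departments) is the master.
master = [{"dept_id": 1, "dept": "HR"}, {"dept_id": 2, "dept": "IT"}]
detail = [
    {"dept_id": 1, "emp": "Ann"},
    {"dept_id": 2, "emp": "Bob"},
    {"dept_id": 1, "emp": "Cy"},
    {"dept_id": 3, "emp": "Di"},
]
joined = list(joiner(master, detail, "dept_id"))
```

Memory use grows with the size of `master` only, which is why choosing the source with fewer rows as the master keeps the cache small.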
Is it possible to run one loading session with one particular target and multiple types of data sources?
Yes.
Use a Joiner transformation to join the heterogeneous sources.
Develop the mapping.
In the session, enter the respective locations or details of the sources.

You transfer 100,000 rows to the target but some records are discarded. How will you trace them, and where are they loaded?
In the session's target properties, the last two attributes, reject file directory and reject filename, tell you where the bad file (the file with the rejected records) is located.

How many types of flat files are available in Informatica?
Monday, November 27th, 2006
There are two types of flat files:
1. Delimited
2. Fixed width
What are sessions and batches?
Session: a session is a set of instructions that tells the Informatica Server how and when to move data from sources to targets. After creating the session, we can use either the Server Manager or the command line program pmcmd to start or stop it.
Batches: a batch provides a way to group sessions for either serial or parallel execution by the Informatica Server. There are two types of batches:
Sequential: runs sessions one after the other.
Concurrent: runs sessions at the same time.

In how many ways can you update a relational source definition, and what are they?
What is the default join that the Source Qualifier provides?
The Source Qualifier performs an inner join by default. The Joiner transformation, by comparison, supports the following join types, which you set in the Properties tab:
* Normal (Default)
* Master Outer
* Detail Outer
* Full Outer

What is the target load order?
You specify the target load order based on the Source Qualifiers in a mapping. If you have multiple Source Qualifiers connected to multiple targets, you can designate the order in which the Informatica Server loads data into the targets.

What are the tasks that the Source Qualifier performs?
# Join data originating from the same source database. You can join two or more tables with primary-foreign key relationships by linking the sources to one Source Qualifier.
# Filter records when the Informatica Server reads source data. If you include a filter condition, the Informatica Server adds a WHERE clause to the default query.
# Specify an outer join rather than the default inner join. If you include a user-defined join, the Informatica Server replaces the join information specified by the metadata in the SQL query.
# Specify sorted ports. If you specify a number for sorted ports, the Informatica Server adds an ORDER BY clause to the default SQL query.
# Select only distinct values from the source. If you choose Select Distinct, the Informatica Server adds a SELECT DISTINCT statement to the default SQL query.
What are the transformations that restrict the partitioning of sessions?
* Advanced External Procedure transformation and External Procedure transformation: these contain a check box on the Properties tab to allow partitioning.
* Aggregator transformation: if you use sorted ports, you cannot partition the associated source.
* Joiner transformation: you cannot partition the master source for a Joiner transformation.
* Normalizer transformation
* XML targets

How do you import VSAM files from source to target?
The Mapping Designer has a direct option to import files from VSAM. Navigation: Sources => Import from file => file from COBOL.

What is the difference between the IIF and DECODE functions?
You can use nested IIF statements to test multiple conditions. The following example tests for various conditions and returns 0 if sales is zero or negative:
IIF( SALES > 0, IIF( SALES < 50, SALARY1, IIF( SALES < 100, SALARY2, IIF( SALES < 200, SALARY3, BONUS))), 0 )
You can use DECODE instead of IIF in many cases, and DECODE may improve readability. The following shows how you can use DECODE instead of IIF:
DECODE( TRUE, SALES > 0 AND SALES < 50, SALARY1, SALES > 49 AND SALES < 100, SALARY2, SALES > 99 AND SALES < 200, SALARY3, SALES > 199, BONUS)

How do you decide whether you need to do aggregations at the database level or at the Informatica level?
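To compare the two expressions in the IIF and DECODE answer above, the sketch below re-creates them in Python. The helpers `iif` and `decode` are simplified stand-ins written for this example, not Informatica's functions, and the salary and bonus figures are invented. One assumption to flag: this `decode` returns `None` when nothing matches and no default is given, which is how the sketch models a DECODE call without a default value.

```python
def iif(condition, when_true, when_false):
    """Two-way choice, like IIF(condition, value1, value2)."""
    return when_true if condition else when_false


def decode(value, *pairs, default=None):
    """Return the result paired with the first search item equal to value."""
    for search, result in zip(pairs[0::2], pairs[1::2]):
        if value == search:
            return result
    return default


# Hypothetical amounts for the ports named in the expressions
SALARY1, SALARY2, SALARY3, BONUS = 100, 200, 300, 1000


def pay_iif(sales):
    return iif(sales > 0,
               iif(sales < 50, SALARY1,
                   iif(sales < 100, SALARY2,
                       iif(sales < 200, SALARY3, BONUS))),
               0)


def pay_decode(sales):
    return decode(True,
                  sales > 0 and sales < 50, SALARY1,
                  sales > 49 and sales < 100, SALARY2,
                  sales > 99 and sales < 200, SALARY3,
                  sales > 199, BONUS)


results = [(s, pay_iif(s), pay_decode(s)) for s in (0, 30, 75, 150, 500)]
```

For positive sales the two forms agree. For zero or negative sales the nested IIF returns 0 while this DECODE form has no matching branch, so the two are equivalent only if a default of 0 is added to the DECODE call.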
What are the tasks that the Load Manager process performs?
Manages session and batch scheduling: when you start the Informatica Server, the Load Manager launches and queries the repository for a list of sessions configured to run on the Informatica Server. When you configure the session, the Load Manager