How do we eliminate duplicate records in a flat file without using Sorter and Aggregator?
Answer Posted / priyank
There are several ways of achieving this. One is to use an
Expression transformation; another is a lookup on the
target.
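Expression transformation:
Create the ports in this order. Variable ports are evaluated
top to bottom, and a variable port keeps its value from the
previous row until it is reassigned, so the comparison must
come before Var_PREV_KEY is overwritten:
Var_CURR_KEY=Key
Var_CHK_DUPLICATE --> IIF(Var_CURR_KEY=Var_PREV_KEY,'DUP','NODUP')
Var_PREV_KEY=Var_CURR_KEY
OUT_DUPLICATE --> Var_CHK_DUPLICATE
Then use a Filter transformation to pass only the rows where
OUT_DUPLICATE='NODUP'. Each row is compared only with the row
before it, so this works only if rows with the same key
already arrive next to each other in the file.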
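To make the row-by-row evaluation concrete, here is a minimal sketch in Python (not Informatica syntax). It assumes the comparison runs before the previous-key variable is overwritten and that rows with the same key arrive together. The function name and the sample keys are made up for illustration, and starting from None is a simplification, since Informatica starts variable ports from a datatype default instead.

```python
def flag_duplicates(keys):
    """Mimic the variable ports: compare each key with the previous row's key."""
    var_prev_key = None  # assumption: stands in for the port's initial value
    flags = []
    for key in keys:
        var_curr_key = key
        # Compare first, using the value left over from the previous row...
        var_chk_duplicate = "DUP" if var_curr_key == var_prev_key else "NODUP"
        # ...and only then remember the current key for the next row.
        var_prev_key = var_curr_key
        flags.append(var_chk_duplicate)
    return flags


print(flag_duplicates([101, 101, 102, 103, 103, 103]))
# ['NODUP', 'DUP', 'NODUP', 'NODUP', 'DUP', 'DUP']

# Same key, but not adjacent: the repeat is NOT detected.
print(flag_duplicates([101, 102, 101]))
# ['NODUP', 'NODUP', 'NODUP']
```

The second call shows the limitation: a repeated key that is not adjacent to its earlier occurrence is never flagged.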
Note: I have assumed a scenario where the target table
has only 1 key. With multiple keys, create CURR and PREV
variable ports for each key, and in the Var_CHK_DUPLICATE
port combine the checks with an 'AND' operator.
E.g. for 2 keys:
Var_CURR_KEY1=Key1
Var_CURR_KEY2=Key2
Var_CHK_DUPLICATE --> IIF(Var_CURR_KEY1=Var_PREV_KEY1 AND
Var_CURR_KEY2=Var_PREV_KEY2,'DUP','NODUP')
Var_PREV_KEY1=Var_CURR_KEY1
Var_PREV_KEY2=Var_CURR_KEY2
OUT_DUPLICATE --> Var_CHK_DUPLICATE
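The two-key check can be sketched the same way in Python (again only an illustration, not Informatica syntax). Both keys are packed into one tuple, so a row counts as a duplicate only when Key1 AND Key2 both match the previous row. The helper name, the "val" field and the sample rows are hypothetical, and the rows are assumed to arrive grouped by key.

```python
def keep_first_of_each_group(rows, key_fields):
    """Keep a row only if its composite key differs from the previous row's."""
    prev_key = None
    kept = []
    for row in rows:
        # Tuple equality is True only when every key field matches (the 'AND').
        curr_key = tuple(row[field] for field in key_fields)
        if curr_key != prev_key:
            kept.append(row)
        prev_key = curr_key
    return kept


rows = [
    {"Key1": "A", "Key2": 1, "val": "x"},
    {"Key1": "A", "Key2": 1, "val": "y"},  # same Key1 and Key2 as the row above
    {"Key1": "A", "Key2": 2, "val": "z"},  # Key2 differs, so this row is kept
    {"Key1": "B", "Key2": 2, "val": "w"},
]
for row in keep_first_of_each_group(rows, ["Key1", "Key2"]):
    print(row["val"])
# x
# z
# w
```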
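If the Informatica server is installed on Unix, then in
the pre-session command you can give a Unix command to
remove the duplicates from the file, like
sort <file_name> | uniq > <file_name>.new
and have the session read <file_name>.new as its source.
This removes only lines that are identical as a whole.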
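For readers who want to see what that pipeline does to the data, here is a rough Python equivalent. It is only a sketch: the function names are hypothetical, and Python orders strings by code point, whereas the Unix sort order depends on the locale, so the line order may differ from the real command.

```python
def sort_uniq(lines):
    """Return the distinct lines in sorted order, like `sort | uniq`."""
    return sorted(set(lines))


def dedupe_file(src_path, dst_path):
    """Write the distinct lines of src_path to dst_path (a separate file)."""
    with open(src_path, encoding="utf-8") as src:
        lines = src.read().splitlines()
    with open(dst_path, "w", encoding="utf-8") as dst:
        for line in sort_uniq(lines):
            dst.write(line + "\n")


print(sort_uniq(["102,bob", "101,amy", "102,bob", "103,cal"]))
# ['101,amy', '102,bob', '103,cal']
```

As with the shell command, the output goes to a separate file, so the original is left untouched.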
Hope it helps.