how to remove duplicates in transformer stage by using
stage variables?one example?
Answers were Sorted based on User's Feedback
Answer / ds
In Stage variable:
stage_variable3 <map> stage_variable1
if column=stage_variable1 than 0 else 1 <map>
stage_variable2
column <map> stage_variable3
Put stage_variable2 as constrain to target stage.
| Is This Answer Correct ? | 12 Yes | 2 No |
Answer / venu
if you want to remove duplicates in transformer stage
use one of the partition technic hash partition you can
easily remove duplicatess
| Is This Answer Correct ? | 8 Yes | 2 No |
Answer / peeyush sehgal
sv1=inputlink
sv2=if inputlink=sv3 then 1 else 0
sv3=sv1
| Is This Answer Correct ? | 0 Yes | 2 No |
Answer / prasad
take two stage variables
sV1: Input_column
sV2: if Input_column = sV1 then 0 else 1
and put 'sV1=1' as constraint
Plz correct me, If am wrong.....
| Is This Answer Correct ? | 4 Yes | 7 No |
Answer / amit
using hash partition technique, we can bring duplicate data(based on key columns) in one partition. Then in stage constraints filter out data with setting @inrownum = 1.
This will remove duplicate in transformer stage.
| Is This Answer Correct ? | 0 Yes | 7 No |
Answer / subodh
duplication of transformer stage is removed b7y using a
call by referance and call by value , using we create one
object and no other duplication is done
| Is This Answer Correct ? | 1 Yes | 14 No |
What is the difference between Link collector and Funnel Stages?
Can we use sequential file as source to hash file? Have you do it ?if what error it will give?
in source is like seq file in date column have dd-mm-yy dddd-mmmm-yyyy mm-dd-yy yy-dd-mm yy-mm-dd i want to display only yy-dd-mm date formats only in tgt?
Differentiate between Symmetric Multiprocessing and Massive Parallel Processing?
how to design the change capture stage in(data stage parallel jobs) type 2
source file is having 5 records while moving into target it want to be 10 records
Can you explain link buffering?
What are the partitioning techniques available in link partitioner?
With out using Funnel Stage, how to populate the data from different sources to single target
what is parameterset?
How to initialize environment variables?
Is it possible to query a hash file?