how to remove duplicates in transformer stage by using
stage variables?one example?
Answers were Sorted based on User's Feedback
Answer / ds
In Stage variable:
stage_variable3 <map> stage_variable1
if column=stage_variable1 than 0 else 1 <map>
stage_variable2
column <map> stage_variable3
Put stage_variable2 as constrain to target stage.
| Is This Answer Correct ? | 12 Yes | 2 No |
Answer / venu
if you want to remove duplicates in transformer stage
use one of the partition technic hash partition you can
easily remove duplicatess
| Is This Answer Correct ? | 8 Yes | 2 No |
Answer / peeyush sehgal
sv1=inputlink
sv2=if inputlink=sv3 then 1 else 0
sv3=sv1
| Is This Answer Correct ? | 0 Yes | 2 No |
Answer / prasad
take two stage variables
sV1: Input_column
sV2: if Input_column = sV1 then 0 else 1
and put 'sV1=1' as constraint
Plz correct me, If am wrong.....
| Is This Answer Correct ? | 4 Yes | 7 No |
Answer / amit
using hash partition technique, we can bring duplicate data(based on key columns) in one partition. Then in stage constraints filter out data with setting @inrownum = 1.
This will remove duplicate in transformer stage.
| Is This Answer Correct ? | 0 Yes | 7 No |
Answer / subodh
duplication of transformer stage is removed b7y using a
call by referance and call by value , using we create one
object and no other duplication is done
| Is This Answer Correct ? | 1 Yes | 14 No |
What are the some differences between 7.x and 8.x version of datastage?
Why do we use link partitioner and link collector in datastage?
Hi, what is use of Macros,functions and Routines..? At what situation you are used. If you know the answer please explain it. Thanks.
how to handle null values using transformer stage?
How we can covert server job to a parallel job?
If the job aborted in a sequencer, how can we start that from the previews successful job.
i have data like sam ram sam raj ram I want two targets trgt1 ram sam trgt2 raj how can i do this in datastage?
What is the differentiate between data file and descriptor file?
How do you schedule or monitoring the job?
What is active and passive stage?
What is the difference between server job and parallel jobs?
How can one find bugs in job sequence?