i want send my all duplicate record one tar and all uniq
records one target how we will perfome explain
example:
input data
eid
251
251
456
456
951
985
out put/target1
251
251
456
456
out put/target2
951
985
how we will bring
Answer Posted / anil
Hi,
This can be do easily with loops in transformer stage
Design
--------
Sq File - > Aggregrate - > Transformer - > Unique
|
Duplicate
1. In aggregate, calculate no of rows say counts
2. In transformer , define a stage variable stgTest whose derivation is if counts > 1 then 'Duplicate' Else 'Single'
3. In the loop condition define @iteration < Counts
4. Take a loop variable saying lopNumbers = EID
5. Now for Unique link write constraint as stgTest = 'Single' and output derivation of EID = @iteration.
6. Now for Duplicate link write constraint as stgTest = 'Duplicate' and output derivation of EID = @iteration.
7. Now you will get duplicate values in one link and unique values in another link.
| Is This Answer Correct ? | 0 Yes | 1 No |
Post New Answer View All Answers
How you Implemented SCD Type 1 & Type 2 in your project?
Describe the architecture of datastage?
Is it possible to implement parallelism in Mainframe Jobs ? If Yes how ? If no why ?
How a source file is populated?
file having these input and we have to get 3 output using same job Input 1 1 1 2 3 4 4 4 o/p1 o/p2 o/p3 1 1 2 2 1 3 3 1 4 4 4
if i have two tables table1 table2 1a 1a,b,c,d 1b 2a,b,c,d,e 1c 1d 2a 2b 2c 2d 2e how can i get data as same as in tables? how can i implement scd typ1 and type2 in both server and in parallel? field1 field2 field3 suresh , 10,324 , 355 , 1234 ram , 23,456 , 450 , 456 balu ,40,346,23 , 275, 5678 how to remove the duplicate rows,inthe fields?
Where do the datastage jobs get stored?
Define repository tables in datastage?
hi.... am facing typical problem in every interview " I need some critical scenarios faced in real time" plz help me guys
How we can covert server job to a parallel job?
how to write server Routine coding?
what is ds administrator used for?
Hi All , in PX Job I have passed 4 Parameters and when i run the same job in sequence i dont want to use those parameters , is this possible if yes then how
How many Key we can define in remove duplicate stage?
what are the devoleper roles in real time? plz tell i am new to datastage....