one file contains
col1
100
200
300
400
500
100
300
600
300
from this i want to retrive the only duplicate like this
tr1
100
100
300
300
300 how it's possible in datastage?can any one plz explain
clearley..........?
Answer Posted / reddymkl.dwh
Job Design:
Agg--->Filter1---------->|
| | Unique
file-->cp-------------------->Join---->Filter2---->target1
|
|-->Duplicate
Target2
Agg: use aggregator and select Agg_type=count rows and then give the Count O/P column=Cnt (User defined).
Filter1: give the condition Where=Cnt=1
U will get unique values like 200,400,500,600
Use Join (Or) Lookup stage: select left outer join
Filter2:
Where=Column_name='' (Duplicate values like 100,100,300,300,300)
Where=Column_name<>'' (Unique Values like 200,400,500,600)
u will get the right output....what ever the duplicate records.
Plz correct me if am wrong.....
Is This Answer Correct ? | 0 Yes | 0 No |
Post New Answer View All Answers
Differentiate between operational datastage (ods) and data warehouse?
Can you explain kafka connector?
How do you schedule or monitoring the job?
1.new record it will insert but changes of natural key is not present in taget i want to update (here key is composite natural key )can any one help this to explan how to do
What is merge stage?
What are the primary usages of datastage tool?
What is use Array size in datastage
What are the differences between datastage and informatica?
Is it possible to query a hash file?
how to implement scd2 in datastage 7.5 with lookup stage
How to write a expression to display the first letter in Caps in each word using transformer stage ? Please let me know ASAP Thanks in advance...
What is the use of hoursfromtime() function in transformer stage in datastage?
What are the main features of datastage?
1)s.key generate 1 to 700 records today. tomorrow another 400 will updated how to update the records using s.key generator? 2)source is like :-- DB --> T/F stage1 --> seq1file T/f 1 is linking with T/F2 ---> seq 2 how to load the data? in source i given some conditions those r going in seq1. The another data will going to seq2 how to do this ?
Is it possible to implement parallelism in Mainframe Jobs ? If Yes how ? If no why ?