one file contains
col1
100
200
300
400
500
100
300
600
300
from this i want to retrive the only duplicate like this
tr1
100
100
300
300
300 how it's possible in datastage?can any one plz explain
clearley..........?
Answer Posted / prasad
>Agg--->Filter1------->|
| |
| |
file-->cp-------------------->Join---->Filter2---->target1
|
|
Target2
Agg: use aggregator and select Agg_type=count rows and then give the Count O/P column=Count (User defined)
Count
------------
100--2
200--1
300--3
400--1
500--1
600--1
it will generate in Agg stage then
Filter1: give condition like Count=1( u will get unique records from Filter1)
Join Stage: take Left Outer Join
Filter2:
where=column_name=''(null){u will get duplicates records)
Target1 o/p:
100
100
300
300
300
where= column_name<>''(u will get unique records)
Target2 o/p:
200
400
500
600
Please correct, if am wrong :)
Is This Answer Correct ? | 2 Yes | 0 No |
Post New Answer View All Answers
What is orabulk stage?
What is the project in datastage?
In Datastage, how you can fix the truncated data error?
Terminate Activity
What is the purpose of interprocessor stage in server jobs?
What are the differences between datastage and informatica?
how to read 100 records at a time in source a) hw is it fr metadata Same and b) if metadata is nt same?
What are the types of containers?
What is the flow of loading data into fact & dimensional tables?
Differentiate between Join, Merge and Lookup stage?
Differentiate between odbc and drs stage?
Differentiate between datastage and informatica?
Define project in datastage?
i have a job with 3 possibiliies finish with ststus ok abbort warning how do you handle these 3 in job sequence? explain the situation where you have appiled scd in your project? chandu
What are constraints and derivations?