One file contains:
col1
100
200
300
400
500
100
300
600
300
From this I want to retrieve only the duplicates, like this:
tr1
100
100
300
300
300
How is this possible in DataStage? Can anyone please explain clearly?

Answer Posted / prasad

       +--->Agg--->Filter1----+
       |                      |
       |                      v
file-->cp------------------->Join---->Filter2---->target1
                                         |
                                         |
                                      Target2
Agg: use the Aggregator stage, group by col1, set Aggregation Type = Count Rows, and set the Count Output Column to Count (a user-defined name). The Aggregator stage will then generate:

col1--Count
------------
100--2
200--1
300--3
400--1
500--1
600--1
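
To make the Aggregator step concrete outside DataStage, here is a rough Python sketch of the same count-rows-per-value idea. It is only an illustration: the list name col1 and the use of collections.Counter are my own choices, not anything DataStage provides.

```python
from collections import Counter

# Values of col1 from the sample file in the question.
col1 = [100, 200, 300, 400, 500, 100, 300, 600, 300]

# Group by value and count the rows per group,
# analogous to Aggregation Type = Count Rows.
counts = Counter(col1)

# Print in the same value--count layout used in this answer.
for value in sorted(counts):
    print(f"{value}--{counts[value]}")
```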

Filter1: give a condition like Count=1 (you will get the unique records from Filter1).

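Join stage: use a Left Outer Join on col1, with the direct link from cp as the left input and the Filter1 output as the right input.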

Filter2:
where column_name = '' (null): you will get the duplicate records, because they found no match in the Filter1 output.

Target1 o/p:
100
100
300
300
300

Filter2, second output link: where column_name <> '' (you will get the unique records).

Target2 o/p:

200
400
500
600
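
If it helps to check the logic, the whole flow (Agg, Filter1, Left Outer Join, Filter2) can be mimicked in a few lines of Python. This is only a sketch under assumptions: the variable names are hypothetical, None stands in for the null/empty Count column after the join, and the sort is there just to make the printed lists easy to compare with the outputs above.

```python
from collections import Counter

# Values of col1 from the sample file in the question.
col1 = [100, 200, 300, 400, 500, 100, 300, 600, 300]

# Agg + Filter1: count rows per value, keep only values with Count = 1.
counts = Counter(col1)
unique_keys = {value: count for value, count in counts.items() if count == 1}

# Join: left outer join of every input row (the direct cp link)
# with the Filter1 output. Rows without a match get None for Count.
joined = [(value, unique_keys.get(value)) for value in col1]

# Filter2: Count is null -> duplicates (Target1),
#          Count is not null -> unique records (Target2).
target1 = sorted(value for value, count in joined if count is None)
target2 = sorted(value for value, count in joined if count is not None)

print(target1)  # [100, 100, 300, 300, 300]
print(target2)  # [200, 400, 500, 600]
```

In the real job, the row order in each target depends on how the data is sorted and partitioned, so do not rely on the order shown here.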

Please correct me if I am wrong :)
