one file contains
col1
100
200
300
400
500
100
300
600
300
from this i want to retrive the only duplicate like this
tr1
100
100
300
300
300 how it's possible in datastage?can any one plz explain
clearley..........?

Answer Posted / reddyvaraprasad

Job Design:

|----->Agg--->Filter1-->|
| |
| |
file-->cp-------------------->Join---->Filter2---->target

Agg: use aggregator and select Agg_type=count rows and then give the Count O/P column=Count (User defined).

Filter1: give the condition Count<>1

Join: select left outer join

Filter2: give the condition Count<>0

u will get the right output....what ever the duplicate records.

and if u want unique records, give the condition Count=0

Is This Answer Correct ?    0 Yes 0 No



Post New Answer       View All Answers


Please Help Members By Posting Answers For Below Questions

Difference between IBM DATA STAGE8.5 and DATA STAGE9.1 ?

3378


Describe the architecture of datastage?

569


What is the roundrobin collector?

636


What is "fatal error/rdbms code 3996" error?

663


What are the types of containers in datastage?

703






Define orabulk and bcp stages?

678


What is apt_config in datastage?

702


how many rows sorted in sort stage by default in server jobs

3554


How do you register plug-ins?

600


What is the project in datastage?

666


Why fact table is in normal form?

673


Terminate Activity

838


how to implement scd2 in datastage 7.5 with lookup stage

5128


Differentiate between Join, Merge and Lookup stage?

645


Define Merge?

874