
One file contains:
col1
100
200
300
400
500
100
300
600
300
From this I want to retrieve only the duplicate records, like this:
tr1
100
100
300
300
300
How is this possible in DataStage? Can anyone please explain clearly?

Question Posted / reddyvaraprasad


Answer Posted / reddyvaraprasad

Job Design:

       |----->Agg--->Filter1-->|
       |                       |
       |                       |
file-->cp-------------------->Join---->Filter2---->target

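Agg: use an Aggregator stage grouped on col1, set Aggregation Type = Count Rows, and set Count Output Column = Count (a user-defined name).

Filter1: give the condition Count<>1, so only the values that occur more than once pass through.

Join: select a left outer join on col1, with the link from the Copy stage as the left input.

Filter2: give the condition Count<>0. This assumes the Count column is non-nullable, so rows that found no match in the join come out with Count=0; if the column is nullable those rows would carry NULL instead and the condition would need to test for NULL.

You will get the right output: all the duplicate records.

If you want the unique records instead, give the condition Count=0 in Filter2.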
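To check the logic outside DataStage, the same flow can be sketched in plain Python. This is only an illustration of the count / filter / left outer join / filter idea, not DataStage code. The function name split_duplicates is made up for the example, and unmatched rows are assumed to get a count of 0, as in the Filter2 step.

```python
from collections import Counter


def split_duplicates(values):
    """Mimic the job design: Agg -> Filter1 -> left outer Join -> Filter2."""
    # Agg: count rows per col1 value.
    counts = Counter(values)
    # Filter1: keep only the values whose count is not 1.
    dup_counts = {key: cnt for key, cnt in counts.items() if cnt != 1}
    # Join: left outer join of the original rows against the filtered counts.
    # Assumption: rows without a match get Count = 0.
    joined = [(value, dup_counts.get(value, 0)) for value in values]
    # Filter2: Count <> 0 gives the duplicates, Count = 0 gives the uniques.
    duplicates = [value for value, cnt in joined if cnt != 0]
    uniques = [value for value, cnt in joined if cnt == 0]
    return duplicates, uniques


col1 = [100, 200, 300, 400, 500, 100, 300, 600, 300]
duplicates, uniques = split_duplicates(col1)
print(sorted(duplicates))  # [100, 100, 300, 300, 300]
print(uniques)             # [200, 400, 500, 600]
```

The duplicates come back in input order, so they are sorted here only to match the layout shown in the question.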
