Job Design: Agg--->Filter1---------->| |

one file contains
col1
100
200
300
400
500
100
300
600
300
from this i want to retrive the only duplicate like this
tr1
100
100
300
300
300 how it's possible in datastage?can any one plz explain
clearley..........?

Question Posted / reddymkl.dwh

8 Answers
10792 Views
IBM, I also Faced
E-Mail Answers

Answer Posted / reddymkl.dwh

Job Design:

Agg--->Filter1---------->|
| | Unique
file-->cp-------------------->Join---->Filter2---->target1
|
|-->Duplicate
Target2

Agg: use aggregator and select Agg_type=count rows and then give the Count O/P column=Cnt (User defined).

Filter1: give the condition Where=Cnt=1

U will get unique values like 200,400,500,600

Use Join (Or) Lookup stage: select left outer join

Filter2:

Where=Column_name='' (Duplicate values like 100,100,300,300,300)
Where=Column_name<>'' (Unique Values like 200,400,500,600)

u will get the right output....what ever the duplicate records.

Plz correct me if am wrong.....

Is This Answer Correct ?

0 Yes

0 No

Post New Answer View All Answers

Please Help Members By Posting Answers For Below Questions

Differentiate between operational datastage (ods) and data warehouse?

672

Can you explain kafka connector?

775

How do you schedule or monitoring the job?

667

1.new record it will insert but changes of natural key is not present in taget i want to update (here key is composite natural key )can any one help this to explan how to do

1649

What is merge stage?

762

What are the primary usages of datastage tool?

623

What is use Array size in datastage

1305

What are the differences between datastage and informatica?

569

Is it possible to query a hash file?

1564

how to implement scd2 in datastage 7.5 with lookup stage

5128

How to write a expression to display the first letter in Caps in each word using transformer stage ? Please let me know ASAP Thanks in advance...

1078

What is the use of hoursfromtime() function in transformer stage in datastage?

582

What are the main features of datastage?

662

1)s.key generate 1 to 700 records today. tomorrow another 400 will updated how to update the records using s.key generator? 2)source is like :-- DB --> T/F stage1 --> seq1file T/f 1 is linking with T/F2 ---> seq 2 how to load the data? in source i given some conditions those r going in seq1. The another data will going to seq2 how to do this ?

1627

Is it possible to implement parallelism in Mainframe Jobs ? If Yes how ? If no why ?

1780