Follow the following steps - 1. Seq file stage

one file contains
col1
100
200
300
400
500
100
300
600
300
from this i want to retrive the only duplicate like this
tr1
100
100
300
300
300 how it's possible in datastage?can any one plz explain
clearley..........?

Question Posted / pooja

8 Answers
13279 Views
IBM, I also Faced
E-Mail Answers

Answer Posted / pooja

Follow the following steps -

1. Seq file stage - Read the input data in seq file - input1.txt
2. Aggregate stage - count the number of rows (say CountRow) for each ID(group=ID)
3. Filter stage - Filter the data where CountRow<>1
4. Perform join on the output of the step 3 and input1.txt.
You will get the result :)

Is This Answer Correct ?

0 Yes

0 No

Post New Answer View All Answers

Please Help Members By Posting Answers For Below Questions

What is datastage?

1088

Whats difference betweeen operational data stage (ods) and data warehouse?

1117

What is the difference between an operational datastage and a data warehouse?

1155

What is difference between server jobs & parallel jobs?

1039

Which commands are used to import and export the datastage jobs?

1414

What is a ds designer?

1070

What are the functionalities of link collector?

1085

1.new record it will insert but changes of natural key is not present in taget i want to update (here key is composite natural key )can any one help this to explan how to do

2085

How to read multiple files using a single datastage job if files have the same metadata?

1252

How do u convert the columns to rows in datastage?

1195

Explain the datastage parallel extender (px) or enterprise edition (ee)?

1232

What is ibm datastage?

971

Why do we use link partitioner and link collector in datastage?

1139

What are the main features of datastage?

1285

What are the different type of jobs in datastage?

1043