one file contains
col1
100
200
300
400
500
100
300
600
300
from this i want to retrive the only duplicate like this
tr1
100
100
300
300
300 how it's possible in datastage?can any one plz explain
clearley..........?
Answer Posted / pooja
Follow the following steps -
1. Seq file stage - Read the input data in seq file - input1.txt
2. Aggregate stage - count the number of rows (say CountRow) for each ID(group=ID)
3. Filter stage - Filter the data where CountRow<>1
4. Perform join on the output of the step 3 and input1.txt.
You will get the result :)
Is This Answer Correct ? | 0 Yes | 0 No |
Post New Answer View All Answers
Which algorithm you used for your hashfile?
Describe the main features of datastage?
how to connect source to db?generally what r stages u used? how to find the data is having delimiter format?
Why do we use link partitioner and link collector in datastage?
What is process model?
Describe stream connector?
Can we use target hash file as a lookup ?
how to read 100 records at a time in source a) hw is it fr metadata Same and b) if metadata is nt same?
What are some different alternative commands associated with "dsjob"?
How complex jobs are implemented in datstage to improve performance?
Differentiate between Join, Merge and Lookup stage?
What are the job parameters?
If we take 2 tables(like emp and dept),we use join stage and how to improve the performance?
What is use Array size in datastage
Does datastage support slowly changing dimensions ?