I have 2 files 1st contains duplicate records only, 2nd file contains Unique records.EX:
File1:
1 subhash 10000
1 subhash 10000
2 raju 20000
2 raju 20000
3 chandra 30000
3 chandra 30000
File2:
1 subhash 10000
5 pawan 15000
7 reddy 25000
3 chandra 30000
Output file:-- capture all the duplicates in both file with count.
1 subhash 10000 3
1 subhash 10000 3
1 subhash 10000 3
2 raju 20000 2
2 raju 20000 2
3 chandra 30000 3
3 chandra 30000 3
3 chandra 30000 3
Answer Posted / subbuchamala
File1,File2====Funnel-----Copy=======1st link AGG, 2nd link JOIN----Filter----OutputFile
1. pass the 2 files to funnel stage and then copy stage.
2. from copy stage 1st link to AGG stage, 2nd link to JOIN stage
3. In AGG stage, Group by Key column say ID, NAME take the count and JOIN based on KEY column
4. Filter on COUNT>1 send the output OutputFile
we get desired output
Is This Answer Correct ? | 14 Yes | 0 No |
Post New Answer View All Answers
Different ways a project can be moved to production ?...example ….export-import and Version control.
What are stage variables?
which r the connectors used in san?
if we using two sources having same meta data and how to check the data in two sources is same or not? and if the data is not same i want to abort the job ?how we can do this?
what are the devoleper roles in real time? plz tell i am new to datastage....
If you want to use a same piece of code in different jobs, how will you achieve this?
What is the difference between Datastage 7.5 and 7.0?
What are the types of containers in datastage?
What is the difference between hashfile and sequential file?
How do you schedule or monitoring the job?
Could anyone give brief explanation bout datastage admin
What are iconv and oconv?
Explain connectivity between datastage with datasources?
What is the difference between passive stage and active stage?
What is the difference between datastage and informatica?