One file contains:
col1
100
200
300
400
500
100
300
600
300
From this I want to retrieve only the duplicates, like this:
tr1
100
100
300
300
300
How is this possible in DataStage? Can anyone please explain clearly?
Answer Posted / prasad
           +--> Agg --> Filter1 --+
           |                      |
           |                      v
file --> cp ------------------> Join --> Filter2 --> Target1
                                            |
                                            v
                                         Target2
Agg: Use an Aggregator stage grouped on col1, set Aggregation Type = Count Rows, and set Count Output Column = Count (user defined).
The Aggregator stage will then generate:
col1   Count
-----  -----
100    2
200    1
300    3
400    1
500    1
600    1
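Filter1: Give a condition like Count=1 (you will get the unique records from Filter1).
Join stage: Use a Left Outer Join on col1, with the Copy (cp) output as the left link and the Filter1 output as the right link.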
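Filter2:
Where clause for the Target1 link: column_name = '' (null), where column_name is a column that comes from the Filter1 link. It is empty for rows that had no match in the join, so you will get the duplicate records.
Target1 output: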
100
100
300
300
300
Where clause for the Target2 link: column_name <> '' (you will get the unique records).
Target2 output:
200
400
500
600
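For anyone who wants to check the logic outside DataStage, here is a rough Python sketch of the same flow. It is not DataStage code, only an illustration of what each stage does to the sample data; the function name split_duplicates and the variable names are made up for this example, and None stands in for the null that the left outer join produces.

```python
from collections import Counter


def split_duplicates(values):
    """Mimic the Copy -> Aggregator -> Filter1 -> Join -> Filter2 flow."""
    # Aggregator: count rows per key value.
    counts = Counter(values)
    # Filter1: keep only the keys whose Count = 1 (the unique values).
    unique_keys = {v: c for v, c in counts.items() if c == 1}

    target1, target2 = [], []
    for v in values:
        # Left outer join: look up each input row in the Filter1 output.
        matched_count = unique_keys.get(v)  # None plays the role of null
        # Filter2: no match means a duplicate row, a match means a unique row.
        if matched_count is None:
            target1.append(v)
        else:
            target2.append(v)
    return target1, target2


rows = [100, 200, 300, 400, 500, 100, 300, 600, 300]
duplicates, uniques = split_duplicates(rows)
print(sorted(duplicates))  # [100, 100, 300, 300, 300]
print(uniques)             # [200, 400, 500, 600]
```

The duplicates are sorted before printing only so they line up with the Target1 output above; the sketch itself keeps the rows in input order.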
Please correct me if I am wrong :)