Sort stage is used to sort the data and having option of identifying the d

when U have a remove dublicate option in sort stage, why we
have a remove dublicate stage in PX, thought it is
recamended to sort data before using a remove dublicate
stage. I hae been thinking this from days....

Question Posted / phani kumar

4 Answers
23611 Views
Target, I also Faced
E-Mail Answers

Answer Posted / phani kumar

Sort stage is used to sort the data and having option of
identifying the duplicate records with the value of Key
change column. But, to perform sort and remove duplicates is
leads to decrease the performance. So, it is preferable for
less amount of data.

Remove duplicates stage is used to get only unique records
either first occurrence or last occurrences. For large
amount of data, sorted data is required for better performance.

Correct me if iam wrong..........

Thanks and regards....
Phani kumar

Is This Answer Correct ?

8 Yes

0 No

Post New Answer View All Answers

Please Help Members By Posting Answers For Below Questions

Where the datastage stored his repository?

1089

How a server job can be converted to a parallel job?

1096

How will you move hashed file from one location to another location?

2152

How and where you used hash file?

1175

Triggers,VIEW,Procedures

1221

how to add a new records into source?

2000

project Steps,hits, Project level HArd things,Solved methods?

2133

how to export or import the jobs in .ISX file

1203

Source has 2 columns: USA,NewYork INDIA,MUMBAI INDIA,DELHI UDS,CHICAGO INDIA,PUNE i want data in target like below: INDIA,MUMBAI1 INDIA,DELHI2 INDIA,PUNE3 USA,NEWYORK1 USA,CHICAGO2

781

Does datastage support slowly changing dimensions ?

1106

What are the some differences between 7.x and 8.x version of datastage?

1256

What is a ds designer?

1070

how do u catch bad rows from OCI stage? And what CLI stands for?

2800

What are the main differences you have observed between 7.x and 8.x version of datastage?

1089

What is a merge?

1169