when U have a remove dublicate option in sort stage, why we
have a remove dublicate stage in PX, thought it is
recamended to sort data before using a remove dublicate
stage. I hae been thinking this from days....
Answer Posted / phani kumar
Sort stage is used to sort the data and having option of
identifying the duplicate records with the value of Key
change column. But, to perform sort and remove duplicates is
leads to decrease the performance. So, it is preferable for
less amount of data.
Remove duplicates stage is used to get only unique records
either first occurrence or last occurrences. For large
amount of data, sorted data is required for better performance.
Correct me if iam wrong..........
Thanks and regards....
Phani kumar
| Is This Answer Correct ? | 8 Yes | 0 No |
Post New Answer View All Answers
Where the datastage stored his repository?
How a server job can be converted to a parallel job?
How will you move hashed file from one location to another location?
How and where you used hash file?
Triggers,VIEW,Procedures
how to add a new records into source?
project Steps,hits, Project level HArd things,Solved methods?
how to export or import the jobs in .ISX file
Source has 2 columns: USA,NewYork INDIA,MUMBAI INDIA,DELHI UDS,CHICAGO INDIA,PUNE i want data in target like below: INDIA,MUMBAI1 INDIA,DELHI2 INDIA,PUNE3 USA,NEWYORK1 USA,CHICAGO2
Does datastage support slowly changing dimensions ?
What are the some differences between 7.x and 8.x version of datastage?
What is a ds designer?
how do u catch bad rows from OCI stage? And what CLI stands for?
What are the main differences you have observed between 7.x and 8.x version of datastage?
What is a merge?