when U have a remove dublicate option in sort stage, why we
have a remove dublicate stage in PX, thought it is
recamended to sort data before using a remove dublicate
stage. I hae been thinking this from days....

Answer Posted / phani kumar

Sort stage is used to sort the data and having option of
identifying the duplicate records with the value of Key
change column. But, to perform sort and remove duplicates is
leads to decrease the performance. So, it is preferable for
less amount of data.

Remove duplicates stage is used to get only unique records
either first occurrence or last occurrences. For large
amount of data, sorted data is required for better performance.

Correct me if iam wrong..........

Thanks and regards....
Phani kumar

Is This Answer Correct ?    8 Yes 0 No



Post New Answer       View All Answers


Please Help Members By Posting Answers For Below Questions

What are some prerequisites for datastage?

617


What are the steps required to kill the job in Datastage?

686


What is difference between join, merge and lookup stage?

638


In Datastage, how you can fix the truncated data error?

640


what is the custome stage in datastage? how can we impliment that one? plz tell me

1912






What are the benefits of datastage?

726


How do u convert the columns to rows in datastage?

694


Differentiate between data file and descriptor file?

619


Give an idea of system variables.

589


im new to this tool im now at project plz tell me step by step process how to design plz help me i wnt to go with exp for job plz give me d proper design and explination

1632


1)s.key generate 1 to 700 records today. tomorrow another 400 will updated how to update the records using s.key generator? 2)source is like :-- DB --> T/F stage1 --> seq1file T/f 1 is linking with T/F2 ---> seq 2 how to load the data? in source i given some conditions those r going in seq1. The another data will going to seq2 how to do this ?

1634


Can you explain kafka connector?

777


Where the datastage stored his repository?

618


Can you explain players in datastage?

709


If you want to use a same piece of code in different jobs, how will you achieve this?

796