
When the Sort stage already has a remove-duplicates option, why do we
also have a Remove Duplicates stage in PX, given that it is
recommended to sort the data before using the Remove Duplicates
stage? I have been thinking about this for days.

Question Posted / phani kumar


Answer Posted / phani kumar

The Sort stage sorts the data and can also identify duplicate
records through the value of the key change column. However,
sorting and removing duplicates together in one stage reduces
performance, so this approach is preferable for small amounts
of data.
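
As a rough illustration of what a key change column does, here is a small sketch in plain Python (not DataStage code; the function name add_key_change, the column name keyChange, and the sample rows are made up for this example). After sorting on the key, the first row of each key group is flagged 1 and every later row with the same key is flagged 0, so the rows flagged 0 are the duplicates.

```python
def add_key_change(rows, key):
    """Sort rows on `key` and add a key-change flag to each row.

    The flag is 1 for the first row of each key group and 0 for the rest.
    This is only an illustration of the idea, not the stage's implementation.
    """
    ordered = sorted(rows, key=lambda r: r[key])  # Python's sort is stable
    flagged = []
    previous = object()  # sentinel that is not equal to any real key value
    for row in ordered:
        change = 1 if row[key] != previous else 0
        flagged.append({**row, "keyChange": change})
        previous = row[key]
    return flagged


# Hypothetical sample data
rows = [
    {"id": 2, "name": "b"},
    {"id": 1, "name": "a"},
    {"id": 2, "name": "c"},
]
flagged = add_key_change(rows, "id")

# Keeping only the rows where the key changed drops the duplicates.
unique = [r for r in flagged if r["keyChange"] == 1]
```

Filtering on the flag afterwards is one way to separate unique rows from duplicates once the data is sorted.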

The Remove Duplicates stage returns only unique records, retaining
either the first or the last occurrence of each key. For large
amounts of data, sorted input is required for better performance.
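
The first/last choice and the need for sorted input can be sketched in plain Python (again not DataStage code; remove_duplicates and its retain parameter are hypothetical names for this example). The sketch assumes the de-duplication step only compares neighbouring rows, which is why unsorted input can leave duplicates behind.

```python
from itertools import groupby
from operator import itemgetter


def remove_duplicates(rows, key, retain="first"):
    """Keep one row per run of adjacent rows that share the same key.

    Only neighbouring rows are compared, so the input is expected
    to be sorted on `key` already.
    """
    result = []
    for _, group in groupby(rows, key=itemgetter(key)):
        group = list(group)
        result.append(group[0] if retain == "first" else group[-1])
    return result


# Hypothetical sample data
rows = [
    {"id": 2, "v": "x"},
    {"id": 1, "v": "y"},
    {"id": 2, "v": "z"},
]
sorted_rows = sorted(rows, key=itemgetter("id"))

first = remove_duplicates(sorted_rows, "id", retain="first")
last = remove_duplicates(sorted_rows, "id", retain="last")

# Without sorting, the two id=2 rows are not adjacent, so both survive.
unsorted_result = remove_duplicates(rows, "id")
```

With sorted input each key appears once in the output; with the unsorted list all three rows come back, because the duplicate keys never sit next to each other.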

Correct me if I am wrong.

Thanks and regards,
Phani kumar

