When the Sort stage already has a remove duplicates option, why do we
also have a Remove Duplicates stage in PX, even though it is
recommended to sort the data before using the Remove Duplicates
stage? I have been thinking about this for days.
Answer Posted / phani kumar
The Sort stage sorts the data and also has an option to
identify duplicate records through the value of the key
change column. However, sorting and removing duplicates in the
same stage reduces performance, so that approach is preferable
for small amounts of data.
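
The key change idea described above can be sketched in plain Python. This is only an illustration, not DataStage code: the function name, the keyChange column name, and the sample rows are assumptions made for the example. It assumes the flag is 1 on the first record of each key group and 0 on the records that repeat the key.

```python
from operator import itemgetter

def sort_with_key_change(rows, key):
    """Sort rows on `key` and add a keyChange flag (hypothetical column name):
    1 on the first row of each key group, 0 on rows that repeat the key."""
    result = []
    previous = object()  # sentinel that never equals a real key value
    for row in sorted(rows, key=itemgetter(key)):
        flagged = dict(row)  # copy so the input rows are not modified
        flagged["keyChange"] = 1 if row[key] != previous else 0
        previous = row[key]
        result.append(flagged)
    return result

# Made-up sample data for illustration only.
rows = [
    {"cust_id": 2, "city": "Pune"},
    {"cust_id": 1, "city": "Delhi"},
    {"cust_id": 2, "city": "Mumbai"},
]
flagged = sort_with_key_change(rows, "cust_id")

# Keeping only the rows where the flag is 1 removes the duplicates.
unique = [r for r in flagged if r["keyChange"] == 1]
```

Filtering on the flag afterwards is what turns "identify duplicates" into "remove duplicates" in this sketch.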
The Remove Duplicates stage returns only unique records,
keeping either the first or the last occurrence of each key. For large
amounts of data, the input should already be sorted for better performance.
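
Why sorted input matters can be shown with a small Python sketch. Again, this is not DataStage code, and the function name, the retain parameter, and the sample rows are assumptions for illustration. When the rows arrive sorted on the key, duplicates are adjacent, so one pass that holds a single key group at a time is enough.

```python
from itertools import groupby
from operator import itemgetter

def remove_duplicates(sorted_rows, key, retain="first"):
    """Yield one row per key value from rows already sorted on `key`.
    retain="first" keeps the first occurrence, anything else keeps the last."""
    for _, group in groupby(sorted_rows, key=itemgetter(key)):
        if retain == "first":
            yield next(group)
        else:
            last = None
            for last in group:  # walk to the final row of the group
                pass
            yield last

# Made-up sample data, already sorted on cust_id.
sorted_rows = [
    {"cust_id": 1, "city": "Delhi"},
    {"cust_id": 2, "city": "Pune"},
    {"cust_id": 2, "city": "Mumbai"},
]
first_kept = list(remove_duplicates(sorted_rows, "cust_id", retain="first"))
last_kept = list(remove_duplicates(sorted_rows, "cust_id", retain="last"))
```

In this sketch, groupby only groups adjacent rows, so unsorted input would let duplicates slip through.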
Correct me if I am wrong.
Thanks and regards,
Phani Kumar