when U have a remove dublicate option in sort stage, why we
have a remove dublicate stage in PX, thought it is
recamended to sort data before using a remove dublicate
stage. I hae been thinking this from days....
Answers were Sorted based on User's Feedback
Answer / prasu
In Duplicate Stages we have more number of optionscompare
to sort while removing duplicates.If you have less number
if data you can go with Sort stage to remove duolicats.If
you have large number of data go for Remove Duplicates
Stage.
| Is This Answer Correct ? | 8 Yes | 0 No |
Answer / phani kumar
Sort stage is used to sort the data and having option of
identifying the duplicate records with the value of Key
change column. But, to perform sort and remove duplicates is
leads to decrease the performance. So, it is preferable for
less amount of data.
Remove duplicates stage is used to get only unique records
either first occurrence or last occurrences. For large
amount of data, sorted data is required for better performance.
Correct me if iam wrong..........
Thanks and regards....
Phani kumar
| Is This Answer Correct ? | 8 Yes | 0 No |
Answer / data master
Sort Stage do Sorting of data and performing Remove
Duplicate records, which will slow the performance of job
(Hence it is better to sort data at database level).
If the data is already sorted than use the Remove Duplicate
Stage to remove duplicate records, Which will give better
performance of job than above situation.
| Is This Answer Correct ? | 3 Yes | 2 No |
Answer / swati
In Remove Duplicate stage you will get only unique records.
In sort Stage you will get both unique and duplicate records based on key change column.
| Is This Answer Correct ? | 1 Yes | 0 No |
I have 2 jobs.I want to ru job B if job A has run 3 times.How can I achieve this through datastage
Hi friends If anybody completed datastage 8.5 certification.Could please guide me what are the topics new in 8.5 certification.
how to remove duplicates in transformer stage by using stage variables?one example?
What are the benefits of datastage?
In my previous project we get data from mainframe and load it into datastage DB2 tables.Sometimes we get data as flat file or a mainframe tables itself directly we fetch the data.Is this a migration project?
input like 2 7 8 9 5 1 7 3 6 output:2 5 6 how to find out this plz explain?
can any 1 put a difference between datastage 7.5 and 8.0 all concepts plz r mail 2 me anu4u09@gmail.com
how to cleansing data
HOW WILL YOU IMPLEMENT SURROGATE KEY IN SCD BY USING SURR_KEY GENERATOR,THE VALUE OF S_KEY SHOULD NOT REPEAT EVEN IF THE JOB IS COMPILED REPEATEDELY?
There are two file are there .1st file contains 5 records and 2nd file contain 10 records in target they want 50 records.how can achieve this
Can you explain kafka connector?
Name the different sorting methods in datastage.