When the Sort stage already has a remove-duplicates option, why
do we also have a Remove Duplicates stage in PX, even though it is
recommended to sort the data before using the Remove Duplicates
stage? I have been thinking about this for days.





Answer / prasu

The Remove Duplicates stage offers more options than the Sort
stage for removing duplicates. If you have a small amount of
data, you can use the Sort stage to remove duplicates. If
you have a large amount of data, use the Remove Duplicates
stage.



Answer / phani kumar

The Sort stage sorts the data and can also identify duplicate
records through the value of the key change column. However,
sorting and removing duplicates in the same stage reduces
performance, so this approach is preferable only for small
amounts of data.

The Remove Duplicates stage returns only unique records, keeping
either the first or the last occurrence of each key. For large
amounts of data, the input should already be sorted for better performance.

Correct me if I am wrong.

Thanks and regards,
Phani Kumar
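
To illustrate the "first occurrence or last occurrence" choice described above, here is a minimal Python sketch. It is not DataStage code: the function name `remove_duplicates`, the `retain` parameter, and the sample `deptno`/`sal` rows are all hypothetical, and the sketch assumes the rows arrive already sorted on the key, as the answer recommends.

```python
from itertools import groupby
from operator import itemgetter


def remove_duplicates(sorted_rows, key, retain="first"):
    """Keep one row per key value from rows already sorted on that key.

    Hypothetical helper that mimics the retain-first / retain-last idea;
    it is not a DataStage API.
    """
    result = []
    # groupby only merges adjacent equal keys, so the input must be sorted.
    for _, group in groupby(sorted_rows, key=itemgetter(key)):
        rows = list(group)
        result.append(rows[0] if retain == "first" else rows[-1])
    return result


# Hypothetical sample data, already sorted on deptno.
rows = [
    {"deptno": 1, "sal": 2000},
    {"deptno": 1, "sal": 2300},
    {"deptno": 2, "sal": 3000},
]

first = remove_duplicates(rows, "deptno", retain="first")
last = remove_duplicates(rows, "deptno", retain="last")
```

Because each key group is adjacent in sorted input, the pass needs only one group in memory at a time, which is one way to read the advice that sorted input helps with large volumes.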



Answer / data master

The Sort stage both sorts the data and removes duplicate
records, which slows the job down (hence it is better to
sort the data at the database level).

If the data is already sorted, use the Remove Duplicates
stage to remove duplicate records, which gives better job
performance than the situation above.



Answer / swati

With the Remove Duplicates stage you get only unique records.

With the Sort stage you get both unique and duplicate records, distinguished by the key change column.
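
As a rough illustration of the key change column idea, the Python sketch below flags each row of sorted input with 1 when the key differs from the previous row and 0 otherwise, then splits the rows on that flag. This is an emulation, not DataStage code: the function name `add_key_change`, the column name `keyChange`, and the sample rows are assumptions made for the example.

```python
def add_key_change(sorted_rows, key):
    """Return copies of the rows with a keyChange flag added.

    keyChange is 1 for the first row of each key group and 0 for the
    rest. Hypothetical helper; assumes rows are sorted on the key.
    """
    flagged = []
    previous = object()  # sentinel that never equals a real key value
    for row in sorted_rows:
        change = 1 if row[key] != previous else 0
        flagged.append({**row, "keyChange": change})
        previous = row[key]
    return flagged


# Hypothetical sample data, already sorted on deptno.
rows = [
    {"deptno": 1, "sal": 2000},
    {"deptno": 1, "sal": 2300},
    {"deptno": 2, "sal": 3000},
]

flagged = add_key_change(rows, "deptno")
unique_rows = [r for r in flagged if r["keyChange"] == 1]
duplicate_rows = [r for r in flagged if r["keyChange"] == 0]
```

Unlike a stage that drops duplicates outright, this keeps every row, so a downstream filter can route the unique and duplicate records to different outputs.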

