how can find maximum salary by using Remove duplicate stage?

Categories >> Software >> Data Warehouse >> Data Stage
Suggest New Category

Question Posted / bkarthikmca

5 Answers
12855 Views
IBM, I also Faced
E-Mail Answers

Answers were Sorted based on User's Feedback

how can find maximum salary by using Remove duplicate stage?..

Answer / subbuchamala

For RMD Stage we need Grouping Keys, Add one extra hard coded Column to each Input records in previous stage.
DUMMY=1

In RMD Stage,
HASH partition on DUMMY, Sort DESC on Salary

and select DUMMY as Key in properties tab.
"Duplicate to Retain"=First.

then you will get only Highest Salart only from RMD Stage

Is This Answer Correct ?

5 Yes

0 No

how can find maximum salary by using Remove duplicate stage?..

Answer / narayana.3ds

R.D Stage remove the duplicates every one knows but u can
find max salary......
Give the key of salary order by descending other
option u can put=first

Is This Answer Correct ?

5 Yes

2 No

how can find maximum salary by using Remove duplicate stage?..

Answer / phani kumar

By using Sort stage, sort the data in Descending order, then
use remove duplicates stage, set option as first occurrence
of records. Then we will get max sal record first.

Is This Answer Correct ?

2 Yes

1 No

how can find maximum salary by using Remove duplicate stage?..

Answer / akila

In case an explicit sort stage is used like inteh reply
above, the sort will be done on emp_id(key field) and the
salary. But remove duplicate will be done only on emp_id. I
guess that this might result in a warning like 'sort and
remove duplicates keys do not match'.

Is This Answer Correct ?

0 Yes

0 No

how can find maximum salary by using Remove duplicate stage?..

Answer / anitha

https://www.docstoc.com/pro/2134166

Is This Answer Correct ?

1 Yes

4 No

Post New Answer

More Data Stage Interview Questions

State the difference between an operational datastage and a data warehouse?

0 Answers

Drop duplicate records ... SOURCE LIKE .......... ID flag1 flag2 100 N Y 100 N N 100 Y N 101 Y Y 101 N Y 102 Y N 103 N N 104 Y Y 105 N N 106 N Y 102 N Y 105 Y Y in above file if any id having both the flags as "N" then that corresponding id records should be dropped, in above case o/p should be as ID flag1 flag2 101 Y Y 101 N Y 102 Y N 102 N Y 104 Y Y 106 N Y Steps to do : 1) Identified the id’s that got duplicated (both the flag values having vales “N”) 2) Look up with these id’s to existing id’s to drop .

2 Answers

my soure table is emp having columns sal,deptno in the deptno 10,20,30deptno row are there expected out put is min(sal) of 10th deptno,max(sal) of 20th deptno,mean(sal) of 30th deptno using aggregation stage

4 Answers

my source is sequencial file and my target is dataset. i am running the job in two node configuration file. my source having 10 records how the data move to target?

3 Answers TCS,

What are the main differences you have observed between 7.x and 8.x version of datastage?

0 Answers

how to call sequential generator in datastage?

1 Answers IBM,

Explain the ChangeApply stage?

11 Answers

what is advantages of snowflake when it is used?

3 Answers IBM,

What is developer responsibilities in UAT (user acceptance testing and Post implementation phase?

0 Answers

What is "fatal error/rdbms code 3996" error?