how can find maximum salary by using Remove duplicate stage?
Answers were Sorted based on User's Feedback
For RMD Stage we need Grouping Keys, Add one extra hard coded Column to each Input records in previous stage.
DUMMY=1
In RMD Stage,
HASH partition on DUMMY, Sort DESC on Salary
and select DUMMY as Key in properties tab.
"Duplicate to Retain"=First.
then you will get only Highest Salart only from RMD Stage
Is This Answer Correct ? | 5 Yes | 0 No |
Answer / narayana.3ds
R.D Stage remove the duplicates every one knows but u can
find max salary......
Give the key of salary order by descending other
option u can put=first
Is This Answer Correct ? | 5 Yes | 2 No |
Answer / phani kumar
By using Sort stage, sort the data in Descending order, then
use remove duplicates stage, set option as first occurrence
of records. Then we will get max sal record first.
Is This Answer Correct ? | 2 Yes | 1 No |
Answer / akila
In case an explicit sort stage is used like inteh reply
above, the sort will be done on emp_id(key field) and the
salary. But remove duplicate will be done only on emp_id. I
guess that this might result in a warning like 'sort and
remove duplicates keys do not match'.
Is This Answer Correct ? | 0 Yes | 0 No |
State the difference between an operational datastage and a data warehouse?
Drop duplicate records ... SOURCE LIKE .......... ID flag1 flag2 100 N Y 100 N N 100 Y N 101 Y Y 101 N Y 102 Y N 103 N N 104 Y Y 105 N N 106 N Y 102 N Y 105 Y Y in above file if any id having both the flags as "N" then that corresponding id records should be dropped, in above case o/p should be as ID flag1 flag2 101 Y Y 101 N Y 102 Y N 102 N Y 104 Y Y 106 N Y Steps to do : 1) Identified the id’s that got duplicated (both the flag values having vales “N”) 2) Look up with these id’s to existing id’s to drop .
my soure table is emp having columns sal,deptno in the deptno 10,20,30deptno row are there expected out put is min(sal) of 10th deptno,max(sal) of 20th deptno,mean(sal) of 30th deptno using aggregation stage
my source is sequencial file and my target is dataset. i am running the job in two node configuration file. my source having 10 records how the data move to target?
What are the main differences you have observed between 7.x and 8.x version of datastage?
how to call sequential generator in datastage?
Explain the ChangeApply stage?
what is advantages of snowflake when it is used?
What is developer responsibilities in UAT (user acceptance testing and Post implementation phase?
What is "fatal error/rdbms code 3996" error?
how to load meta data
hi i am bhavani, in real time data stage who is the source provide? and how to recevied in developer? Pls send me answer