how can find maximum salary by using Remove duplicate stage?

Answer Posted / subbuchamala

For RMD Stage we need Grouping Keys, Add one extra hard coded Column to each Input records in previous stage.
DUMMY=1

In RMD Stage,
HASH partition on DUMMY, Sort DESC on Salary

and select DUMMY as Key in properties tab.
"Duplicate to Retain"=First.

then you will get only Highest Salart only from RMD Stage

Is This Answer Correct ?    5 Yes 0 No



Post New Answer       View All Answers


Please Help Members By Posting Answers For Below Questions

Explain the importance of surrogate key in data warehousing?

719


if we using two sources having same meta data and how to check the data in two sources is same or not? and if the data is not same i want to abort the job ?how we can do this?

1760


how to get sum of sal based on dept_no and then sum of all sal irrespective of dept_no in same sql. output:- 10, 200(sum of sal for dept_no 10), 5000(sum of all sal)

530


Define Job control?

681


What are the various kinds of the hash file?

667






What is ibm datastage?

609


tell me 5 situations when we r using oracle db stages like orecle connector, oracle enterprise

2710


What are the different type of jobs in datastage?

611


What is the Environment Variable need to Set to TRIM in Project Level?(In transfermer, we TRIM function but I need to impliment this project level using Environment variable)

1905


Highlight the main features of datastage?

619


How do you register plug-ins?

606


What is the difference between an operational datastage and a data warehouse?

640


What is staging variable?

626


Explain how a source file is populated?

768


How the ipc stage work?

680