First, read the file using a database stage or a Sequential File stage,
and make sure the file has company as a column. Pass its output to an
Aggregator stage, and in the Aggregator set group = company, column to
calculate = sal, and maximum value output column = maxsal. Then map the
result to the target dataset and you will get the required result.
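As a rough illustration of what that Aggregator configuration computes (this is plain Python, not DataStage code), the sketch below groups rows by company and keeps the largest sal per group. The sample rows and the helper name max_sal_by_company are made up for the example; only the column names company, sal and maxsal come from the post.

```python
import csv
import io

# Hypothetical sample input; only the column names "company" and "sal"
# come from the post above.
SAMPLE = """company,name,sal
A,ann,100
A,bob,300
B,cat,250
B,dan,200
"""


def max_sal_by_company(rows):
    """Mimic an Aggregator grouped on company with maximum value on sal."""
    maxsal = {}
    for row in rows:
        company = row["company"]
        sal = int(row["sal"])
        # Keep the largest salary seen so far for this company.
        if company not in maxsal or sal > maxsal[company]:
            maxsal[company] = sal
    return maxsal


rows = list(csv.DictReader(io.StringIO(SAMPLE)))
result = max_sal_by_company(rows)
print(result)
```

Note that the result has one maxsal value per company, not a single value for the whole file.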
First, use a Sort stage to sort the records on the sal column in
descending order, and uniquely identify each record with a Surrogate
Key stage. After that, use a Filter stage to keep only the first record
based on the sur column, and map only the sal column to the output.
That gives the max sal.
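The same idea can be sketched in plain Python to show why it yields a single value. This is only an illustration of the logic, assuming all records are processed together in one stream (in a parallel job, that would mean running the sort and key generation sequentially or on one partition). The sample records and the function name max_sal_via_sort are hypothetical; sur and sal are the column names used in the post.

```python
# Hypothetical salary records for illustration.
records = [
    {"name": "ann", "sal": 100},
    {"name": "bob", "sal": 300},
    {"name": "cat", "sal": 250},
]


def max_sal_via_sort(records):
    """Sort descending on sal, number the rows, keep the row numbered 1."""
    # Sort stage: descending order on sal.
    ordered = sorted(records, key=lambda r: r["sal"], reverse=True)
    # Surrogate key: number the sorted rows 1, 2, 3, ...
    keyed = [dict(r, sur=i) for i, r in enumerate(ordered, start=1)]
    # Filter stage: keep only the row where sur = 1.
    first = [r for r in keyed if r["sur"] == 1]
    # Output mapping: pass through only the sal column.
    return [r["sal"] for r in first]


top = max_sal_via_sort(records)
print(top)
```

Sorting every record just to read one value is more work than a direct maximum, but it mirrors the stage-by-stage flow described above.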
If we use the Aggregator stage, it gives the max value for each
group. In your example it will group by company and then give the
max sal from each group. But how do we get the max sal across all
the groups? I mean, the output should be only one value.
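Create one dummy key column and set its value to 1 for all input
rows, then use a Remove Duplicates stage. Use this dummy column as
the key, sort and partition on the dummy column, and sort the salary
column in descending order. In the Properties tab, set Duplicate To
Retain to First, because after a descending sort the highest salary
is the first record in the group (Last would return it only with an
ascending sort). Do not place company ahead of salary in the sort
keys, or the retained row will be the top salary of one company only.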
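A small Python sketch of the dummy-key idea, for illustration only: every row gets the same key, so the whole input behaves as one group and de-duplicating on that key leaves exactly one row. The sketch keeps the first row after a descending sort on salary, since that is where the highest value sits. The sample rows and the function name overall_max are assumptions made for the example.

```python
# Hypothetical input rows for illustration.
rows = [
    {"company": "A", "sal": 100},
    {"company": "A", "sal": 300},
    {"company": "B", "sal": 250},
]


def overall_max(rows):
    """Add a constant dummy key, sort by (dummy, sal desc), dedupe on dummy."""
    # Dummy key: the same value (1) on every row.
    keyed = [dict(r, dummy=1) for r in rows]
    # Sort on the dummy key, then on salary in descending order.
    keyed.sort(key=lambda r: (r["dummy"], -r["sal"]))
    # Remove duplicates on the dummy key, retaining the first row per key.
    seen = set()
    kept = []
    for r in keyed:
        if r["dummy"] not in seen:
            seen.add(r["dummy"])
            kept.append(r)
    return kept


winner = overall_max(rows)
print(winner)
```

Because the key is constant, the output is a single row regardless of how many companies appear in the input.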
Unix question asked in a DataStage interview:
I have different types of files (.txt, .tmp, .bat, etc.) in 4 different directories. I want to move all the '.txt' files from the 4 directories to another folder.
I also need to delete all the files except those created TODAY. How can I do this?
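One possible way to approach this, sketched in Python rather than shell (an interviewer would more likely expect find and mv). The function names, directory names and file names below are made up for the demo. Two assumptions to note: most Unix filesystems do not expose a reliable creation time, so the sketch treats "created today" as "last modified today", and it assumes no two source directories contain a .txt file with the same name.

```python
import os
import shutil
import tempfile
from datetime import date, datetime
from pathlib import Path


def move_txt_files(source_dirs, target_dir):
    """Move every *.txt file found directly inside source_dirs to target_dir."""
    target = Path(target_dir)
    target.mkdir(parents=True, exist_ok=True)
    moved = []
    for d in source_dirs:
        for path in sorted(Path(d).glob("*.txt")):
            if path.is_file():
                # Assumption: file names are unique across the source dirs.
                dest = target / path.name
                shutil.move(str(path), str(dest))
                moved.append(dest)
    return moved


def delete_not_modified_today(directory, today=None):
    """Delete regular files whose modification date is not today."""
    today = today or date.today()
    deleted = []
    for path in sorted(Path(directory).iterdir()):
        if path.is_file():
            # Modification time stands in for creation time (assumption).
            modified = datetime.fromtimestamp(path.stat().st_mtime).date()
            if modified != today:
                path.unlink()
                deleted.append(path.name)
    return deleted


# Demo on throwaway directories so nothing real is touched.
with tempfile.TemporaryDirectory() as root:
    root = Path(root)
    sources = [root / f"dir{i}" for i in range(1, 5)]
    for i, d in enumerate(sources, start=1):
        d.mkdir()
        (d / f"report{i}.txt").write_text("data")
        (d / f"scratch{i}.tmp").write_text("temp")
    moved_names = [p.name for p in move_txt_files(sources, root / "target")]

    old = sources[0] / "old.bat"
    old.write_text("echo old")
    os.utime(old, (946684800, 946684800))  # backdate to the year 2000
    deleted = delete_not_modified_today(sources[0])
    remaining = sorted(p.name for p in sources[0].iterdir())

print(moved_names, deleted, remaining)
```

The question does not say which directory the delete applies to, so the sketch takes the directory as a parameter instead of guessing.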
In my previous project we got data from the mainframe and loaded it
into DB2 tables through DataStage. Sometimes we received the data as
a flat file, and sometimes we fetched it directly from the mainframe
tables. Is this a migration project?