source has 2 fields like
COMPANY LOCATION
IBM HYD
TCS BAN
IBM CHE
HCL HYD
TCS CHE
IBM BAN
HCL BAN
HCL CHE
LIKE THIS.......
AND I WILL GET THE OUTPUT LIKE THIS....
Company loc count
TCS HYD 3
BAN
CHE
IBM HYD 3
BAN
CHE
HCL HYD 3
BAN
CHE
PLZ SEND ME ANSWER FOR THIS QUESTION..........
Answers were Sorted based on User's Feedback
Answer / solomon
2 ways to do it
1- input -> Transformer -> RDP-> output
Sort the input on location, company
do a comparison in Stage Variable with prev record and keep incrementing a counter
pass it via Remove duplicate stage to retain unique records
2- input-> copy-> agrregator-> lookup-> RDP -> output
do the same as above using agg and lookup stage
Is This Answer Correct ? | 1 Yes | 0 No |
Answer / sreepathi
Hi
Use stage variables in Transformer stage i.e IF THEN ELSE
IF ..condition comparing companies and use COUNT() function
in transformer for which field u want count
Is This Answer Correct ? | 2 Yes | 3 No |
Answer / saibal kumar giri
Follow below steps-----
1.Read the data with sequential file or connector stage.
2.Use copy stage to streamline the data into two links
a.One data goes to lookup stage.
b.Second stream goes to aggregator stage.
3.In aggregator stage do a count(LOCATION) group by COMPANY.
you will be having data O/P from AGG as below
COMPANY COUNT
TCS 2
HCL 3
IBM 3
Use this output data as the reference link of lookup
stage.Where you have original source as master data.
4.Do a Inner join by lookup stage.
after lookup you will have data like
COMPANY:HCL LOCATION:HYD COUNT:3
COMPANY:HCL LOCATION:CHE COUNT:3
COMPANY:HCL LOCATION:BAN COUNT:3
COMPANY:IBM LOCATION:CHE COUNT:3
COMPANY:IBM LOCATION:BAN COUNT:3
COMPANY:IBM LOCATION:HYD COUNT:3
COMPANY:TCS LOCATION:BAN COUNT:2
COMPANY:TCS LOCATION:CHE COUNT:2
5.Then use a transformer stage along with stage variables
as below
SV1->link1.COMPANY
SV3-> if SV1=SV2 then SET NULL() else link1.COMPANY
SV2->SV1
Similarly for count also
Cheers!!!!!
Is This Answer Correct ? | 0 Yes | 1 No |
CAN WE DO HALF PROJECT IN PARALLEL JOBS AND HALF PROJECT IN SERVER JOBS?
What are some different alternative commands associated with "dsjob"?
what is 'reconsideration error' and how can i respond to this error and how to debug this
Can you explain repository tables in datastage?
Why we use surrogate key?
Sequential file i have one record,i want 100 records in target?How can we do that?Pls explain me and what stages are there?What logic?
How can i approach to write datastage 7.5 Certification? and how much they will charge for examination .What exactly should i do? Can anyone guide me plz?
Why we use parameters instead of hard code in datastage.
Source have 1000 records and it can have three nodes ok but i want how many records are executed on each node?
What is the Main difference between Lookup Failure and Lookup Not Met? Plz explain with Example.
Can you highlight the main features of ibm infosphere information server?
what is the Difference Between Datastage Server Edition and Parallel Edition?