Input Data is:
Emp_Id, EmpInd
100, 0
100, 0
100, 0
101, 1
101, 1
102, 0
102, 0
102, 1
103, 1
103, 1
I want Output
100, 0
100, 0
100, 0
101, 1
101, 1
Means Indicator should either all ZEROs or all ONEs per
EmpId.
Impliment this using SQL and DataStage both.
Answer Posted / subhash
1. SQL:
SELECT * FROM
( SELECT EmpId, COUNT(*) AS CNT1 FROM EMP GROUP BY
EmpId) E1,
( SELECT EmpId, COUNT(*) AS CNT2 FROM EMP GROUP BY
EmpId, EmpInd) E2,
WHERE E1.EmpID = E2.EmpId AND
E1.CNT1 = E2.CNT2;
2.DataStage:
SRC--->CPY---->JOIN----TFM---TGT
| /
| /
| /
| /
AGG
In AGG, GROUP BY EmpId, calculate CNT and SUM.
JOIN both one copy from CPY and 2nd Aggrigated copy from
AGG.
In TFM, put constraint: IF CNT=SUM, then populate to TGT
then u will get required output.
| Is This Answer Correct ? | 5 Yes | 2 No |
Post New Answer View All Answers
Name the different sorting methods in datastage.
How we can covert server job to a parallel job?
Does datastage support slowly changing dimensions ?
AGGREGATOR default datatype
What are data elements?
Difference between data warehousing and olap?
Differentiate between Join, Merge and Lookup stage?
What can we do with datastage director?
What are some different alternative commands associated with "dsjob"?
What are routines in datastage? Enlist various types of routines.
Why fact table is in normal form?
Have you used Unstructured data?
Can you explain players in datastage?
Describe link sort?
How can we improve performance of data stage jobs?