The source file has columns like this:
name company
krish IBM
pooja TCS
nandini WIPRO
krish IBM
pooja TCS
If a row is repeated, I want the result to look like this:
name company count
krish IBM 1
pooja TCS 1
nandini WIPRO 1
krish IBM 2
pooja TCS 2
Answer Posted / ankit gosain
Hi ALL,
Job Design:
SourceSeqFile--->SortStage--->Transformer--->TgtSeqFile
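1. In the Sort stage, use two keys, name and company, then go
to Options and set Create Key Change Column to True. This adds a
keyChange column that is 1 on the first row of each key group
and 0 on the rows that follow.
2. In the Transformer stage, create a stage variable of integer
type (say Var1) and write in its derivation: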
if keyChange=1 then 1 else Var1+1
3. Now create a new column in the target (say count) and, in the
Transformer, assign Var1 to the derivation of count.
4. Go to the output tab of the Transformer and sort the data
there on the count column.
You'll get the desired output.
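
For readers who want to check the logic outside DataStage, the steps above can be sketched in Python. This is only an emulation of the idea, not DataStage code: the function name number_duplicates and the variables key_change and var1 are made up for illustration and simply mirror keyChange and Var1 from the steps.

```python
# Sample rows from the question: (name, company)
rows = [
    ("krish", "IBM"),
    ("pooja", "TCS"),
    ("nandini", "WIPRO"),
    ("krish", "IBM"),
    ("pooja", "TCS"),
]


def number_duplicates(rows):
    """Emulate Sort (with key change) -> Transformer -> sort on count."""
    # Step 1: sort on both keys so identical rows become adjacent.
    ordered = sorted(rows)

    numbered = []
    prev_key = None
    var1 = 0  # plays the role of the stage variable Var1
    for row in ordered:
        # key_change is 1 on the first row of each key group, else 0.
        key_change = 1 if row != prev_key else 0
        # Step 2: if keyChange=1 then 1 else Var1+1
        var1 = 1 if key_change == 1 else var1 + 1
        # Step 3: write Var1 into the new count column.
        numbered.append((row[0], row[1], var1))
        prev_key = row

    # Step 4: sort on count (Python's sort is stable, so rows with the
    # same count stay in key order).
    return sorted(numbered, key=lambda r: r[2])


for name, company, count in number_duplicates(rows):
    print(name, company, count)
```

Note that because the data is sorted on name and company first, rows with the same count come out in key order (krish, nandini, pooja) rather than in the original input order (krish, pooja, nandini). The counts themselves match the requested result.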
If you have more questions, you can mail me at
ankitgosain@gmail.com
Cheers,
Ankit :)