If a file contains 1000 records, I need the output to contain those 1000 records, with a header made of the file name concatenated with the current timestamp and a trailer holding the count of records.
Answer / raj
Method 1:
To achieve this, the file name has to be parameterized.
Row Generator ->
Source -> Column Generator -> Funnel -> Destination
Aggregator ->
Column Generator:
Create a column named 'Count' that starts at 1 and increments by 1. This way, the last row will have the value 1000 in the 'Count' column.
Aggregator:
Take the maximum of 'Count', which is 1000.
The actual data is also passed through a separate link.
The Aggregator result is passed into the Funnel as the trailer count.
The three inputs
1. File name concatenated with the timestamp, passed from the Row Generator (the header).
2. Actual data, passed directly after the Column Generator.
3. Trailer count, passed from the Aggregator through the third link.
are joined in a Funnel stage.
Because the Funnel needs the same columns on all of its inputs, all other columns on the header and trailer links are set to NULL.
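The data flow of Method 1 can be sketched outside DataStage in plain Python. This is only an illustration of the logic, not DataStage code: the function name `funnel_with_header_trailer`, the underscore between file name and timestamp, and the timestamp format are assumptions, and the sketch assumes the Funnel reads its inputs one after another so the header comes first and the trailer last.

```python
from datetime import datetime


def funnel_with_header_trailer(file_name, records, now=None):
    """Mimic the three-link funnel: header row, data rows, trailer row.

    Hypothetical helper for illustration; names and formats are assumptions.
    """
    now = now or datetime.now()
    # Every funnel input must have the same number of columns,
    # so the header and trailer rows are padded with None (NULL).
    width = len(records[0]) if records else 1
    pad = [None] * (width - 1)

    # Link 1 (Row Generator): file name concatenated with the timestamp.
    header = [f"{file_name}_{now:%Y%m%d%H%M%S}"] + pad

    # Link 2 (Column Generator): running 'Count' starting at 1.
    counted = [(count, list(row)) for count, row in enumerate(records, start=1)]

    # Link 3 (Aggregator): max of 'Count' is the number of records.
    trailer_count = max((count for count, _ in counted), default=0)
    trailer = [str(trailer_count)] + pad

    # Funnel: header, then the data rows, then the trailer.
    return [header] + [row for _, row in counted] + [trailer]
```

Calling it with 1000 input rows would return 1002 rows, the last one carrying "1000" in its first column.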
Method 2:
This is an easier method that uses an after-job subroutine.
After the job completes, run an after-job subroutine to get the record count from any link and append it to the file.
A similar operation can be done for the header record.
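As a rough sketch of what such a post-processing step could do, here is a small Python script that the job might call once it has finished (for example through a shell command). It is not a DataStage subroutine: the function name `add_header_trailer`, the temporary file suffix, and the timestamp format are assumptions, and it counts the lines of the output file itself rather than reading a link count.

```python
import os
from datetime import datetime


def add_header_trailer(path, now=None):
    """Rewrite `path` with a header line and a trailer line.

    Hypothetical helper for illustration; names and formats are assumptions.
    Returns the number of data records found in the file.
    """
    now = now or datetime.now()
    # Header: file name concatenated with the current timestamp.
    header = f"{os.path.basename(path)}_{now:%Y%m%d%H%M%S}"
    tmp_path = path + ".tmp"
    count = 0
    with open(path, "r", newline="") as src, open(tmp_path, "w", newline="") as dst:
        dst.write(header + "\n")
        for line in src:
            # Copy each record, making sure the last one ends with a newline.
            dst.write(line if line.endswith("\n") else line + "\n")
            count += 1
        # Trailer: the number of records copied.
        dst.write(f"{count}\n")
    # Replace the original file only after the new one is fully written.
    os.replace(tmp_path, path)
    return count
```

Writing to a temporary file and swapping it in at the end keeps the original output intact if the script fails midway.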
Can you let us know what you have tried?
www.datagenx.net
http://goo.gl/atN4AT