what is the diff between sequential file and fileset stages?
Answers were Sorted based on User's Feedback
Answer / aparna kanduri
File set can be stored on multiple Unix files as flat files.
file set contains discriptor file and individual raw data
files . No. of raw data files depends on configuration file.
There will be some limit like 2G limit on some file
systems . we can distribute data over nodes to prevent
overrun.
In such cases file set will be useful than sequential file.
| Is This Answer Correct ? | 11 Yes | 6 No |
Answer / san
Seq. stage:
--------------
Seq file stage by default run in sqe. mode and you define a
seq file stage to run in parallel mode.. by setting option
"number of readers per node" > 1.
Seq. FS is used to read or write data from .txt .dat format
files. But the limit of seq stage is 2 GB.
Fileset stage:
---------------
by default Fileset stage run in parallel mode. more than 2GB
data can be stored. But the fileset hold to typse of information
i) files descriptor -> points to metadata, data location
ii) contains the data, contains multiple files if your using
more than one nodes config file
| Is This Answer Correct ? | 5 Yes | 1 No |
Answer / venugopal [patni]
Sequential file is used to read the data sequentially.
It can be configured to execute both in parallel and
sequential mode.We cann't perform lookups using sequential
file.
Fileset stage is used to import the exported files list.
It executes only in parallel mode.
the most importance of it is 2G.B limit on the size of a
file and we need to distribute files among the nodes to
prevent overruns.
| Is This Answer Correct ? | 7 Yes | 8 No |
Source Like department_no, employee_name ---------------------------- 20, R 10, A 10, D 20, P 10, B 10, C 20, Q 20, S and Output should be like this department_no, employee_list -------------------------------- 10, A 10, A,B 10, A,B,C 10, A,B,C,D 20, A,B,C,D,P 20, A,B,C,D,P,Q 20, A,B,C,D,P,Q,R 20, A,B,C,D,P,Q,R,S
I have a few records just I want to store data in to targets cycling way how?
Hi guys, In sequencer job, we have 3 sources, in that 1st source have some records, Here requirement is 1st source records are 100 then only run the job otherwise total job will abort... How to calculate this. please design the job. Thanks.
Define APT_CONFIG in Datastage?
How do u view log file for a particular job?
How do you run datastage job from the command line?
Why we use surrogate key?
hi this is kiran i have one table i want divide the table with two different table like even rows and odd rows how can i do this one tell me plzz
Which algorithm you used for your hashfile?
how do u catch bad rows from OCI stage? And what CLI stands for?
What is a folder? Difference types of stages?
How did you reconcile source with target?