what is the diff between sequential file and fileset stages?
Answers were Sorted based on User's Feedback
Answer / aparna kanduri
File set can be stored on multiple Unix files as flat files.
file set contains discriptor file and individual raw data
files . No. of raw data files depends on configuration file.
There will be some limit like 2G limit on some file
systems . we can distribute data over nodes to prevent
overrun.
In such cases file set will be useful than sequential file.
Is This Answer Correct ? | 11 Yes | 6 No |
Answer / san
Seq. stage:
--------------
Seq file stage by default run in sqe. mode and you define a
seq file stage to run in parallel mode.. by setting option
"number of readers per node" > 1.
Seq. FS is used to read or write data from .txt .dat format
files. But the limit of seq stage is 2 GB.
Fileset stage:
---------------
by default Fileset stage run in parallel mode. more than 2GB
data can be stored. But the fileset hold to typse of information
i) files descriptor -> points to metadata, data location
ii) contains the data, contains multiple files if your using
more than one nodes config file
Is This Answer Correct ? | 5 Yes | 1 No |
Answer / venugopal [patni]
Sequential file is used to read the data sequentially.
It can be configured to execute both in parallel and
sequential mode.We cann't perform lookups using sequential
file.
Fileset stage is used to import the exported files list.
It executes only in parallel mode.
the most importance of it is 2G.B limit on the size of a
file and we need to distribute files among the nodes to
prevent overruns.
Is This Answer Correct ? | 7 Yes | 8 No |
what are .ctl(control files) files ? how the dataset stage have better performance by this files?
Differentiate between validated and Compiled in the Datastage?
Why do you need stage variables?
difference between server shared container and parallel shared container
i have a table col1 10 20 30 40 10 20 50 my requirement is how to retrive only duplicates like 10 10 20 20 like this how it's possible in SQL?
i want job aborted after some records are loaded into output by using only sequential stage and dataset
what is .dsx files
Nls stands for what in datastage?
how can u connect the client system directly at any time?
How you Implemented SCD Type 1 & Type 2 in your project?
if a column contains data like ram,rakesh,madhan,suraj,pradeep,bhaskar then I want to place names separated by commas in another columns how can we do?
What is the difference between SQl Loader and OCI in datastage?