What is the difference between the Sequential File and File Set stages?
Answer / aparna kanduri
A file set can be stored as multiple Unix flat files.
It consists of a descriptor file and individual raw data
files. The number of raw data files depends on the
configuration file.
Some file systems impose a limit on the size of a single
file, such as 2 GB. Distributing the data over the nodes
prevents overruns.
In such cases a file set is more useful than a sequential file.
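To make the descriptor-plus-data-files idea concrete, here is a minimal Python sketch. It is not how DataStage implements file sets: the JSON descriptor layout, the function names, and the `.partN` file naming are all invented for illustration. It only shows records being spread over one data file per node directory, with a descriptor that lists those files.

```python
import json
import os


def write_file_set(records, node_dirs, descriptor_path, name="demo"):
    """Spread records round-robin over one data file per node directory
    and write a descriptor that lists the data files."""
    data_paths = [
        os.path.join(d, f"{name}.part{i}") for i, d in enumerate(node_dirs)
    ]
    handles = [open(p, "w", encoding="utf-8") for p in data_paths]
    try:
        for i, record in enumerate(records):
            handles[i % len(handles)].write(record + "\n")
    finally:
        for h in handles:
            h.close()
    # Hypothetical descriptor layout; a real .fs file looks different.
    with open(descriptor_path, "w", encoding="utf-8") as f:
        json.dump({"data_files": data_paths}, f)
    return data_paths


def read_file_set(descriptor_path):
    """Read every data file named in the descriptor, one file at a time."""
    with open(descriptor_path, encoding="utf-8") as f:
        data_paths = json.load(f)["data_files"]
    records = []
    for p in data_paths:
        with open(p, encoding="utf-8") as f:
            records.extend(line.rstrip("\n") for line in f)
    return records
```

Because no single data file holds all the records, each file stays smaller than the whole data set, which is the point made above about per-file size limits.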
Answer / san
Sequential File stage:
----------------------
By default the Sequential File stage runs in sequential mode.
You can make it run in parallel mode by setting the option
"Number of readers per node" to a value greater than 1.
The Sequential File stage is used to read data from or write
data to flat files such as .txt and .dat files. A single
sequential file, however, is subject to the 2 GB file size limit.
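As a rough illustration of what several readers on one file means, the Python sketch below splits a file into byte ranges and lets each "reader" return only the lines that begin in its range. This is an assumption-laden sketch, not DataStage's implementation: the function names are hypothetical, and the readers run one after another here rather than in parallel.

```python
import os


def split_ranges(size, readers):
    """Cut [0, size) into `readers` contiguous byte ranges."""
    step = size // readers
    bounds = [i * step for i in range(readers)] + [size]
    return list(zip(bounds[:-1], bounds[1:]))


def read_range(path, start, end):
    """Return the lines whose first byte lies in [start, end)."""
    lines = []
    with open(path, "rb") as f:
        if start > 0:
            # Step back one byte and discard up to the next newline, so
            # reading resumes at the first line that begins at or after start.
            f.seek(start - 1)
            f.readline()
        while f.tell() < end:
            line = f.readline()
            if not line:
                break
            lines.append(line.rstrip(b"\n").decode("utf-8"))
    return lines


def read_with_readers(path, readers):
    """Return one list of lines per reader, in file order."""
    size = os.path.getsize(path)
    return [read_range(path, s, e) for s, e in split_ranges(size, readers)]
```

Each line is returned by exactly one reader, so concatenating the per-reader lists gives back the file's lines in order.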
File Set stage:
---------------
By default the File Set stage runs in parallel mode, and it can
store more than 2 GB of data. A file set holds two types of information:
i) the descriptor file, which points to the metadata and the data locations
ii) the data files, which contain the data; there are multiple data files
if your configuration file defines more than one node
Answer / venugopal [patni]
The Sequential File stage is used to read data sequentially.
It can be configured to execute in either parallel or
sequential mode. We can't perform lookups using a sequential
file.
The File Set stage is used to import the list of exported files.
It executes only in parallel mode.
Its main benefit relates to the 2 GB limit on the size of a
file: we need to distribute files among the nodes to
prevent overruns.
What is a merge?
Where does DataStage store its repository?
1. I have 5 jobs (1-5) connected to each other, and I want to run only jobs 3-5. How? 2. How do you schedule a job in DataStage 7.5 2? What is the difference between the grep and fgrep commands? How do you cleanse the data in your project?
How can you generate a sequence number using only the Sequential File stage?
What is the difference between server job and parallel jobs?
How to achieve this output? Two input columns (ID & Name): ID | Name, 1 | Jack, 1 | Kara. The output should have only one column, populated as: 1,Jack and 1,Kara
How to retrieve max(sal), deptno and empno in DataStage?
How to find value from a column in a dataset?
How can you find out whether a DataStage process is running in Unix?
I have a scenario with input columns ID and salary, with values 1,1000 2,2000 and 3,4000. I need an extra column in the output named amount, with values 2000, 4000 and NULL. How can I get it?
I have a few records and want to store the data in the targets in a cyclic way. How?
Can you explain the engine tier in Information Server?