When would you use a data set, and when a file set?
Answer Posted / jaimy chacko
Just want to add something to the previous answer.
Data sets are operating system files, each referred to by a control file, which by convention has the suffix .ds. The control file points IBM InfoSphere DataStage to a set of other files that carry the data.
The location of these data files is determined by the “resource disk” property in the configuration file used to run the job. Using data sets wisely can be key to good performance in a set of linked jobs.
You can also manage data sets independently of a job using the Data Set Management utility, available from the IBM InfoSphere DataStage and QualityStage Designer or Director.
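To make the "resource disk" point concrete, here is a minimal Python sketch that lists the resource disk directories declared for each node in a parallel configuration file. The sample configuration text, host names, and paths are made up for illustration, and the helper name resource_disks is hypothetical. It is not part of DataStage. It only assumes the usual node / resource disk layout of a configuration file.

```python
import re

# Illustrative configuration text; node names, hosts and paths are invented.
SAMPLE_CONFIG = '''
{
  node "node1"
  {
    fastname "etlhost1"
    pools ""
    resource disk "/data/ds/node1" {pools ""}
    resource scratchdisk "/scratch/node1" {pools ""}
  }
  node "node2"
  {
    fastname "etlhost2"
    pools ""
    resource disk "/data/ds/node2" {pools ""}
    resource scratchdisk "/scratch/node2" {pools ""}
  }
}
'''


def resource_disks(config_text):
    """Return {node name: [resource disk paths]} found in the config text."""
    disks = {}
    current = None
    for line in config_text.splitlines():
        node = re.match(r'\s*node\s+"([^"]+)"', line)
        if node:
            current = node.group(1)
            disks[current] = []
            continue
        # Matches "resource disk" only; "resource scratchdisk" is skipped.
        disk = re.match(r'\s*resource\s+disk\s+"([^"]+)"', line)
        if disk and current is not None:
            disks[current].append(disk.group(1))
    return disks


if __name__ == "__main__":
    for node, paths in resource_disks(SAMPLE_CONFIG).items():
        print(node, paths)
```

With a configuration like this one, the data files behind a .ds control file would be expected under /data/ds/node1 and /data/ds/node2, one location per node.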
WebSphere DataStage can generate and name exported files, write them to their destination, and list the files it has generated in a file whose extension is, by convention, .fs. The data files and the file that lists them are together called a file set. This capability is useful because some operating systems impose a 2 GB limit on the size of a single file, so the data has to be distributed across several files on different nodes to avoid overrunning that limit. The amount of data that can be stored in each destination data file is limited by the characteristics of the file system and the amount of free disk space available.
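The file set idea (several size-limited data files plus one file that lists them) can be illustrated with a small Python sketch. This is only a conceptual model: the function name write_file_set, the part-file naming, and the one-path-per-line listing are assumptions made for the example, and do not reproduce the real .fs descriptor format that DataStage writes.

```python
import os
import tempfile


def write_file_set(records, out_dir, name, max_bytes):
    """Write records into size-capped part files and list them in <name>.fs.

    Hypothetical helper: mimics the file set concept, not the DataStage format.
    """
    part_paths = []
    current = None
    written = 0
    for record in records:
        data = (record + "\n").encode("utf-8")
        # Open a new part file when this record would push past the cap.
        if current is None or written + len(data) > max_bytes:
            if current is not None:
                current.close()
            path = os.path.join(out_dir, f"{name}.part{len(part_paths):04d}")
            current = open(path, "wb")
            part_paths.append(path)
            written = 0
        current.write(data)
        written += len(data)
    if current is not None:
        current.close()

    # The listing file plays the role of the .fs file: it names the data files.
    listing = os.path.join(out_dir, f"{name}.fs")
    with open(listing, "w", encoding="utf-8") as fh:
        for path in part_paths:
            fh.write(path + "\n")
    return listing, part_paths


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        fs_file, parts = write_file_set(["aaaa", "bbbb", "cccc"], tmp, "demo", 10)
        print(os.path.basename(fs_file), [os.path.basename(p) for p in parts])
```

In a real job the cap would come from the file system limit (for example 2 GB) rather than a 10-byte toy value, and the part files would be spread across nodes rather than written to a single directory.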
How do you write server routine code?
How do you schedule or monitor a job?
In Informatica, I can find the dependent mappings for a given table. Likewise, can I find the dependent jobs, with all their information, by using the table name?
How can we perform a second extraction from a client database without picking up the data that was already loaded in the first extraction?
Why do we use a surrogate key?
What is the difference between == and eq in UNIX shell scripting?
What are orabulk and bcp stages?
Give an idea of system variables.
What is a 'reconsideration error', and how can I respond to and debug it?
Where does DataStage store its repository?
What is meta stage?
Does DataStage support slowly changing dimensions?
What is the difference between an operational data store (ODS) and a data warehouse?
What is the importance of the exception activity in DataStage?
What are the different common services in DataStage?