when will you go for dataset and fileset?
Answers were Sorted based on User's Feedback
Answer / jaimy chacko
just want to add something to the previous answer
Data sets are operating system files,
each referred to by a control file, which by
convention has the suffix .ds.
The control file points IBM InfoSphere DataStage
to a set of other files that carry the data.
The location of these data files is determined by the “resource disk” property in the configuration file used to run the job. Using data sets wisely can be key to good performance in a set of linked jobs.
You can also manage data sets independently of a job using the Data Set Management utility, available from
the IBM InfoSphere DataStage and QualityStage Designer or Director.
WebSphere DataStage can generate and name exported files, write them to their destination, and list the files it has generated in a file whose extension is,by convention,.fs.The data files and the file that lists them are called a file set.This capability is useful because some operating systems impose a 2 GB limit on the size of a file and you need to distribute files among nodes to prevent overruns. The amount of data that can be stored in each destination data file is limited by the characteristics of the file system and the amount of free disk space available.
| Is This Answer Correct ? | 8 Yes | 4 No |
Answer / jaimy chacko
Dataset and Fileset are almost the same .
Dataset is tool dependent and File Set is OS dependent(UNIX).
Dataset don't have any regulation of the amount of data that it has where as Fileset have limits to the data.
| Is This Answer Correct ? | 3 Yes | 5 No |
1.How to read multiple file from sequential stage. 2.If a file doesn't arrive or doesn't exists in sequential stage how do u handle this. 3.What do you do before taking data from source to staging area. 4. I have a remove duplicate stage and transformer stage.what will u do for optimizeing the performance of the job.
How do you run datastage job from the command line?
what is use of SDR function?
i have one table with one column in this column i have three rows like 1,1,2 then that rows populate to target as first tow rows as one row and remaing row as one row how it posible? COLUMN_NAME SHIVA RAMU MADHU THEN I WANT TO LIKE SHIVA AND RAMU IN ONE ROW AND MADHU IS ONE ROW IF ANY ONE KNOW PLZ TELL ME
there are two schemas x and y are there. some data is in x schema. i want to use that in y schema..how can i use? please give some possibilities
i have a job with 3 possibiliies finish with ststus ok abbort warning how do you handle these 3 in job sequence? explain the situation where you have appiled scd in your project? chandu
what is a message handler
what is executable in DataStage?
how can we extract data with out having any common column
what is combinability and non combinability?
specify data stage strength?
Please Tell me..,I need some Complex Jobs in datastage as per in interview?how to tell and how to explain?please send as well as posiible please