when will you go for dataset and fileset?
Answers were Sorted based on User's Feedback
Answer / jaimy chacko
just want to add something to the previous answer
Data sets are operating system files,
each referred to by a control file, which by
convention has the suffix .ds.
The control file points IBM InfoSphere DataStage
to a set of other files that carry the data.
The location of these data files is determined by the “resource disk” property in the configuration file used to run the job. Using data sets wisely can be key to good performance in a set of linked jobs.
You can also manage data sets independently of a job using the Data Set Management utility, available from
the IBM InfoSphere DataStage and QualityStage Designer or Director.
WebSphere DataStage can generate and name exported files, write them to their destination, and list the files it has generated in a file whose extension is,by convention,.fs.The data files and the file that lists them are called a file set.This capability is useful because some operating systems impose a 2 GB limit on the size of a file and you need to distribute files among nodes to prevent overruns. The amount of data that can be stored in each destination data file is limited by the characteristics of the file system and the amount of free disk space available.
| Is This Answer Correct ? | 8 Yes | 4 No |
Answer / jaimy chacko
Dataset and Fileset are almost the same .
Dataset is tool dependent and File Set is OS dependent(UNIX).
Dataset don't have any regulation of the amount of data that it has where as Fileset have limits to the data.
| Is This Answer Correct ? | 3 Yes | 5 No |
Hi , Can anyone give few examples of scenarios and there corresponding design in datastage..i am new to this tool...confused in design while my manager asking to design the job.. Please post the URL if there..so i can go through it.. Thanks in advance...
Can you explain players in datastage?
col1 123 abc 234 def jkl 768 opq 567 789 but i want two targetss target1 contains only numeric values and target2 contains only alphabet values like trg1 123 234 768 567 789 trg2 abc def jkl opq
how many types of sorting the data in data stage?
How to clean the datastage repository?
Define Data Stage?
How can we read latest records in a text file named file1.txt using seq file stage only? file1 having 100 records in that 5 record sare latest records.How can we read that latest records?
how to clear the source file? 2) diff b/t odbc & Oracle?what is node? 3) explain one complex job u face in ur project
What is the use of datastage director?
A signal has a wavelength of 1 micro min in air.how far can the front of the wave travel during 1000periods?
Name the different sorting methods in datastage.
I have 2 jobs.I want to ru job B if job A has run 3 times.How can I achieve this through datastage