When would you use a data set, and when a file set?
Answer / jaimy chacko
Just want to add something to the previous answer.
Data sets are operating system files, each referred to by a control file, which by convention has the suffix .ds. The control file points IBM InfoSphere DataStage to a set of other files that carry the data.
The location of these data files is determined by the “resource disk” property in the configuration file used to run the job. Using data sets wisely can be key to good performance in a set of linked jobs.
You can also manage data sets independently of a job using the Data Set Management utility, available from the IBM InfoSphere DataStage and QualityStage Designer or Director.
WebSphere DataStage can generate and name exported files, write them to their destination, and list the files it has generated in a file whose extension is, by convention, .fs. The data files and the file that lists them are called a file set. This capability is useful because some operating systems impose a 2 GB limit on the size of a file and you need to distribute files among nodes to prevent overruns. The amount of data that can be stored in each destination data file is limited by the characteristics of the file system and the amount of free disk space available.
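To make the file set idea concrete, here is a rough Python sketch of the general pattern: data is spread over several size-capped files, and one listing file with the .fs suffix records where they are. This is only an illustration of the concept, not how DataStage does it. The function name `write_file_set`, the `.partNNNN` naming and the one-path-per-line listing layout are all assumptions made for this example; a real .fs file produced by DataStage has its own layout.

```python
import os
import tempfile


def write_file_set(records, out_dir, base_name, max_bytes):
    """Write records to numbered part files of at most max_bytes each
    (a single oversized record still gets a file of its own), then list
    the part files in <base_name>.fs. Returns the listing path.

    Hypothetical helper for illustration only; not a DataStage API.
    """
    os.makedirs(out_dir, exist_ok=True)
    part_paths = []
    current = None
    written = 0
    for record in records:
        line = (record + "\n").encode("utf-8")
        # Start a new part file when the next record would exceed the cap.
        if current is None or written + len(line) > max_bytes:
            if current is not None:
                current.close()
            name = f"{base_name}.part{len(part_paths):04d}"
            path = os.path.join(out_dir, name)
            current = open(path, "wb")
            part_paths.append(path)
            written = 0
        current.write(line)
        written += len(line)
    if current is not None:
        current.close()

    # The listing file plays the role of the .fs file: it only names
    # the data files, it does not hold any records itself.
    listing = os.path.join(out_dir, base_name + ".fs")
    with open(listing, "w", encoding="utf-8") as fh:
        for path in part_paths:
            fh.write(path + "\n")
    return listing


if __name__ == "__main__":
    target = tempfile.mkdtemp()
    fs_path = write_file_set(["row-%d" % i for i in range(10)], target, "export", 32)
    with open(fs_path, encoding="utf-8") as fh:
        print(fh.read())
```

The point of the listing file is that a reader only needs the one .fs path to find every data file, which is what lets the data be split to stay under a per-file size limit.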
Answer / jaimy chacko
A data set and a file set are almost the same.
A data set is tool dependent, while a file set is OS dependent (UNIX).
A data set has no restriction on the amount of data it can hold, whereas a file set has limits on the data.
What can we do with DataStage Director?
Out of 4 million records, only 3 million were loaded to the target and then the job aborted. How can only the remaining 1 million (not yet loaded) records be loaded on the next run? This job is not a sequential job; it is a standalone parallel job. What possibilities are available in DataStage 8.1?
How does hash partitioning work? Could you please give a clear, understandable example? Sample data (e_id,dept_no): 1,10; 2,10; 3,20; 4,20; 5,30; 6,40. I have two or three nodes. My questions are: 1) If I select e_id as the hash key, how will hash partitioning distribute the data across the two/three nodes? 2) If I select dept_no as the hash key, how will hash partitioning distribute the data across the two/three nodes?
How do you read the length of a word in UNIX?
What are the types of views in DataStage Director?
The source file has 5 records; when moved to the target, it should have 10 records.
sed,awk,head
How do you sort two columns in a single job in DataStage?
How do you convert yyyy mm dd to dd mm yyyy?
How do you import and export DataStage jobs?
What is the difference between DataStage Server Edition and Parallel Edition?
The input is cust id, cust quty, like 1,101; 1,102; 1,103. I want the output as cust id, cust quty: 1 101,102,103. Please write the query in Oracle.