Golgappa.net | Golgappa.org | BagIndia.net | BodyIndia.Com | CabIndia.net | CarsBikes.net | CarsBikes.org | CashIndia.net | ConsumerIndia.net | CookingIndia.net | DataIndia.net | DealIndia.net | EmailIndia.net | FirstTablet.com | FirstTourist.com | ForsaleIndia.net | IndiaBody.Com | IndiaCab.net | IndiaCash.net | IndiaModel.net | KidForum.net | OfficeIndia.net | PaysIndia.com | RestaurantIndia.net | RestaurantsIndia.net | SaleForum.net | SellForum.net | SoldIndia.com | StarIndia.net | TomatoCab.com | TomatoCabs.com | TownIndia.com
Interested to Buy Any Domain ? << Click Here >> for more details...


what is the exact difference between dataset and fileset in
datastage?

Answers were Sorted based on User's Feedback



what is the exact difference between dataset and fileset in datastage?..

Answer / subhash

DataSet:
1. The fundamental concept of the Orchestrate
framework is the Data Set. Data Sets are the inputs and
outputs of Orchestrate operators.
2. As a concept a Data Set is like a database table,
in so far as it is a collection of identically-defined
rows. It is the only structure on which Orchestrate
operators operate. Each operator( i.e., stage) accepts
input from one Data Set and sends its output to another
Data Set.
3. A Data Set exists on all the processing nodes
defined for the job that is currently processing it. That
subset of rows in a Data Set that are located on a single
processing node is referred to as a "partition" of the Data
Set. Technically, a partition is a subset of the rows in a
Data Set (or File Set) earmarked for processing on the same
processing node.
4. A control file is associated with each data set.
The control file contains the record schema that defines
the row structure (effectively its column definitions).
5. Within a Data Set data are stored in internal, or
machine-compatible format.

FileSet:
1. It allows you to read data from or write data to a
file set.
2. The stage can have a single input link, a single
output link and a single reject link.
3. It only executes in parallel mode.
4. The data files and the file that lists them are
called a file set. This capability is useful because some
operating systems impose a 2 GB limit on the size of a file
and you need to distribute files among nodes to prevent
overruns.
5. Only advantage of using fileset over a sequential
file is "it preserves partitioning scheme"

A dataset is a file/stage where the data can be read
directly by the DataStage, whereas a file set needs to be
converted into DataStage readable format (which happens
internally).

In simple words the data from the DataSet can be read
faster than from FileSet.

Is This Answer Correct ?    21 Yes 4 No

what is the exact difference between dataset and fileset in datastage?..

Answer / peddolla

1) dataset in native format so it can view the data only internally(datastage) where as fileset is in binary format so data can be view in any where which is convert from binary to human understandable language.

2) dataset dont support reject link where as fileset support reject link.

3) dataset is copy operator fileset is import and export operator.

Is This Answer Correct ?    13 Yes 2 No

what is the exact difference between dataset and fileset in datastage?..

Answer / kavi

In DataSet, data is stored in Binary format.
In fileSet, data is stored in the form of text.
That's it...

Is This Answer Correct ?    10 Yes 12 No

what is the exact difference between dataset and fileset in datastage?..

Answer / lokesh butra

Dataset operate the file local server and also its support
upto 2 GB Data
File set operates the file local and remote servers and
also its support unlimited Data

Is This Answer Correct ?    2 Yes 7 No

what is the exact difference between dataset and fileset in datastage?..

Answer / prakash

Dataset is same as that of fileset only difference is reject
link and external use.

Is This Answer Correct ?    7 Yes 13 No

Post New Answer

More Data Stage Interview Questions

How to add zero "0" before record in a field?

4 Answers  


Pls tell me what is troubleshooting in datastage view?

1 Answers  


col1 123 abc 234 def jkl 768 opq 567 789 but i want two targetss target1 contains only numeric values and target2 contains only alphabet values like trg1 123 234 768 567 789 trg2 abc def jkl opq

9 Answers   Patni,


what is the use of surogate key in datastage

0 Answers   TCS,


how to handle null values in sequential file?

1 Answers   HCL,


What are the various kinds of the hash file?

0 Answers  


source 1123445 I WANT OUTPUT AS DUPLICATES TO TARGET1 LIKE TARGET1 1144 NON-DUPLICATES TO TARGET2 TARGET2 235

6 Answers   US Technology,


input Name Salay Sam 10 Sam 30 Sam 20 Ram 40 Ram 50 Output should be Name Salary Count Sam 10 3 Ram 40 2 get min salary and count of name . OutPut 2 Name Seq Sam 1 Sam 2 Sam 3 Ram 1 Ram 2

1 Answers   HCL,


Define Merge?

0 Answers  


parallel jobs run on cluster machines. server jobs run on smp and mpp. What do mean by cluster mechines and SMP and MPP..?

3 Answers  


Hi, Please tell me how to solve this scenario in datastage ? Here we have 3 columns in a table TEST CODE,ENTRY DATE and BATCH The table looks like CODE ENTRYDATE BATCH 100 100716 1 100 100716 1 100 100716 1 200 122517 2 200 122517 2 302 555555 8 302 555555 8 302 555555 8 We need to create a seqno on grouping these 3 columns. The result should be like this. CODE ENTRYDATE BATCH SEQNO 100 100716 1 1 100 100716 1 2 100 100716 1 3 200 122517 2 1 200 122517 2 2 302 555555 8 1 302 555555 8 2 302 555555 8 3

1 Answers   Alpharithm Technologies,


Hi, i did what you mentioned in the answer, i.e. source- >Transformer -> 3 datasets. Iam able to see the data in datasets but its not sort order... Can you tell how sort the data?? i also checked Hash partition with performsort.

1 Answers   CGI,


Categories