ALLInterview.com :: Home Page            
 Advertise your Business Here     
Browse  |   Placement Papers  |   Company  |   Code Snippets  |   Certifications  |   Visa Questions
Post Question  |   Post Answer  |   My Panel  |   Search  |   Articles  |   Topics  |   ERRORS new
   Refer this Site  Refer This Site to Your Friends  Site Map  Bookmark this Site  Set it as your HomePage  Contact Us     Login  |  Sign Up                      
Google
   
 
Categories >> Software >> Data Warehouse >> Data Stage
 
 


 

 
 Teradata interview questions  Teradata Interview Questions (183)
 Business Objects interview questions  Business Objects Interview Questions (751)
 Cognos interview questions  Cognos Interview Questions (842)
 Informatica interview questions  Informatica Interview Questions (1622)
 Crystal Enterprise Suite interview questions  Crystal Enterprise Suite Interview Questions (29)
 Actuate interview questions  Actuate Interview Questions (35)
 Ab Initio interview questions  Ab Initio Interview Questions (168)
 Data Stage interview questions  Data Stage Interview Questions (563)
 SAS interview questions  SAS Interview Questions (551)
 Micro Strategy interview questions  Micro Strategy Interview Questions (36)
 ETL interview questions  ETL Interview Questions (195)
 Data Warehouse General interview questions  Data Warehouse General Interview Questions (215)
Question
what is the exact difference between dataset and fileset in 
datastage?
 Question Submitted By :: Data-Stage
I also faced this Question!!     Answer Posted By  
 
Answer
# 1
DataSet:
1. The fundamental concept of the Orchestrate
framework is the Data Set. Data Sets are the inputs and
outputs of Orchestrate operators.
2. As a concept a Data Set is like a database table,
in so far as it is a collection of identically-defined
rows. It is the only structure on which Orchestrate
operators operate. Each operator( i.e., stage) accepts
input from one Data Set and sends its output to another
Data Set.
3. A Data Set exists on all the processing nodes
defined for the job that is currently processing it. That
subset of rows in a Data Set that are located on a single
processing node is referred to as a "partition" of the Data
Set. Technically, a partition is a subset of the rows in a
Data Set (or File Set) earmarked for processing on the same
processing node.
4. A control file is associated with each data set.
The control file contains the record schema that defines
the row structure (effectively its column definitions).
5. Within a Data Set data are stored in internal, or
machine-compatible format.

FileSet:
1. It allows you to read data from or write data to a
file set.
2. The stage can have a single input link, a single
output link and a single reject link.
3. It only executes in parallel mode.
4. The data files and the file that lists them are
called a file set. This capability is useful because some
operating systems impose a 2 GB limit on the size of a file
and you need to distribute files among nodes to prevent
overruns.
5. Only advantage of using fileset over a sequential
file is "it preserves partitioning scheme"

A dataset is a file/stage where the data can be read
directly by the DataStage, whereas a file set needs to be
converted into DataStage readable format (which happens
internally).

In simple words the data from the DataSet can be read
faster than from FileSet.
 
Is This Answer Correct ?    8 Yes 1 No
Subhash
 
Answer
# 2
In DataSet, data is stored in Binary format.
In fileSet, data is stored in the form of text.
That's it...
 
Is This Answer Correct ?    5 Yes 2 No
Kavi
 
 
 
Answer
# 3
1) dataset in native format so it can view the data only internally(datastage) where as fileset is in binary format so data can be view in any where which is convert from binary to human understandable language.

2) dataset dont support reject link where as fileset support reject link.

3) dataset is copy operator fileset is import and export operator.
 
Is This Answer Correct ?    3 Yes 0 No
Peddolla
 
Answer
# 4
Dataset operate the file local server and also its support
upto 2 GB Data
File set operates the file local and remote servers and
also its support unlimited Data
 
Is This Answer Correct ?    0 Yes 0 No
Lokesh Butra
 
Answer
# 5
Dataset is same as that of fileset only difference is reject
link and external use.
 
Is This Answer Correct ?    3 Yes 7 No
Prakash
 

 
 
 
Other Data Stage Interview Questions
 
  Question Asked @ Answers
 
WHAT are unix quentios in datastage TCS 2
In Sequential file, how can i split a column into two, and that column contains string datatype. For Example, i have column of string datatype as subedar khaja. Now i want get output as separately with subedar in one column and khaja in second column. How? Coula anybody, solve it? Polaris 2
in sequtial file 2 columns avaliable, i want only one column load the target. for this we can do by modify and copy stage. But here when using modify stage (in property drop column1) until it is ok. if target is data set How to view the data. with out using data management. what is the reason for this. if any body know this answer plz tel me. thanks. IBM 1
I have a scenario like Deptno=10---->First record and last record Deptno=20---->First record and last record Deptno=30---->First record and last record I want those first and last records from each department in a single target. How to do this in DataStage, any one can assist me. Thanks in advance.   7
How to get max salary of an organization using data stage stages........... can any body help me plz....... Cap-Gemini 5
How to add zero "0" before record in a field?   4
Hi I am Vijay In my source i've 10 records in a single column.... but i want to split those records into 5 sequential files each seq file contains 2 records.?.... can any body help me? Scope-International 15
What is the Difference between Change capture stage and Difference Stage ? What are its significance individually ?   1
what is datastage job Monitoring CTS 6
how can we create tables in datastage?   1
Out of 4 mill records only 3 mill records are loaded to target and then job aborted. How to load only those 1 mill(not loaded records) for next run. This job is not sequential job, it is stand alone parallel job.What are the possibilities available in datastage8.1? IBM 6
what is usage of datastage with materialized views HP 4
 
For more Data Stage Interview Questions Click Here 
 
 
 
 
 


   
Copyright Policy  |  Terms of Service  |  Articles  |  Site Map  |  RSS Site Map  |  Contact Us
   
Copyright 2013  ALLInterview.com.  All Rights Reserved.

ALLInterview.com   ::  KalAajKal.com