my source is sequencial file and my target is dataset.
i am running the job in two node configuration file.
my source having 10 records how the data move to target?
Answer Posted / ankit gosain
Hi All,
Well it depends upon the partitioning technique used at the
target.
Note: Since source is a Sequential file which is sequential
operator, it will run on a single node, no matter how many
node the configuration file is & all the 10 records will go
sequentially. But Dataset is used as a target which is a
parallel operator, it will run parallely on all the nodes
defined & the data will be distributed according to the
partitioning technique used at the target ( if the records
are not duplicate on the basis of key then we can say that
the data distribution amongs the nodes will be even).
Cheers,
Ankit :)
| Is This Answer Correct ? | 2 Yes | 1 No |
Post New Answer View All Answers
Demonstrate experience in maintaining quality process standards?
Explain entity, attribute and relationship in datastage?
how to connect source to db?generally what r stages u used? how to find the data is having delimiter format?
What is a merge?
How to convert RGB Value to Hexadecimal values in datastage?
Difference between server jobs & parallel jobs?
How can one find bugs in job sequence?
what is stage is used for below Input columns: dept|mgr|employee|salary Output columns: mgr|count of employee per mgr|avg salary per dept note: each dept has one mgr and each mgr has many employees
What are the types of hashed files in data stage
What are the differences between datastage and informatica?
Is it possible to query a hash file?
1)what is the size of Fact table and dimension table? 2)how to find the size of Fact table and dimension table? 3)how to implement the surrogate key in transform stage? 4)write the configuration file path? 5)how many types of datasets explain? 6)diff b/w developed projects and migration projects? 7)how to delete the header and footer file of the sequencer file? 8)how can u call the parameters in DS in unix environment? 9) how much data ur getting daily ? 10)
Differentiate between data file and descriptor file?
create a job that splits the data in the Jobs.txt file into
four output files. You will direct the data to the
different output files using constraints. • Job name:
JobLevels
• Source file: Jobs.txt
• Target file 1: LowLevelJobs.txt
− min_lvl between 0 and 25 inclusive.
− Same column types and headings as Jobs.txt.
− Include column names in the first line of the output file.
− Job description column should be preceded by the
string “Job
Title:” and embedded within square brackets. For example, if
the job description is “Designer”, the derived value
is: “Job
Title: [Designer]”.
• Target file 2: MidLevelJobs.txt
− min_lvl between 26 and 100 inclusive.
− Same format and derivations as Target file 1.
• Target file 3: HighLevelJobs.txt
− min_lvl between 101 and 500 inclusive.
− Same format and derivations as Target file 1.
• Rejects file: JobRejects.txt
− min_lvl is out of range, i.e., below 0 or above 500.
− This file has only two columns: job_id and reject_desc.
− reject_desc is a variable-length text field, maximum
length
100. It should contain a string of the form: “Level out of
range:
Explain Quality stage?