my source is sequencial file and my target is dataset.
i am running the job in two node configuration file.
my source having 10 records how the data move to target?
Answers were Sorted based on User's Feedback
Answer / bala
Depends on your partitioning method. If its auto - entire
rows will pass in single node.
| Is This Answer Correct ? | 10 Yes | 1 No |
Answer / kavi
If we use Round Robin partitioning method,5 rows in one
node & another 5 rows in 2nd node.
If it is Entire partitioning method,all the 10 rows go into
both the nodes.
| Is This Answer Correct ? | 10 Yes | 1 No |
Answer / ankit gosain
Hi All,
Well it depends upon the partitioning technique used at the
target.
Note: Since source is a Sequential file which is sequential
operator, it will run on a single node, no matter how many
node the configuration file is & all the 10 records will go
sequentially. But Dataset is used as a target which is a
parallel operator, it will run parallely on all the nodes
defined & the data will be distributed according to the
partitioning technique used at the target ( if the records
are not duplicate on the basis of key then we can say that
the data distribution amongs the nodes will be even).
Cheers,
Ankit :)
| Is This Answer Correct ? | 2 Yes | 1 No |
How many partitions does a node can have? Some body explain me clearly about the partitions and the nodes?
how to identifie,is it innerjoin,leftouter join in lookup?
Enlist various types of routines in datastage.
WHAT IS FORCE COMPILE?
Describe routines in datastage? Enlist various types of routines.
How complex jobs are implemented in datstage to improve performance?
Hi, I am getting data from sequential file like below column1 A&B C&D A&E X&Y&Z v&x But I need the target column as like below column1 A and B C and D A and E X and Y and Z v and x
How many Key we can define in remove duplicate stage?
How to initialize environment variables?
can any 1 put a difference between datastage 7.5 and 8.0 all concepts plz r mail 2 me anu4u09@gmail.com
What is configuration your file structure 2)I have two databases both are Oracle while loading data from source to target the job takes 30 min but I want to load less time how?
i have a scenario like two columns(Empno, Ename) in that duplicate records are there, so my question is how to get second duplicate record in datastage.