Out of 4 mill records only 3 mill records are loaded to
target
and then job aborted. How to load only those 1 mill(not
loaded records) for next run.
This job is not sequential job, it is stand alone parallel
job.What are the possibilities available in datastage8.1?
Answers were Sorted based on User's Feedback
Answer / prasanna
Use already loaded records TGT as reference lookup,and use
lookup stage,with DROP option..and design the job.u ll get
req answer. refer
.
.
sour.......>lookup......>tgt
| Is This Answer Correct ? | 7 Yes | 4 No |
Answer / nish
there are plenty of options available.
just carefully study the scenario.
Source: 4 Mil records (doesn't change)
Target: 3 Million already loaded
Option1:
you just need to identify those 1Million pool them and then load to target.
This is clearly a case for Change Data Capture (CDC) stage.
use the Source As Before Table and Target as After.
Write those 1 Mil records based on change_code() (Deleted) to a file.
Move the contents of this file to target.
Option 2: This scenario also hints at updating the target with merge stage.use the DROP option to gather into a file and then update the target.
Option 3: Look Up being a performance concern should be your alternative.
| Is This Answer Correct ? | 2 Yes | 0 No |
Answer / venkatesh
first load the data using with surrogate stage
after that use filter stage and take that key of
surrogateand in lookup maintain the Drop option
| Is This Answer Correct ? | 2 Yes | 8 No |
What are the different options associated with dsjob command?
In Sequential file, how can i split a column into two, and that column contains string datatype. For Example, i have column of string datatype as subedar khaja. Now i want get output as separately with subedar in one column and khaja in second column. How? Coula anybody, solve it?
What is use Array size in datastage
what about data stage requirement
where the log files or tables can store in DS?
What is the importance of the exception activity in datastage?
Lookup constraints
what is the use of skid in reporting?
What are the job parameters?
What are the different common services in datastage?
HOW CAN U DO ERROR HANDLING IN DATA STAGE?
How can you write parallel routines in datastage PX?