if we take 2 tables(like emp and dept), we use join stage and
how to improve the performance?
Answers were Sorted based on User's Feedback
Answer / kiran
when ever join 2 tables based on key columns if the key
column is numeric ,set modulus,if the key column is non
numeric set hash partition technique.and compare to look up
join give better performance coz join has sort operation
by default.
| Is This Answer Correct ? | 11 Yes | 2 No |
Answer / ashok
above answer has one mistake
i.e join doesn't has sort operation bydefault we explicitly
specify
| Is This Answer Correct ? | 9 Yes | 3 No |
Hi this is Poorna ,
We can Improve the performance on join stage by doing
pre sorting for both left and right data based on
key .Then we can Improve the performance in join stage .
Plz correct me if any mistake in thinking .
| Is This Answer Correct ? | 6 Yes | 1 No |
Answer / rajeshchunduri
in emp and dept tables key column is deptno so it is key
based and datatype for key column is int . At this time we
change partion tech from hash to modulus.
chunduri
| Is This Answer Correct ? | 1 Yes | 1 No |
Answer / professional
Hi,
For the above query to improve the performance based on key columns in emp and dept joins by default sort in datastage for better performance if you have already a sorted data just go for environmental variables and do the operation #APT_Not_SORTDATA option then performance increase automatically...
| Is This Answer Correct ? | 0 Yes | 0 No |
How to display all the duplicate records from the oracle Table?
col1 123 abc 234 def jkl 768 opq 567 789 but i want two targetss target1 contains only numeric values and target2 contains only alphabet values like trg1 123 234 768 567 789 trg2 abc def jkl opq
What are some different alternative commands associated with "dsjob"?
In Datastage, how you can fix the truncated data error?
Why we use parameters instead of hard code in datastage.
How do you schedule or monitoring the job?
Hi guys, Design job sequence, we have 3 sources, in that 1st source in abort then only run the remaining sources.. How please design the job. Thanks.
how to write server Routine coding?
in a job of 20 one job is very slow due to that entire job is slow how can u find out which job is slow?
how do u reduce warnings
How to find value from a column in a dataset?
how to find no.of records in sequntial file itself?