if we take 2 tables(like emp and dept), we use join stage and
how to improve the performance?
Answers were Sorted based on User's Feedback
Answer / kiran
when ever join 2 tables based on key columns if the key
column is numeric ,set modulus,if the key column is non
numeric set hash partition technique.and compare to look up
join give better performance coz join has sort operation
by default.
Is This Answer Correct ? | 11 Yes | 2 No |
Answer / ashok
above answer has one mistake
i.e join doesn't has sort operation bydefault we explicitly
specify
Is This Answer Correct ? | 9 Yes | 3 No |
Hi this is Poorna ,
We can Improve the performance on join stage by doing
pre sorting for both left and right data based on
key .Then we can Improve the performance in join stage .
Plz correct me if any mistake in thinking .
Is This Answer Correct ? | 6 Yes | 1 No |
Answer / rajeshchunduri
in emp and dept tables key column is deptno so it is key
based and datatype for key column is int . At this time we
change partion tech from hash to modulus.
chunduri
Is This Answer Correct ? | 1 Yes | 1 No |
Answer / professional
Hi,
For the above query to improve the performance based on key columns in emp and dept joins by default sort in datastage for better performance if you have already a sorted data just go for environmental variables and do the operation #APT_Not_SORTDATA option then performance increase automatically...
Is This Answer Correct ? | 0 Yes | 0 No |
What are the features of datastage flow designer?
How can you join flat file, oracle as a sources?
i want anser this question empno,ename,sal 12,mmm_ww,200 13,nnn_xx,300 14,bbb_qq,400 which stages are take which types of logicks are doing pls help me i don't need "_"ex nnnxx this type i want
Why we use parameters instead of hard code in datastage.
How many jobs in ur project? Explain any complex job u have done in ur project?
Hi friends,Two input files, wants to validate only if the reference data has '0' otherwise no validation should be done..how to do this??
In one project how many shared containers are created?
How can i approach to write datastage 7.5 Certification? and how much they will charge for examination .What exactly should i do? Can anyone guide me plz?
I have 2 files 1st contains duplicate records only, 2nd file contains Unique records.EX: File1: 1 subhash 10000 1 subhash 10000 2 raju 20000 2 raju 20000 3 chandra 30000 3 chandra 30000 File2: 1 subhash 10000 5 pawan 15000 7 reddy 25000 3 chandra 30000 Output file:-- capture all the duplicates in both file with count. 1 subhash 10000 3 1 subhash 10000 3 1 subhash 10000 3 2 raju 20000 2 2 raju 20000 2 3 chandra 30000 3 3 chandra 30000 3 3 chandra 30000 3
I/P --- ID Value 1 AB 2 ABC 3 ADE 4 A O/p --- ID Value 1 A 1 B 2 A 2 B 2 C 3 A 3 D 3 E 4 A
if a column contains data like ram,rakesh,madhan,suraj,pradeep,bhaskar then I want to place names separated by commas in another columns how can we do?
Can you implement SCD2 using join, transformer and funnel stage?