Why do Hash joins usually perform better than Merge Joins?



Why do Hash joins usually perform better than Merge Joins?..

Answer / narayana

In MERGE join rows to be join must be present in same AMP.. If the rows to be joined are not on the same AMP, Teradata will either redistribute the data or duplicate the data in spool to make that happen based on row hash of the columns involved in the joins WHERE Clause.Hash join takes place if one or both of the tables on each can fit completely inside the AMP's memory.AMP chooses to hold small tables in its memory for joins happening on ROW hash.
Usually optimizer will first identify a smaller table, and then sort it by the join column row hash sequence. If the smaller table is really small and can fit in the memory, the performance will be best. Otherwise, the sorted smaller table will be duplicated to all the AMPs. Then the larger table is processed one row at a time by doing a binary search against the smaller table for matched record.
Where as in MERGE join Columns to be join is Non INDEXED column. teradata will redistribute the table rows into SPOOL memory and sort them by hash code.So that matching data lies on same amp, so the join can happen on redistributed data

Is This Answer Correct ?    6 Yes 0 No

Post New Answer

More Teradata Interview Questions

What is meant by MATCHTAG in multiload utility?

2 Answers   Satyam,


How does indexing improve query performance?

4 Answers   Satyam,


Difference between client character set and server character set? What is the use of these two?default character set in teradata?

1 Answers   TCS,


Explain fallback in teradata?

0 Answers  


what is a multi statement request in TD? what is the diif b/w V2R5 & TD12? what is the diff b/w MACRO & STORED PROCEDURE?

3 Answers   HCL,






Explain teradata vs. Redshift?

0 Answers  


Can any one please provide me practical example of How to do the performance tuning in Query??

5 Answers   Amdocs,


Differentiate primary key and partition key?

0 Answers  


What are the various etl tools in the market?

0 Answers  


what is identity columns in teradata

2 Answers   Wipro,


What is the primary index in teradata?

0 Answers  


How can bottlenecks be identified?

0 Answers  


Categories