Answer Posted / sat!sh
Skew factor describes how unevenly a table's rows are distributed among the AMPs.
Generally, with a Non-Unique PI we get duplicate values. The more duplicate
values we get, the more rows share the same row hash, and all rows with the
same row hash go to the same AMP, so the data is distributed unevenly.
One AMP stores more data and the other AMPs store less. When we access the
full table, the AMP holding the most data takes the longest and keeps the
other AMPs waiting, which wastes processing capacity.
In this situation (unequal distribution of data) the skew factor is high.
For tables like this we should avoid full table scans.
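To make the effect of duplicate PI values concrete, here is a small simulation sketch. It is only an illustration: `amp_for` and `rows_per_amp` are hypothetical helpers, MD5 stands in for Teradata's real row-hash algorithm (which is not reproduced here), and the four-AMP system and row counts are made-up numbers.

```python
import hashlib
from collections import Counter

def amp_for(pi_value, num_amps):
    """Map a primary-index value to an AMP number.

    Assumption: MD5 is only a stand-in hash so the sketch is deterministic;
    it is NOT the hashing algorithm Teradata actually uses.
    """
    digest = hashlib.md5(str(pi_value).encode("utf-8")).digest()
    return int.from_bytes(digest[:4], "big") % num_amps

def rows_per_amp(pi_values, num_amps):
    """Count how many rows land on each AMP for the given PI values."""
    counts = Counter(amp_for(value, num_amps) for value in pi_values)
    return [counts.get(amp, 0) for amp in range(num_amps)]

NUM_AMPS = 4  # hypothetical system size

# Every row has a different PI value: rows spread out across the AMPs.
unique_pi = list(range(10_000))

# 90% of the rows share one PI value: they all hash to the same AMP.
skewed_pi = ["IN"] * 9_000 + list(range(1_000))

even = rows_per_amp(unique_pi, NUM_AMPS)
skewed = rows_per_amp(skewed_pi, NUM_AMPS)

print("unique PI values:", even)
print("duplicate-heavy :", skewed)
```

In the second case at least 9,000 of the 10,000 rows end up on a single AMP, because equal PI values always produce the same hash. That AMP becomes the slow one during a full table scan.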
Example:
AMP0             AMP1
1000000 (10%)    9000000 (90%)
In this situation the skew is very high: AMP1 holds 90% of the rows.
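For reference, a skew figure can be computed from per-AMP sizes in more than one way. The sketch below assumes one commonly quoted formula, 100 * (1 - average / maximum), and compares it with the simpler "share held by the largest AMP" measure that the 90% figure corresponds to. The function names and the two-AMP 10/90 input are illustrative assumptions, not anything from a Teradata API.

```python
def skew_factor(rows_per_amp):
    """Skew as a percentage: 100 * (1 - average / maximum).

    Assumption: this is one commonly quoted formula; 0 means perfectly
    even distribution, and values near 100 mean one AMP dominates.
    """
    if not rows_per_amp or max(rows_per_amp) == 0:
        return 0.0
    average = sum(rows_per_amp) / len(rows_per_amp)
    return 100.0 * (1.0 - average / max(rows_per_amp))

def largest_share(rows_per_amp):
    """Percentage of all rows stored on the most heavily loaded AMP."""
    total = sum(rows_per_amp)
    return 100.0 * max(rows_per_amp) / total if total else 0.0

# Hypothetical two-AMP table split 10% / 90%.
amps = [10, 90]

print("skew factor (1 - avg/max):", round(skew_factor(amps), 1))
print("share on largest AMP     :", round(largest_share(amps), 1))
```

The two measures give different numbers for the same table, so when someone quotes a skew percentage it is worth checking which definition is meant.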