Answer Posted / sat!sh
Skew factor describes how unevenly a table's rows are distributed among the AMPs.
With a non-unique primary index (NUPI) we generally get duplicate
values. The more duplicate values there are, the more rows share the
same row hash, so all of those rows go to the same AMP and the
distribution becomes uneven.
One AMP then stores more data and the other AMPs store less.
When we access the full table, the AMP holding the most data
takes the longest and keeps the other AMPs waiting, which
wastes processing capacity.
In this situation (unequal distribution of data) the skew
factor is high.
For tables like this we should avoid full-table scans.
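The effect of duplicate primary-index values can be seen in a small simulation. This is only a rough sketch: `amp_for` uses SHA-256 as a stand-in for row hashing (it is not Teradata's actual hashing algorithm), and the four-AMP system and the sample values are made up for illustration.

```python
import hashlib
from collections import Counter

NUM_AMPS = 4  # hypothetical system size, chosen only for this illustration


def amp_for(value, num_amps=NUM_AMPS):
    """Stand-in for row hashing: map a primary-index value to an AMP number.

    SHA-256 is used here only because it spreads values evenly; it is an
    assumption of this sketch, not the algorithm Teradata uses.
    """
    digest = hashlib.sha256(str(value).encode("utf-8")).digest()
    return int.from_bytes(digest[:4], "big") % num_amps


def rows_per_amp(pi_values, num_amps=NUM_AMPS):
    """Count how many rows land on each AMP for the given index values."""
    counts = Counter(amp_for(v, num_amps) for v in pi_values)
    return [counts.get(amp, 0) for amp in range(num_amps)]


# Unique index values: every row hashes independently, so rows spread out.
even = rows_per_amp(range(100_000))

# Non-unique index: 90,000 rows share the value "IN" and therefore share
# one hash, so they all pile up on a single AMP.
skewed = rows_per_amp(["IN"] * 90_000 + list(range(10_000)))

print("unique PI    :", even)
print("non-unique PI:", skewed)
```

With unique values each AMP ends up with roughly a quarter of the rows, while in the second case one AMP holds at least 90,000 of the 100,000 rows.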
ex:
AMP0 AMP1
1000000(10%) 9000000(90%)
In this situation the skew is very high: AMP1 holds 90% of the rows.
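To put a number on it: a formula commonly used for skew factor is 100 * (1 - average per-AMP size / largest per-AMP size). That formula is an assumption here (the answer above does not state one), and the row counts below simply follow the 10%/90% split of the example, taken as 1,000,000 and 9,000,000 rows. A minimal sketch:

```python
def skew_factor(rows_per_amp):
    """Skew as a percentage: 100 * (1 - average / maximum).

    0 means perfectly even distribution; values near 100 mean almost all
    data sits on one AMP. The formula is a commonly used convention and is
    assumed here, not taken from the answer above.
    """
    if not rows_per_amp:
        raise ValueError("need at least one AMP")
    largest = max(rows_per_amp)
    if largest == 0:
        return 0.0  # empty table: nothing to skew
    average = sum(rows_per_amp) / len(rows_per_amp)
    return 100.0 * (1.0 - average / largest)


def largest_share(rows_per_amp):
    """Percentage of all rows held by the busiest AMP."""
    return 100.0 * max(rows_per_amp) / sum(rows_per_amp)


example = [1_000_000, 9_000_000]  # AMP0, AMP1 (10% / 90% split)

print(round(skew_factor(example), 1))    # 44.4
print(round(largest_share(example), 1))  # 90.0
```

So under this formula the 90% figure is the share of rows on the busiest AMP, while the skew factor itself comes out at about 44% for two AMPs. Either way the distribution is badly uneven.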