Answer Posted / sat!sh
Skew factor describes how unevenly the rows of a table are distributed among the AMPs.
Teradata places each row on an AMP according to the row hash of its primary index (PI) value.
A non-unique PI (NUPI) allows duplicate values, and every row with the same PI value gets the same row hash, so all of those rows land on the same AMP.
The more duplicates a value has, the more uneven the distribution becomes: one AMP stores much more data than the others.
During a full table scan, the AMP holding the most data takes the longest, and the other AMPs sit idle waiting for it, which wastes parallel processing capacity.
When the data is distributed unevenly like this, the skew factor is high.
Full table scans on such tables should be avoided where possible.
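The effect of duplicate PI values on row placement can be sketched in Python. This is only an illustration: MD5 modulo the number of AMPs stands in for Teradata's real row hash and hash map, and the names amp_for, rows_per_amp and N_AMPS are made up for the example.

```python
import hashlib
from collections import Counter


def amp_for(pi_value, n_amps):
    """Map a primary index value to an AMP number.

    MD5 modulo n_amps is only a stand-in (assumption) for Teradata's
    row hash and hash map; the real algorithm is different.
    """
    digest = hashlib.md5(str(pi_value).encode("utf-8")).hexdigest()
    return int(digest, 16) % n_amps


def rows_per_amp(pi_values, n_amps):
    """Count how many rows land on each AMP."""
    counts = Counter(amp_for(v, n_amps) for v in pi_values)
    return [counts.get(amp, 0) for amp in range(n_amps)]


N_AMPS = 4  # hypothetical system size

# Unique PI: every row has its own value, so rows spread out.
unique_pi = list(range(10_000))
# Non-unique PI: 90% of the rows share one value and hash to one AMP.
skewed_pi = ["IN"] * 9_000 + list(range(1_000))

print("unique PI :", rows_per_amp(unique_pi, N_AMPS))
print("skewed PI :", rows_per_amp(skewed_pi, N_AMPS))
```

With the skewed input, whichever AMP receives the value "IN" ends up with at least 9,000 of the 10,000 rows, while the unique values are spread roughly evenly.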
Example:
AMP0            AMP1
1000000 (10%)   9000000 (90%)
Here AMP1 holds 90% of the rows, so the skew is very high.
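To put a number on it, one commonly used definition of skew factor is (1 - average per AMP / maximum per AMP) * 100, typically computed from per-AMP space figures such as CurrentPerm in DBC.TableSizeV. The Python sketch below assumes that definition and uses row counts instead of space; the function name and the 10,000,000 row total are illustrative.

```python
def skew_factor(rows_per_amp):
    """Skew factor in percent, assuming the common definition
    (1 - average / maximum) * 100 over per-AMP row counts."""
    if not rows_per_amp or max(rows_per_amp) == 0:
        return 0.0
    average = sum(rows_per_amp) / len(rows_per_amp)
    return (1 - average / max(rows_per_amp)) * 100


# Two AMPs with an even split (assumed total of 10,000,000 rows).
even = [5_000_000, 5_000_000]
# Two AMPs with a 10% / 90% split of the same assumed total.
skewed = [1_000_000, 9_000_000]

print(round(skew_factor(even), 1))
print(round(skew_factor(skewed), 1))
```

Under this definition the even split gives 0 and the 10% / 90% split gives about 44%. The 90% figure is the share of rows on the busiest AMP, not the value this formula produces.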