
What is Skew Factor? Please explain.

Question Posted / sat!sh


Answer Posted / sat!sh

Skew factor is a measure of how unevenly the rows of a table are
distributed among the AMPs.

Generally, with a non-unique primary index (NUPI) we get duplicate
values. The more duplicate values there are, the more rows share the
same row hash, and all rows with the same row hash go to the same AMP.
This makes the data distribution uneven.
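
To see why duplicate PI values pile up on one AMP, here is a rough sketch in Python. It is only an illustration: `amp_for` and `rows_per_amp` are made-up helper names, MD5 stands in for Teradata's real row-hash algorithm, and the hash bucket / hash map step is simplified to a plain modulo over an assumed 4 AMPs.

```python
import hashlib
from collections import Counter


def amp_for(pi_value, num_amps):
    """Map a primary-index value to an AMP number.

    Assumption: MD5 plus modulo is a stand-in for Teradata's row hash
    and hash map. The point is only that equal values always map to
    the same AMP.
    """
    digest = hashlib.md5(str(pi_value).encode("utf-8")).digest()
    return int.from_bytes(digest[:4], "big") % num_amps


def rows_per_amp(pi_values, num_amps):
    """Count how many rows each AMP would receive."""
    counts = Counter(amp_for(value, num_amps) for value in pi_values)
    return [counts.get(amp, 0) for amp in range(num_amps)]


# 10,000 rows with a unique PI value each: rows spread across the AMPs.
unique_pi = range(10_000)

# 10,000 rows where 9,000 share one PI value: those 9,000 rows all
# hash to the same AMP.
skewed_pi = ["IN"] * 9_000 + list(range(1_000))

print(rows_per_amp(unique_pi, 4))
print(rows_per_amp(skewed_pi, 4))
```

The first list should come out roughly even, while in the second one a single AMP carries at least the 9,000 duplicate rows.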

One AMP will store more data and the other AMPs less. When we access
the full table, the AMP holding the most data takes the longest and
keeps the other AMPs waiting, which wastes processing capacity.

In this situation (unequal distribution of data) the skew factor is
high.

For tables like this we should avoid full table scans.

ex:
AMP0             AMP1
1000000 (10%)    9000000 (90%)

In this situation the skew is very high: AMP1 holds 90% of the rows.
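
The arithmetic behind the example can be sketched like this. The function names are hypothetical, and the row counts are chosen to match the 10% / 90% split. `heaviest_share` is the figure used in the example above (the share of rows on the fullest AMP). `skew_factor` uses a formula that is often quoted in the Teradata community, 100 * (1 - average / maximum); that formula is an assumption here and does not come from this answer, so the two numbers will not be the same.

```python
def heaviest_share(rows_by_amp):
    """Fraction of all rows stored on the fullest AMP."""
    total = sum(rows_by_amp)
    return max(rows_by_amp) / total if total else 0.0


def skew_factor(rows_by_amp):
    """Skew as 100 * (1 - average / maximum).

    Assumption: this is a commonly quoted convention, not a definition
    taken from the answer. 0 means perfectly even distribution.
    """
    peak = max(rows_by_amp)
    if peak == 0:
        return 0.0
    average = sum(rows_by_amp) / len(rows_by_amp)
    return 100.0 * (1.0 - average / peak)


rows = [1_000_000, 9_000_000]  # AMP0, AMP1

print(f"heaviest AMP share: {heaviest_share(rows):.0%}")
print(f"skew factor: {skew_factor(rows):.1f}")
```

Whichever number is used, the idea is the same: a full table scan finishes only when the fullest AMP finishes, so the further the maximum is from the average, the longer the other AMPs sit idle.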

