adspace


What will be the skew for,
input file->partition by key-> partition by round robin->output file

Answer Posted / Shiv Shakti Shankar

In a data processing pipeline that includes partitioning by both key and round robin, the skew refers to the variation in the number of records distributed across partitions. While it's difficult to predict an exact value for the skew without knowing the input data distribution, using round robin as the second level of partitioning helps to reduce overall skew and balance the load between partitions.

Is This Answer Correct ?    0 Yes 0 No



Post New Answer       View All Answers


Please Help Members By Posting Answers For Below Questions

What is a rollup component? Explain about it.

1256


What is rollup component?

1352


How to add default rules in transformer?

1298


can any one help me now i am learning AB Inito but i don't have material and pdf's can any one provide pdf's to this mail id mohanraju0113@gmail.com thanks in advance...................................

2340


Hi friends, what are the new features in abinito3.0?

2868