What will be the skew for,
input file->partition by key-> partition by round robin->output file
Answer / Shiv Shakti Shankar
In a data processing pipeline that includes partitioning by both key and round robin, the skew refers to the variation in the number of records distributed across partitions. While it's difficult to predict an exact value for the skew without knowing the input data distribution, using round robin as the second level of partitioning helps to reduce overall skew and balance the load between partitions.
| Is This Answer Correct ? | 0 Yes | 0 No |
What are differences between different gde versions(1.10,1.11,1.12,1.13and 1.15)?what are differences between different versions of co-op?
Give one reason when you need to consider multiple data processing?
Have you used rollup component? Describe how.
Explain what is sort component in abinitio?
Can anyone give me an exaple of realtime start script in the graph?
Have you used the rollup component? Describe how?
Define ramp limit in ab initio?
What is the difference between a utility and api in a RUN SQL component
What is a cursor? Within a cursor, how would you update fields on the row just fetched?
What is the diff between abinitiorc and .abinitiorc files ?
Where $mpjret is used in ab-initio?
what is SSH?What is the differences between the SSH,SSH1,SSH2?