What will be the skew for,
input file->partition by key-> partition by round robin->output file



What will be the skew for, input file->partition by key-> partition by round robin->outpu..

Answer / Shiv Shakti Shankar

In a data processing pipeline that includes partitioning by both key and round robin, the skew refers to the variation in the number of records distributed across partitions. While it's difficult to predict an exact value for the skew without knowing the input data distribution, using round robin as the second level of partitioning helps to reduce overall skew and balance the load between partitions.

Is This Answer Correct ?    0 Yes 0 No

Post New Answer

More Ab Initio Interview Questions

What are differences between different gde versions(1.10,1.11,1.12,1.13and 1.15)?what are differences between different versions of co-op?

1 Answers  


Give one reason when you need to consider multiple data processing?

1 Answers  


Have you used rollup component? Describe how.

1 Answers  


Explain what is sort component in abinitio?

1 Answers  


Can anyone give me an exaple of realtime start script in the graph?

1 Answers  


Have you used the rollup component? Describe how?

1 Answers  


Define ramp limit in ab initio?

1 Answers  


What is the difference between a utility and api in a RUN SQL component

1 Answers   Accenture,


What is a cursor? Within a cursor, how would you update fields on the row just fetched?

1 Answers  


What is the diff between abinitiorc and .abinitiorc files ?

1 Answers   IBM, Infosys, TCL,


Where $mpjret is used in ab-initio?

1 Answers  


what is SSH?What is the differences between the SSH,SSH1,SSH2?

2 Answers  


Categories