What is coalesce in spark?



What is coalesce in spark?..

Answer / Sakshi Upadhyay

Coalesce in Spark is an operation that re-partitions a DataFrame or Dataset into a specified number of partitions while ensuring that the total amount of data remains the same. This can help to balance the workload among executors.

Is This Answer Correct ?    0 Yes 0 No

Post New Answer

More Apache Spark Interview Questions

What is hadoop technology?

1 Answers  


What is data skew and how do you fix it?

1 Answers  


Name the operations supported by rdd?

1 Answers  


What is master node in spark?

1 Answers  


What are the ways in which Apache Spark handles accumulated Metadata?

1 Answers  


Define RDD?

1 Answers  


List out the various advantages of dataframe over rdd in apache spark?

1 Answers  


Can you explain apache spark?

1 Answers  


List some commonly used Machine Learning Algorithm Apache Spark?

1 Answers  


What is map side join?

1 Answers  


What are the different ways of representing data in Spark?

1 Answers  


Explain distnct(),union(),intersection() and substract() transformation in Spark?

1 Answers  


Categories