What is coalesce in Spark?

Answer Posted / Sakshi Upadhyay

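Coalesce in Spark is an operation that reduces the number of partitions of an RDD, DataFrame or Dataset to a specified number. It does this by merging existing partitions rather than redistributing individual records, so it avoids a full shuffle and is cheaper than repartition. The data itself is unchanged; only how it is split across partitions changes. On a DataFrame or Dataset, coalesce cannot increase the partition count: if you ask for more partitions than currently exist, the count stays the same. Because whole partitions are merged as they are, the resulting partitions can be uneven in size, so coalesce does not by itself balance the workload among executors. Use repartition, which does a full shuffle, when you need evenly sized partitions or more of them. Coalesce is typically used after a filter that leaves many small partitions, or before writing output to cut down the number of files.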
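
To illustrate how merging whole partitions avoids moving individual records but can leave partitions uneven, here is a toy model in plain Python. It is only a sketch, not Spark's implementation: the function name `coalesce_partitions`, the sample data, and the rule of grouping neighbouring partitions are all assumptions made for illustration, and Spark's real grouping logic is more involved.

```python
from typing import List


def coalesce_partitions(partitions: List[list], num_partitions: int) -> List[list]:
    """Toy model of coalesce: merge whole partitions into fewer groups.

    Hypothetical helper for illustration only; it does not call Spark.
    """
    if num_partitions <= 0:
        raise ValueError("num_partitions must be positive")
    current = len(partitions)
    # Asking for more partitions than exist leaves the count unchanged.
    target = min(num_partitions, current)
    merged: List[list] = [[] for _ in range(target)]
    for i, part in enumerate(partitions):
        # Each input partition goes, as a whole, into exactly one output
        # partition. Records are never split up and redistributed.
        merged[i * target // current].extend(part)
    return merged


# Four partitions, one much larger than the others (e.g. after a filter).
parts = [[1, 2, 3, 4, 5, 6], [7], [8], [9]]
result = coalesce_partitions(parts, 2)
print(result)                      # two partitions, same nine records
print([len(p) for p in result])    # sizes are not balanced
```

In Spark itself the corresponding calls would be `df.coalesce(2)` to merge partitions without a full shuffle and `df.repartition(2)` when evenly sized partitions matter more than avoiding the shuffle.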


