adspace


Why is there a need for broadcast variables when working with Apache Spark?

Answer Posted / Amit Jeet Kumar

Broadcast variables are useful in Apache Spark when a large dataset needs to be accessed by many tasks. By broadcasting the data, it reduces network communication overhead since only the changes in the dataset are sent instead of the entire dataset.

Is This Answer Correct ?    0 Yes 0 No



Post New Answer       View All Answers


Please Help Members By Posting Answers For Below Questions

List the advantage of Parquet file in Apache Spark?

525


What is meant by Transformation? Give some examples.

385


What is the latest version of spark?

343


Explain how RDDs work with Scala in Spark

411