Answer Posted / Krishana Chaudhary
Broadcast variables are used in Apache Spark to distribute large data that needs to be accessed by multiple tasks but not modified. Unlike RDDs, broadcast variables cannot be transformed or actioned upon. They are useful for scenarios where a task requires referencing large amounts of data, such as machine learning algorithms, that can benefit from distributing the data across all nodes in a cluster without duplicating it.
| Is This Answer Correct ? | 0 Yes | 0 No |
Post New Answer View All Answers