Describe Accumulator in detail in Apache Spark?



Describe Accumulator in detail in Apache Spark?..

Answer / Manish Kumar Gupta

An Accumulator in Apache Spark is a variable that can be updated across different tasks during an operation. It provides a convenient way to keep track of aggregated results while performing computations on large datasets. Accumulators are useful for iterative algorithms, reducing the need to shuffle data back and forth between nodes.

Is This Answer Correct ?    0 Yes 0 No

Post New Answer

More Apache Spark Interview Questions

Explain benefits of lazy evaluation in RDD in Apache Spark?

1 Answers  


Does spark use tez?

1 Answers  


Can you explain how to minimize data transfers while working with Spark?

1 Answers  


What are the features of spark rdd?

1 Answers  


How can data transfer be minimized when working with Apache Spark?

1 Answers  


How can you trigger automatic clean-ups in Spark to handle accumulated metadata?

1 Answers  


How can you remove the elements with a key present in any other RDD?

1 Answers  


Explain various level of persistence in Apache Spark?

1 Answers  


Explain the use of broadcast variables

1 Answers  


What is broadcast variable?

1 Answers  


Define various running modes of apache spark?

1 Answers  


Name some companies that are already using Spark Streaming?

1 Answers  


Categories