What is an "Accumulator"?
Answer / Omkar Singh
An accumulator is a shared variable in Apache Spark that tasks running in parallel can only add to, using an associative and commutative operation; only the driver program can read its value. Spark merges the updates from all tasks, which makes accumulators useful for counters and sums, such as counting how many times a certain value occurs.
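To make the idea concrete, here is a minimal sketch in plain Python that only imitates the accumulator pattern; it is not Spark's API. The names `ToyAccumulator`, `count_matches`, and `run_job` are hypothetical, and threads stand in for Spark tasks. In PySpark itself the corresponding calls would be `sc.accumulator(0)` to create one, `acc.add(1)` inside a task, and `acc.value` on the driver.

```python
from concurrent.futures import ThreadPoolExecutor


class ToyAccumulator:
    """Hypothetical stand-in for a Spark accumulator (not Spark's API)."""

    def __init__(self, zero=0):
        self.value = zero  # in Spark, only the driver reads this

    def merge(self, partial):
        # Addition is associative and commutative, so the order in
        # which partial results arrive does not change the total.
        self.value += partial


def count_matches(partition, target):
    """One 'task': count occurrences of target in its own partition."""
    local = 0
    for item in partition:
        if item == target:
            local += 1
    return local


def run_job(partitions, target):
    """The 'driver': run one task per partition and merge the partial counts."""
    acc = ToyAccumulator(0)
    with ThreadPoolExecutor(max_workers=4) as pool:
        partials = pool.map(lambda p: count_matches(p, target), partitions)
        for partial in partials:
            acc.merge(partial)
    return acc.value


partitions = [["a", "b", "a"], ["c", "a"], ["b", "b"]]
print(run_job(partitions, "a"))  # prints 3
```

Each task works only on its own local count, and the merge happens in one place. That mirrors why Spark tasks can add to an accumulator but cannot read it.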