What is an "Accumulator"?
Answer / Omkar Singh
An Accumulator is a variable that can be updated across different parallel tasks in Apache Spark. It allows aggregating values from multiple tasks and is useful for computations such as counting the number of occurrences of a certain value.
| Is This Answer Correct ? | 0 Yes | 0 No |
Do I need to know scala to learn spark?
What is Spark DataFrames?
What is the difference between rdd and dataframe in spark?
What is hadoop spark?
What is mllib?
What is flatmap?
What is spark ml?
What is a dataset? What are its advantages over dataframe and rdd?
Does spark use zookeeper?
Define parquet file format? How to convert data to parquet format?
What are the types of cluster managers in spark?
What is a DStream?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)