How is spark fault tolerance?
Answer / Apurav Garg
Apache Spark achieves fault tolerance by storing multiple copies of the same data across different nodes in a cluster. When a task fails, it can be re-executed on another node that has a copy of the data. Additionally, Spark maintains lineage information, allowing it to recalculate dependencies if needed.
| Is This Answer Correct ? | 0 Yes | 0 No |
What do you understand by worker node?
How do I download spark?
What are the downsides of Spark?
What is the spark driver?
Explain the action count() in Spark RDD?
How is streaming implemented in spark?
What is the default partition in spark?
Is apache spark an etl tool?
How can you trigger automatic clean-ups in Spark to handle accumulated metadata?
Why is Transformation lazy in Spark?
Compare Transformation and Action in Apache Spark?
Write the command to start and stop the spark in an interactive shell?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)