How is rdd fault?
Answer / Supriya Suman
"RDD Fault Tolerance" in Apache Spark ensures that data processing can continue even if one or more worker nodes fail. By default, Spark stores each RDD partition on multiple replicas across different nodes to ensure high availability. When a task fails, the system restores the data from replicas and recomputes the failed task.
| Is This Answer Correct ? | 0 Yes | 0 No |
What is spark deploy mode?
What is serialization in spark?
Can you run spark without hadoop?
Explain first() operation in Apache Spark RDD?
What file systems does spark support?
How do we represent data in Spark?
What is master node in spark?
List the benefits of Spark over MapReduce.
What is lineage graph in Apache Spark?
List the languages supported by Apache Spark?
Explain caching in spark streaming.
What does map transformation do? Provide an example.
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)