What is the difference between persist
Answer / Akansha Singh
In Spark, 'persist' and 'cache' are methods used to retain RDDs in memory for reuse. The main difference lies in the level of persistence: 'cache' stores RDDs using the default storage level, while 'persist' allows specifying a custom storage level (such as MEMORY_ONLY, MEMORY_AND_DISK, or DISK_ONLY).
| Is This Answer Correct ? | 0 Yes | 0 No |
What are the roles of the file system in any framework?
How do I get better performance with spark?
How is rdd distributed?
What does repartition do in spark?
What do spark executors manage?
What are the ways to launch Apache Spark over YARN?
Does spark need yarn?
What is the difference between DSM and RDD?
What happens if rdd partition is lost due to worker node failure?
What is write ahead log(journaling)?
How apache spark works?
Does spark use zookeeper?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)