Why are transformations lazy in Spark?
Answer / Romi Awasthi
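Transformations are lazy in Spark so that the engine can optimize execution and avoid wasted computation. Calling a transformation such as map() or filter() does not process any data; it only records the step in a plan (the RDD lineage, or a logical plan for DataFrames) describing how the data should be processed. Physical execution starts only when an action such as collect(), count(), or a save operation like saveAsTextFile() is triggered. Because Spark sees the whole chain of steps at that point, it can pipeline consecutive transformations into a single pass over the data and skip work the result does not need.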
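The idea can be illustrated without a cluster. The following is a minimal toy sketch in plain Python, not Spark's actual implementation: `LazyDataset`, `double`, and `calls` are hypothetical names made up for this example. Transformations only append a step to a recorded plan, and nothing runs until an action is called.

```python
class LazyDataset:
    """Toy stand-in for an RDD (hypothetical, not Spark's API):
    transformations record steps, actions run them."""

    def __init__(self, source, steps=()):
        self._source = source        # underlying data (any re-iterable)
        self._steps = tuple(steps)   # recorded plan: (kind, function) pairs

    # Transformations: return a new dataset with a longer plan, touch no data.
    def map(self, fn):
        return LazyDataset(self._source, self._steps + (("map", fn),))

    def filter(self, pred):
        return LazyDataset(self._source, self._steps + (("filter", pred),))

    def _run(self):
        # Stream each record through the whole plan in a single pass.
        for record in self._source:
            keep = True
            for kind, fn in self._steps:
                if kind == "map":
                    record = fn(record)
                elif not fn(record):   # a filter step rejected the record
                    keep = False
                    break
            if keep:
                yield record

    # Actions: these are the only methods that execute the plan.
    def collect(self):
        return list(self._run())

    def count(self):
        return sum(1 for _ in self._run())


calls = []  # records every time the mapping function actually runs


def double(x):
    calls.append(x)
    return x * 2


# Building the plan does not invoke double() at all.
plan = LazyDataset(range(5)).map(double).filter(lambda x: x > 4)
calls_before_action = len(calls)

# The action triggers execution of every recorded step.
result = plan.collect()
calls_after_action = len(calls)
```

In this sketch `calls_before_action` is 0 and `result` is `[6, 8]`, which shows that the mapping function runs only once `collect()` is called. The equivalent chain in PySpark would look like `sc.parallelize(range(5)).map(double).filter(lambda x: x > 4).collect()`, assuming an existing SparkContext named `sc`.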