What operations RDD support?
Answer / Ravi Kumar Gupta
RDD supports various transformations (like map, filter, and join) and actions (like count, saveAsTextFile, and reduce). These operations can be chained together to create complex data processing pipelines.
| Is This Answer Correct ? | 0 Yes | 0 No |
Explain the action count() in Spark RDD?
When should you use spark cache?
What are the major features/characteristics of rdd (resilient distributed datasets)?
Why scala is used in spark?
State the difference between persist() and cache() functions.
What is Spark MLlib?
Why we need compression and what are the different compression format supported?
What is a spark shuffle?
Explain various Apache Spark ecosystem components. In which scenarios can we use these components?
What are the various modes in which Spark runs on YARN? (Local vs Client vs Cluster Mode)
What is Immutable?
What is map in spark?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)