Answer Posted / Jaideep Shrivastava
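A Resilient Distributed Dataset (RDD) is an immutable, partitioned collection of records that Spark processes in parallel across the nodes of a cluster. It is the fundamental data structure in Apache Spark and can be created from Hadoop-supported storage such as HDFS, from an existing collection in the driver program, or from custom data sources. Because an RDD is immutable, transformations such as map and filter return a new RDD rather than changing the original, and Spark can recompute a lost partition from the recorded lineage of those transformations, which is what makes the dataset resilient.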
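To make the three properties concrete (partitioned, immutable, rebuilt from lineage), here is a minimal single-process sketch in plain Python. It is a toy model, not Spark's API: the class name `MiniRDD` and the helpers `lineage` and `compute_partition` are hypothetical names introduced only for illustration. In real Spark the corresponding calls would be `SparkContext.parallelize`, `map`, `filter` and `collect`, and the work would run on a cluster rather than in a loop.

```python
from typing import Any, Callable, Iterable, List, Tuple


class MiniRDD:
    """Toy model of an RDD (hypothetical; not part of Spark)."""

    def __init__(self, source: Tuple[tuple, ...], steps: tuple = ()):
        self._source = source  # read-only source partitions
        self._steps = steps    # recorded transformations (the lineage)

    @classmethod
    def parallelize(cls, data: Iterable[Any], num_partitions: int = 2) -> "MiniRDD":
        items = list(data)
        size = -(-len(items) // num_partitions)  # ceiling division
        # Split the data into contiguous chunks, one per partition.
        parts = tuple(
            tuple(items[i * size:(i + 1) * size]) for i in range(num_partitions)
        )
        return cls(parts)

    def map(self, fn: Callable[[Any], Any]) -> "MiniRDD":
        # Returns a NEW MiniRDD; the current one is left untouched.
        step = ("map", lambda part: [fn(x) for x in part])
        return MiniRDD(self._source, self._steps + (step,))

    def filter(self, pred: Callable[[Any], bool]) -> "MiniRDD":
        step = ("filter", lambda part: [x for x in part if pred(x)])
        return MiniRDD(self._source, self._steps + (step,))

    @property
    def lineage(self) -> List[str]:
        return ["parallelize"] + [name for name, _ in self._steps]

    def compute_partition(self, index: int) -> List[Any]:
        # Rebuild one partition by replaying the lineage on its source data.
        # Replaying recorded steps is the idea behind recomputing a lost partition.
        part = list(self._source[index])
        for _, step in self._steps:
            part = step(part)
        return part

    def collect(self) -> List[Any]:
        # Nothing is computed until an action such as collect() is called.
        return [
            x
            for i in range(len(self._source))
            for x in self.compute_partition(i)
        ]


numbers = MiniRDD.parallelize(range(1, 7), num_partitions=2)
even_squares = numbers.map(lambda x: x * x).filter(lambda x: x % 2 == 0)

print(numbers.collect())       # the original data is unchanged
print(even_squares.collect())  # computed by replaying map, then filter
print(even_squares.lineage)
```

The sketch stores only the source partitions plus the list of steps, so any single partition can be rebuilt independently with `compute_partition`. That is a simplified picture of why immutability and lineage together give an RDD its fault tolerance.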