Answer Posted / Avneesh Kumar Maurya
RDD (Resilient Distributed Dataset) is the fundamental data structure in Apache Spark. RDDs are immutable distributed collections of objects that can be processed in parallel. They are fault-tolerant, meaning if a worker node fails, the computation can be restarted on another node without losing any data.
| Is This Answer Correct ? | 0 Yes | 0 No |
Post New Answer View All Answers