Answer Posted / Sushil Kumar Yadav
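An RDD (Resilient Distributed Dataset) is Spark's core data abstraction. It is an immutable collection of objects that is split into partitions, and the partitions are distributed across the nodes of the cluster. Operations on an RDD run in parallel, one task per partition, and come in two kinds: transformations (such as map and filter), which are lazy and return a new RDD, and actions (such as count and collect), which trigger the computation and return a result. RDDs are fault tolerant: each RDD records the lineage of transformations that produced it, so partitions lost when a node fails can be recomputed from the source data.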
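To make partitions, lazy transformations and recovery after a failure more concrete, here is a toy model in plain Python. It is only a sketch and not Spark's API: the class ToyRDD and its method compute_partition are hypothetical names made up for illustration, and everything runs in one process instead of on a cluster.

```python
class ToyRDD:
    """Hypothetical stand-in for an RDD: immutable, partitioned, lazily evaluated."""

    def __init__(self, partitions=None, parent=None, fn=None):
        self._source = partitions  # only set on the root dataset
        self._parent = parent      # lineage: the dataset this one was derived from
        self._fn = fn              # per-partition function applied to the parent

    @classmethod
    def parallelize(cls, data, num_partitions):
        # Split the data into contiguous chunks, one per partition.
        size = -(-len(data) // num_partitions)  # ceiling division
        chunks = [list(data[i * size:(i + 1) * size]) for i in range(num_partitions)]
        return cls(partitions=chunks)

    def num_partitions(self):
        if self._parent is None:
            return len(self._source)
        return self._parent.num_partitions()

    # Transformations: nothing is computed here, only the lineage is recorded.
    def map(self, f):
        return ToyRDD(parent=self, fn=lambda part: [f(x) for x in part])

    def filter(self, pred):
        return ToyRDD(parent=self, fn=lambda part: [x for x in part if pred(x)])

    def compute_partition(self, index):
        # Rebuild a single partition by replaying the lineage from the source.
        # In this toy model, this is what would be rerun for a lost partition.
        if self._parent is None:
            return list(self._source[index])
        return self._fn(self._parent.compute_partition(index))

    # Action: triggers the computation of every partition.
    def collect(self):
        out = []
        for i in range(self.num_partitions()):
            out.extend(self.compute_partition(i))
        return out


numbers = ToyRDD.parallelize(list(range(1, 9)), num_partitions=4)
even_squares = numbers.map(lambda x: x * x).filter(lambda x: x % 2 == 0)

result = even_squares.collect()                 # the action runs the whole chain
recomputed = even_squares.compute_partition(1)  # rebuild only partition 1

print(result)      # [4, 16, 36, 64]
print(recomputed)  # [16]
```

Note that map and filter return a new ToyRDD and leave the original untouched, which mirrors the immutability of real RDDs. In PySpark the corresponding real calls are sc.parallelize(data, numSlices) on a SparkContext, followed by rdd.map(...), rdd.filter(...) and rdd.collect(); recomputing lost partitions is handled by Spark itself and is not something you call directly.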