Answer Posted / Bhaskar Shukla
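An RDD (Resilient Distributed Dataset) is an immutable, partitioned collection of records that can be operated on in parallel across a cluster. It is the fundamental data structure in Apache Spark. An RDD can be created from external storage such as HDFS or local text files, from an in-memory collection in the driver program, or by applying a transformation to an existing RDD. If a partition is lost, Spark recomputes it from the lineage of transformations that produced it, which is what makes the dataset "resilient".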
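To make the idea concrete, here is a minimal sketch in plain Python of a collection that is split into partitions and processed in parallel. This is a toy model, not Spark's API: the class name `ToyRDD` and its internals are hypothetical and exist only for illustration. It runs on threads in one process instead of a cluster, and it evaluates eagerly, whereas Spark transformations are lazy. The method names are chosen to mirror Spark's real `SparkContext.parallelize`, `RDD.map`, `RDD.filter`, and `RDD.collect`.

```python
from concurrent.futures import ThreadPoolExecutor


class ToyRDD:
    """Toy stand-in for an RDD: an immutable, partitioned collection.

    Hypothetical class for illustration only; this is not Spark's API.
    """

    def __init__(self, partitions):
        # Store partitions as tuples so the data cannot be modified in place.
        self._partitions = tuple(tuple(p) for p in partitions)

    @classmethod
    def parallelize(cls, data, num_partitions=2):
        # Split a local collection into roughly equal-sized partitions.
        data = list(data)
        size = max(1, -(-len(data) // num_partitions))  # ceiling division
        return cls(data[i:i + size] for i in range(0, len(data), size))

    def _transform(self, partition_fn):
        # Run partition_fn on every partition in parallel, build a NEW ToyRDD.
        with ThreadPoolExecutor() as pool:
            return ToyRDD(list(pool.map(partition_fn, self._partitions)))

    def map(self, fn):
        return self._transform(lambda part: [fn(x) for x in part])

    def filter(self, pred):
        return self._transform(lambda part: [x for x in part if pred(x)])

    def collect(self):
        # Gather all partitions back into one local list, in order.
        return [x for part in self._partitions for x in part]


# Create a dataset from a local collection, then derive new datasets from it.
numbers = ToyRDD.parallelize(range(1, 7), num_partitions=3)
squares = numbers.map(lambda x: x * x)
even_squares = squares.filter(lambda x: x % 2 == 0)

print(even_squares.collect())  # [4, 16, 36]
print(numbers.collect())       # [1, 2, 3, 4, 5, 6] (the source is unchanged)
```

Each transformation returns a new dataset and leaves its parent untouched, which is the property that lets real Spark rebuild a lost partition by replaying the transformations that produced it.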