A Resilient Distributed Dataset (RDD) is an immutable distributed collectio

Define RDD?

Question Posted / Jaideep Shrivastava

1 Answers
340 Views
I also Faced
E-Mail Answers

Answer Posted / Jaideep Shrivastava

A Resilient Distributed Dataset (RDD) is an immutable distributed collection of data that Spark reads and writes in parallel. It's a fundamental data structure in Apache Spark, which can be created from Hadoop datasets, programming language collections, or custom data sources.

Is This Answer Correct ?

0 Yes

0 No

Post New Answer View All Answers

Please Help Members By Posting Answers For Below Questions

What is meant by Transformation? Give some examples.

328

What is the latest version of spark?

288

List the advantage of Parquet file in Apache Spark?

474

Explain how RDDs work with Scala in Spark

355