What is an RDD?



Answer / Vikas Saini

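An RDD (Resilient Distributed Dataset) is the fundamental data abstraction in Apache Spark. It is an immutable, partitioned collection of objects that is processed in parallel across the nodes of a cluster. Transformations such as map() and filter() never modify an RDD in place; they return a new RDD, and Spark records the lineage of those transformations so that a lost partition can be recomputed instead of being restored from a replica.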
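
To make the idea concrete, here is a toy model in plain Python. It is only a sketch and not Spark's API: the class name ToyRDD and its methods are hypothetical, and threads on one machine stand in for cluster nodes. It shows the three properties named in the answer: the data is split into partitions, transformations return a new object and leave the original unchanged, and nothing is computed until collect() replays the recorded lineage on each partition.

```python
from concurrent.futures import ThreadPoolExecutor


class ToyRDD:
    """Hypothetical stand-in for an RDD; NOT part of Apache Spark."""

    def __init__(self, source_partitions, lineage=()):
        # Tuples keep the source data and the lineage immutable.
        self._source = tuple(tuple(p) for p in source_partitions)
        self._lineage = tuple(lineage)

    @classmethod
    def parallelize(cls, data, num_partitions):
        # Split the input into contiguous chunks, one per partition.
        data = list(data)
        size = max(1, -(-len(data) // num_partitions))  # ceiling division
        return cls(data[i:i + size] for i in range(0, len(data), size))

    def map(self, fn):
        # Lazy: record the step and return a NEW ToyRDD.
        return ToyRDD(self._source, self._lineage + (("map", fn),))

    def filter(self, pred):
        return ToyRDD(self._source, self._lineage + (("filter", pred),))

    def _compute_partition(self, partition):
        # Replay the lineage on one partition. A lost partition could be
        # rebuilt the same way from its source data.
        items = list(partition)
        for kind, fn in self._lineage:
            if kind == "map":
                items = [fn(x) for x in items]
            else:
                items = [x for x in items if fn(x)]
        return items

    def collect(self):
        # Action: process the partitions in parallel, then merge in order.
        with ThreadPoolExecutor() as pool:
            parts = list(pool.map(self._compute_partition, self._source))
        return [x for part in parts for x in part]


numbers = ToyRDD.parallelize(range(1, 7), num_partitions=3)
squares_of_evens = numbers.filter(lambda x: x % 2 == 0).map(lambda x: x * x)

result = squares_of_evens.collect()
original = numbers.collect()
print(result)    # [4, 16, 36]
print(original)  # [1, 2, 3, 4, 5, 6]
```

In real Spark the equivalent calls are made on an RDD created through a SparkContext, and the partitions live on executors across the cluster instead of in local threads.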
