What is difference between rdd and dataframe?



What is difference between rdd and dataframe?..

Answer / Satyendra Kumar Tiwari

RDD (Resilient Distributed Dataset) is a fundamental data structure in Apache Spark that represents an immutable distributed collection of objects. DataFrames, on the other hand, provide a programming interface for manipulating structured data (such as tables with columns and rows), including support for SQL-like queries and more advanced data types beyond primitives like integers and strings.

Is This Answer Correct ?    0 Yes 0 No

Post New Answer

More Apache Spark Interview Questions

What can I do with my m&s sparks points?

1 Answers  


What is scala spark?

1 Answers  


What is a databricks cluster?

1 Answers  


What is a worker node in Apache Spark?

1 Answers  


Can we run spark on windows?

1 Answers  


What do spark executors manage?

1 Answers  


What are the benefits of lazy evaluation?

1 Answers  


Explain Spark coalesce() operation?

1 Answers  


What is spark deploy mode?

1 Answers  


What is Sparse Vector?

1 Answers  


How can we create RDD in Apache Spark?

1 Answers  


Is spark part of hadoop?

1 Answers  


Categories