What is the difference between an RDD and a DataFrame in Spark?
Answer Posted / Avinash Sharma
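An RDD (Resilient Distributed Dataset) is Spark's fundamental distributed collection. It can hold arbitrary objects, structured or unstructured, and is processed with flexible functional operations such as map and filter. Spark knows nothing about the internal structure of the records, so it cannot optimize those operations. A DataFrame organizes the data into named columns with a schema, much like a relational table, which lets Spark apply built-in optimizations (the Catalyst query optimizer) to SQL-like queries and the high-level API. A Dataset also has a schema and adds compile-time type safety on top of it; in Scala and Java, a DataFrame is simply a Dataset of Row objects.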