What is difference between dataset and dataframe in spark?
Answer Posted / Pradeep Prasad
"A Dataset is a distributed collection of data with a strong type, which means that each column has a specified Java or Scala data type. It provides the benefits of both RDDs (Resilient Distributed Datasets) and DataFrames, with the added advantage of static typing. On the other hand, DataFrame is a distributed collection of data organized into named columns, but it does not have strong types for each column."
| Is This Answer Correct ? | 0 Yes | 0 No |
Post New Answer View All Answers