Answer Posted / Ateequr Rehman
In Apache Spark, a DataFrame is a distributed collection of data organized into named columns. It is conceptually similar to a table in a relational database or to a data frame in R or Python (pandas). A DataFrame can be created from many data sources, such as CSV files, JSON files, and databases, and it provides an efficient way to run batch and iterative computations on large datasets.
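As a rough illustration of the "named columns" idea, here is a minimal PySpark sketch. It assumes PySpark and a Java runtime are installed locally; the sample rows, the column names, the app name, and the helper `adults_over` are all made up for this example and are not part of the original answer.

```python
# Illustrative sample data; the names and values are invented for this sketch.
COLUMNS = ["name", "age"]
ROWS = [("Alice", 34), ("Bob", 19), ("Carol", 45)]


def adults_over(min_age):
    """Build a small DataFrame and return the sorted names of people older than min_age."""
    # Imported here so this module still loads where PySpark is not installed.
    from pyspark.sql import SparkSession

    # Assumption: a local Spark session is enough for a toy example.
    spark = (
        SparkSession.builder
        .master("local[*]")
        .appName("dataframe-sketch")
        .getOrCreate()
    )
    try:
        # createDataFrame turns local rows into a DataFrame with named columns.
        df = spark.createDataFrame(ROWS, COLUMNS)
        # The filter is a column expression that Spark evaluates, not a Python loop.
        matches = df.filter(df["age"] > min_age).select("name").collect()
        # Sort locally because Spark does not guarantee row order here.
        return sorted(row["name"] for row in matches)
    finally:
        spark.stop()
```

Calling `adults_over(30)` should start a local Spark session, filter on the `age` column, and return the matching names. For file-based sources, `spark.read.csv(...)` or `spark.read.json(...)` would take the place of `createDataFrame`.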