What are the different ways of representing data in Spark?



What are the different ways of representing data in Spark?..

Answer / Vimal Kumar Singh

Data can be represented in Apache Spark using DataFrames, RDDs (Resilient Distributed Datasets), and DataSets. DataFrames offer a programming interface similar to SQL, while RDDs provide more flexibility but less optimization.

Is This Answer Correct ?    0 Yes 0 No

Post New Answer

More Apache Spark Interview Questions

Is spark an etl?

1 Answers  


Describe the distnct(),union(),intersection() and substract() transformation in Apache Spark RDD?

1 Answers  


Why spark is faster than hadoop?

1 Answers  


What is paired rdd in spark?

1 Answers  


Where is apache spark used?

1 Answers  


How can you achieve high availability in Apache Spark?

1 Answers  


What are accumulators in spark?

1 Answers  


How can apache spark be used alongside hadoop?

1 Answers  


Is scala required for spark?

1 Answers  


Is there any benefit of learning MapReduce, then?

1 Answers  


What is spark lineage?

1 Answers  


Do you need to install Spark on all nodes of Yarn cluster while running Spark on Yarn?

1 Answers  


Categories