What are the different ways of representing data in Spark?

What are the different ways of representing data in Spark?

Question Posted / faheem husain

1 Answers
364 Views
I also Faced
E-Mail Answers

What are the different ways of representing data in Spark?..

Answer / Vimal Kumar Singh

Data can be represented in Apache Spark using DataFrames, RDDs (Resilient Distributed Datasets), and DataSets. DataFrames offer a programming interface similar to SQL, while RDDs provide more flexibility but less optimization.

Is This Answer Correct ?

0 Yes

0 No

Post New Answer

More Apache Spark Interview Questions

Is spark an etl?

Describe the distnct(),union(),intersection() and substract() transformation in Apache Spark RDD?

Why spark is faster than hadoop?

What is paired rdd in spark?

Where is apache spark used?

How can you achieve high availability in Apache Spark?

What are accumulators in spark?

How can apache spark be used alongside hadoop?

Is scala required for spark?

Is there any benefit of learning MapReduce, then?

What is spark lineage?

Do you need to install Spark on all nodes of Yarn cluster while running Spark on Yarn?

For more Apache Spark Interview Questions Click Here

Categories

Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)