List out the ways of creating RDD in Apache Spark?
Answer / Gyanesh Kumar
The following are the ways to create RDDs in Apache Spark:
1. Creating an RDD from a local collection using parallelize() method.
2. Reading data from an external file or HDFS file system.
3. Using SQLContext or DataFrame API to convert DataFrames and Datasets into RDDs.
| Is This Answer Correct ? | 0 Yes | 0 No |
What is SparkSession in Apache Spark? Why is it needed?
Which serialization libraries are supported in spark?
What is the disadvantage of spark sql?
How Spark uses Hadoop?
What can I do with my m&s sparks points?
Does spark replace hadoop?
Can you explain spark mllib?
What is hdfs spark?
What are the ways to launch Apache Spark over YARN?
Is spark an etl?
What do you understand about yarn?
What is spark architecture?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)