Explain Spark leftOuterJoin() and rightOuterJoin() operation?
Answer / Krishna Chandra Yadav
Spark leftOuterJoin() is a type of outer join that returns all records from the left RDD (first argument) and matching records from the right RDD (second argument), with null values for missing keys on the right side. If there are no matches, the result contains NULLs on the right side.nSpark rightOuterJoin() is similar, but it returns all records from the right RDD and matching records from the left RDD, with null values for missing keys on the left side. If there are no matches, the result contains NULLs on the left side.
| Is This Answer Correct ? | 0 Yes | 0 No |
Explain about the common workflow of a Spark program?
Please enumerate the various components of the Spark Ecosystem.
What are accumulators in spark?
Write the command to start and stop the spark in an interactive shell?
What is spark sqlcontext?
Define fold() operation in Apache Spark?
What is the difference between spark and apache spark?
How can Spark be connected to Apache Mesos?
Explain values() operation in apache spark?
What is spark and what is its purpose?
What is apache spark written in?
What is shuffle spill in spark?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)