What are the advantages of pig language?
What are the different execution mode available in Pig?
Define Spark Streaming.
Hadoop uses replication to achieve fault tolerance. How is this achieved in Apache Spark?
What is lineage graph?
What are benefits of Spark over MapReduce?
List the functions of Spark SQL?
What is RDD?
How to create RDD?
Does Apache Spark provide check pointing?
Explain about the popular use cases of Apache Spark
Do you need to install Spark on all nodes of Yarn cluster while running Spark on Yarn?
What are the different String functions available in pig?
Differentiate between the physical plan and logical plan in Pig script?
What are the use cases of Apache Pig?