What are benefits of Spark over MapReduce?
List the functions of Spark SQL?
What is RDD?
How to create RDD?
Does Apache Spark provide check pointing?
Explain about the popular use cases of Apache Spark
Do you need to install Spark on all nodes of Yarn cluster while running Spark on Yarn?
What are the different String functions available in pig?
Differentiate between the physical plan and logical plan in Pig script?
What are the use cases of Apache Pig?
What do you understand by an inner bag and outer bag in Pig?
Explain different execution modes available in Pig?
How do users interact with HDFS in Apache Pig ?
what are the basic parameters of a Mapper?
What is a MapReduce Combiner?