Why Apache Spark?
What are transformations and actions on an RDD in Apache Spark?
How can we create an RDD in Apache Spark?
What is SparkSession in Apache Spark?
What is the difference between SparkSession and SparkContext in Apache Spark?
What are the abstractions of Apache Spark?
What is SparkContext in Apache Spark?
What are the components of the Apache Spark ecosystem?
What are the properties of an RDD?
What does the ‘jps’ command do?
What is the use of the Context object?
What is commodity hardware?
What is Sqoop in Hadoop?
How do ‘map’ and ‘reduce’ work?
What is the purpose of RecordReader in Hadoop?