What are the abstractions of Apache Spark?
What is SparkContext in Apache Spark?
What are the components of Apache Spark Ecosystem?
Explain the RDD properties?
What does ‘jps’ command do?
Mention what is the use of Context Object?
What is commodity hardware?
Explain what is sqoop in Hadoop ?
Explain how do ‘map’ and ‘reduce’ works?
Explain what is the purpose of RecordReader in Hadoop?
How to restart Namenode?
Suppose Hadoop spawned 100 tasks for a job and one of the task failed. What will Hadoop do?
Mention what are the data components used by Hadoop?
What does /etc /init.d do?
What is a Combiner?