Big Data Interview Questions
Questions Answers Views Company eMail

List the functions of Spark SQL.

80

What is RDD?

98

How to create RDD?

102

Does Apache Spark provide checkpointing?

82

Explain the popular use cases of Apache Spark.

80

Do you need to install Spark on all nodes of a YARN cluster when running Spark on YARN?

116

What are the different string functions available in Pig?

117

Differentiate between the physical plan and the logical plan of a Pig script.

135

What are the use cases of Apache Pig?

134

What do you understand by an inner bag and an outer bag in Pig?

152

Explain the different execution modes available in Pig.

134

How do users interact with HDFS in Apache Pig?

129

What are the basic parameters of a Mapper?

57

What is a MapReduce Combiner?

69

Where is Mapper output stored?

63

Unanswered Questions (Big Data)

What is InputSplit and RecordReader?

170


What is a Task instance in Hadoop? Where does it run?

173


What are the modules that constitute the Apache Hadoop 2.0 framework?

216


What are the limitations of Hive?

259


Can you override the Hadoop MapReduce configuration in Hive?

73


What is speculative execution in Hadoop?

294


What are the benefits of block transfer?

175


List the functions of Spark SQL.

80


What is dynamic partitioning and when is it used?

97


What are reduce-only jobs?

197


How would you diagnose errors or handle exceptions in Pig?

115


What are the relational operations in Pig Latin?

149


Differentiate between the GROUP and COGROUP operators.

68


What is OutputCommitter?

74


Write a MapReduce program for character count.

316