Big Data Interview Questions
Questions Answers Views Company eMail

List the functions of Spark SQL?

60

What is RDD?

78

How to create RDD?

80




Does Apache Spark provide check pointing?

67

Explain about the popular use cases of Apache Spark

61

Do you need to install Spark on all nodes of Yarn cluster while running Spark on Yarn?

85

What are the different String functions available in pig?

102

Differentiate between the physical plan and logical plan in Pig script?

113

What are the use cases of Apache Pig?

116

What do you understand by an inner bag and outer bag in Pig?

124

Explain different execution modes available in Pig?

102

How do users interact with HDFS in Apache Pig ?

107

what are the basic parameters of a Mapper?

42

What is a MapReduce Combiner?

48

Where is Mapper output stored?

46







Un-Answered Questions { Big Data }

What will you do when NameNode is down?

192


Can Apache Kafka be used without Zookeeper?

185


What are the different Relational Operators available in pig language?

68


what is Speculative Execution?

50


Why Mapreduce output written in local disk?

167






Does Pig support multi-line commands?

138


Is it possible to split 100 lines of input as a single split in MapReduce?

56


What is the MapReduce plan in pig architecture?

222


How can an application connect to Hive run as a server?

363


what is a datanode?

141


What are the limitations of importing RDBMS tables into Hcatalog directly?

177


What are Flume events?

5


What is the difference between HDFS and NAS ?

473


What ate the key components of Hive Architecture?

76


How can you add the arbitrary key-value pairs in your mapper?

316