Big Data Interview Questions
Questions Answers Views Company eMail

Explain the scalar data types in Apache Pig.

119

Explain the uses of Pig.

113

What are the complex data types in Pig?

136




What does ILLUSTRATE do in Apache Pig?

152

What is a UDF?

164

What are the different types of Java UDFs supported by Apache Pig?

168

What is the FOREACH operation used for in Pig scripts?

147

How do you merge the contents of two or more relations, and how do you divide a single relation into two or more relations?

162

What is the MapReduce plan in the Pig architecture?

205

What are the advantages of the Pig language?

153

What are the different execution modes available in Pig?

186

Define Spark Streaming.

33

Hadoop uses replication to achieve fault tolerance. How is this achieved in Apache Spark?

54

What is a lineage graph?

77

What are the benefits of Spark over MapReduce?

47







Un-Answered Questions { Big Data }

Does 'ILLUSTRATE' run a MapReduce job?

56


What is dynamic partitioning and when is it used?

54


What is the default replication factor in HDFS?

149


Explain the metadata stored in the NameNode.

125


What are the problems with Hadoop 1.0?

233






Explain the LOAD keyword in a Pig script.

39


What are combiners, and what is their purpose?

138


How do you share the metastore among multiple users?

140


How would you diagnose errors or handle exceptions in Pig?

84


What are the identity mapper and identity reducer? In which cases can we use them?

240


What should be considered when planning hardware for the master node in a Hadoop architecture?

132


Name some companies that use Hadoop.

178


What is the Derby database?

150


What is an RDD?

72


How is reporting controlled in Hadoop?

41