Big Data Interview Questions
Questions Answers Views Company eMail

Explain about the scalar datatypes in Apache Pig?

129

Explain the uses of PIG?

123

What are the complex datatypes in pig?

148




What does illustrate do in Apache Pig?

165

What is UDF?

176

What are the different types of UDF's in Java supported by Apache Pig?

182

What is the usage of foreach operation in Pig scripts?

157

How will you merge the contents of two or more relations and divide a single relation into two or more relations?

172

What is the MapReduce plan in pig architecture?

222

What are the advantages of pig language?

162

What are the different execution mode available in Pig?

197

Define Spark Streaming.

40

Hadoop uses replication to achieve fault tolerance. How is this achieved in Apache Spark?

60

What is lineage graph?

87

What are benefits of Spark over MapReduce?

54







Un-Answered Questions { Big Data }

What are the file formats that Hive supports and can use be used for storage?

70


Where is the Mapper Output stored?

535


What is the input type/format in MapReduce by default?

314


What is 'Key value pair' in HDFS?

153


what are the steps involved in decommissioning removing

136






What are the different operational commands in HBase at record level and table level?

121


Explain the LOAD keyword in Pig script?

47


Explain the need for MapReduce while programming in Apache Pig?

82


Is there any difference between HBase datamodel and RDBMS datamodel?

764


What is a MapReduce Combiner?

48


Why do we need a new framework for handling big data?

63


Explain the Reducer's reduce phase?

251


What is the Job interface in MapReduce framework?

202


What is a JobTracker in Hadoop? How many instances of JobTracker run on a Hadoop Cluster?

309


What alternate way does HDFS provides to recover data in case a Namenode

203