Big Data Interview Questions
Questions Answers Views Company eMail

What are the relational operators available related to combining and splitting in pig language?

52

Highlight the key differences between MapReduce and Apache Pig?

44

How to write 'foreach' statement for bag datatype in pig scripts?

67




State the usage of 'filters', 'group' , 'orderBy', 'distinct' keywords in pig scripts?

47

Mention the common features in Pig and Hive?

77

What are the debugging tools used for Apache Pig scripts?

63

Differentiate between Hadoop MapReduce and Pig?

48

What is a UDF in Pig?

51

Differentiate between GROUP and COGROUP operators?

40

What is BloomMapFile?

52

Highlight the difference between group and Cogroup operators in Pig?

52

How would you diagnose or do exception handling in the pig?

84

What are the relation operations in Pig? Explain any two with examples?

95

Differentiate between Pig Latin and Pig Engine?

102

What is the function of co-group in Pig?

115







Un-Answered Questions { Big Data }

What is HDFS High Availability?

209


Virtual Box & Ubuntu Installation?

135


What are the sources generating big data?

45


What is Apache Flume?

7


What will you do when NameNode is down?

175






What do you know about collaborative filtering?

36


how indexing in HDFS is done?

37


On what basis Namenode will decide which datanode to write on?

390


What are combiners and its purpose?

138


How do you define "block" in HDFS?

121


How is the distance between two nodes defined in Hadoop?

469


What are some of the interesting facts about Big Data?

36


What are active and passive "NameNodes"?

322


When Hive is run in embedded mode

851


how JobTracker schedules a task ?

36