Hadoop Interview Questions, Answers for Freshers and Experienced asked in various Company Job Interviews

Hadoop Interview Questions

Questions Answers Views Company eMail

Explain a simple Map/Reduce problem.

Capital One,

723
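One commonly used simple Map/Reduce problem is word count. The sketch below simulates the map, shuffle, and reduce phases in plain Python rather than on a Hadoop cluster; the function names (`map_words`, `reduce_counts`, `run_word_count`) are illustrative assumptions, not part of any Hadoop API.

```python
from collections import defaultdict


def map_words(line):
    """Map step: emit (word, 1) for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def reduce_counts(word, counts):
    """Reduce step: sum all the counts emitted for one word."""
    return word, sum(counts)


def run_word_count(lines):
    """Local stand-in for the framework: map, group by key, then reduce."""
    # Shuffle: group mapper output by key, as the framework would do.
    grouped = defaultdict(list)
    for line in lines:
        for word, one in map_words(line):
            grouped[word].append(one)
    return dict(reduce_counts(w, c) for w, c in grouped.items())


if __name__ == "__main__":
    print(run_word_count(["big data big cluster", "data node"]))
```

On a real cluster the grouping step is performed by the framework's shuffle across nodes; only the map and reduce functions are written by the user.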

Data Engineer: Given a list of followers in the format:

123, 345
234, 678
345, 123
…

where column one is the ID of the follower and column two is the ID of the followee, find all mutual following pairs (the pair 123, 345 in the example above). How would you use Map/Reduce to solve the problem when the list does not fit in memory?

Twitter,

748
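One possible approach to the mutual-follow question above, sketched in plain Python: the mapper keys every edge by its unordered pair of IDs and emits the follower as the value, so both directions of a relationship meet at the same reducer. The reducer then only has to check whether it saw two distinct followers for that key. The function names and the in-memory driver are assumptions for illustration; on a cluster the grouping would be done by the framework's shuffle, so no single machine needs to hold the full list.

```python
from collections import defaultdict


def map_edge(line):
    """Map step: key each edge by its unordered ID pair, value = follower."""
    follower, followee = (int(x) for x in line.split(","))
    if follower == followee:
        return  # a self-follow can never form a mutual pair
    key = (min(follower, followee), max(follower, followee))
    yield key, follower


def reduce_pair(key, followers):
    """Reduce step: emit the pair only if both IDs appear as a follower."""
    if len(set(followers)) == 2:
        yield key


def mutual_pairs(lines):
    """Local stand-in for the shuffle; a real job groups keys across nodes."""
    grouped = defaultdict(list)
    for line in lines:
        for key, value in map_edge(line):
            grouped[key].append(value)
    result = []
    for key in sorted(grouped):
        result.extend(reduce_pair(key, grouped[key]))
    return result


if __name__ == "__main__":
    print(mutual_pairs(["123, 345", "234, 678", "345, 123"]))
```

Using a set in the reducer means duplicate input rows for the same direction are not mistaken for a mutual relationship.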

How would you use Map/Reduce to split a very large graph into smaller pieces and parallelize the computation of edges when the data changes quickly and dynamically?

Twitter,

695

Write a Hive UDF that returns a sentiment score. For example, if good = 1, bad = -1, and average = 0, then for a restaurant review that states "Good food, bad service," your score might be 1 - 1 = 0.

LinkedIn,

687
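The scoring logic described in the sentiment question above can be sketched as follows. This is only the core computation in plain Python, not a Hive UDF itself (a Hive UDF is typically written in Java and registered with Hive); the `SCORES` table and the `sentiment_score` name are assumptions for illustration, with the word values taken from the question.

```python
import re

# Word values from the question; any other word is assumed to score 0.
SCORES = {"good": 1, "bad": -1, "average": 0}


def sentiment_score(review):
    """Sum the scores of all known sentiment words in a review."""
    if review is None:
        return 0  # treat a missing review as neutral
    # Lowercase and keep alphabetic runs so punctuation is ignored.
    words = re.findall(r"[a-z]+", review.lower())
    return sum(SCORES.get(word, 0) for word in words)


if __name__ == "__main__":
    print(sentiment_score("Good food, bad service,"))
```

Treating unknown words and missing reviews as neutral is a design choice made here for simplicity; a production scorer would need a larger lexicon and handling for negation.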

Explain how RDDs work with Scala in Spark

Capital One,

342

Define HRegionServer in HBase

157

What is the use of the shutdown command?

210

What is HBase HMaster?

360

What is the function of HMaster?

255

Suppose your data is stored in collections; for instance, binary data, message data, and metadata are all keyed on the same value. Would you use HBase for this?

158

Explain the limitations of HBase.

187

State some applications of HBase.

167

What is a column family in HBase?

196

Discuss the different tombstone markers used for deletion in HBase.

185

Explain the scope operators used in HBase.

182

Un-Answered Questions { Hadoop }

Explain sort order in brief.

164

What is a rack awareness algorithm and why is it used in Hadoop?

Is Spark based on Hadoop?

312

What are the main features of Impala?

What is the purpose of sqoop-merge?

What are the main components of Hadoop?

769

How is NFS different from HDFS?

What are the three types of tombstone markers in HBase?

152

Define streaming access.

674

How do you write your own SerDe?

767

What is the default block size in HDFS?

1043

What is the use of Spark?

275

What are the different Pig data types?

747

What is a starvation scenario in Spark Streaming?

346

Why is reading done in parallel in HDFS while writing is not?
