Big Data Interview Questions and Answers for Freshers and Experienced Candidates

Apache Hadoop (387)
MapReduce (351)
Apache Hive (334)
Apache Pig (225)

Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (188)

Big Data General (101)
Big Data AllOther (3)

Unanswered Questions { Big Data }

What does the Spark Engine do?

333

How does Spark use Akka?

332

How does Spark handle monitoring and logging in Standalone mode?

347

What is Hadoop serialization?

750

Explain a simple Map/Reduce problem.

786

Data Engineer: Given a list of followers in the format: 123, 345; 234, 678; 345, 123; … where column one is the ID of the follower and column two is the ID of the followee, find all mutual following pairs (the pair 123, 345 in the example above). How would you use Map/Reduce to solve the problem when the list does not fit in memory?

787
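One possible approach to the mutual-follow question above, offered as a minimal sketch rather than a reference answer: have the mapper key each edge on the unordered pair of IDs, so that both directions of a relationship reach the same reducer, and have the reducer emit the pair only when it sees both directions. The function names (`map_edge`, `reduce_pair`, `mutual_pairs`) are hypothetical, and the in-memory dictionary below only stands in for the shuffle phase that a real Map/Reduce framework would perform on disk across machines.

```python
from collections import defaultdict


def map_edge(follower, followee):
    # Key on the unordered pair so (a, b) and (b, a) land on the same reducer.
    key = (min(follower, followee), max(follower, followee))
    yield key, (follower, followee)


def reduce_pair(key, edges):
    # The pair is mutual only if both directions appear.
    # Using a set ignores duplicated input rows.
    if len(set(edges)) == 2:
        yield key


def mutual_pairs(edges):
    # Stand-in for the framework's shuffle: group mapper output by key.
    # (Assumption: a real job would never hold all groups in memory.)
    groups = defaultdict(list)
    for follower, followee in edges:
        for key, value in map_edge(follower, followee):
            groups[key].append(value)

    result = []
    for key in sorted(groups):
        result.extend(reduce_pair(key, groups[key]))
    return result


sample = [(123, 345), (234, 678), (345, 123)]
print(mutual_pairs(sample))  # [(123, 345)]
```

Because each reducer only ever sees the edges for a single pair of IDs, the memory needed per key stays constant regardless of how large the full follower list is.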

How would you use Map/Reduce to split a very large graph into smaller pieces and parallelize the computation of edges according to the fast/dynamic change of data?

730

Write a Hive UDF that returns a sentiment score. For example, if good = 1, bad = -1, and average = 0, then for a restaurant review that states "Good food, bad service," the score would be 1 - 1 = 0.

734
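The scoring arithmetic in the sentiment question above can be sketched as follows. This is only the scoring logic, written in Python for illustration; it is not a Hive UDF, which would typically be written in Java and registered with Hive. The names `SCORES` and `sentiment_score` are hypothetical, and the word weights are the three given in the question.

```python
import re

# Word weights taken from the question; any other word counts as 0.
SCORES = {"good": 1, "bad": -1, "average": 0}


def sentiment_score(review):
    # Mirror SQL semantics: a NULL review yields a NULL score.
    if review is None:
        return None
    # Lowercase and split on anything that is not a letter,
    # so punctuation such as commas does not hide a word.
    words = re.findall(r"[a-z]+", review.lower())
    return sum(SCORES.get(word, 0) for word in words)


print(sentiment_score("Good food, bad service"))  # 0
```

A word-lookup approach like this ignores negation ("not good") and context, so it only illustrates the 1 - 1 = 0 arithmetic from the example.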

Explain how RDDs work with Scala in Spark

352

Define HRegionServer in HBase

161

What is the use of the shutdown command?

214

What is HBase HMaster?

362

What is the function of HMaster?

280

Suppose that your data is stored in collections; for instance, some binary data, message data, and metadata are all keyed on the same value. Would you use HBase for this?

162

Explain the limitations of HBase.

199