Big Data Interview Questions
Questions Answers Views Company eMail

How would you use Map/Reduce to split a very large graph into smaller pieces and parallelize the computation of edges according to the fast/dynamic change of data?

Twitter,

416

Write a Hive UDF that returns a sentiment score. For example, if good = 1, bad = -1, and average = 0, then a review of a restaurant states "Good food, bad service," your score might be 1 - 1 = 0.

LinkedIn,

406

Explain how RDDs work with Scala in Spark

Capital One,

220

Define HRegionServer in HBase

109

What is the use of shutdown command?

137

What is HBase HMaster?

294

What is the function of HMaster?

148

Suppose that your data is stored in collections, for instance, some binary data, message data or metadata is all keyed on the same value. Will you use HBase for this?

115

Explian the Limitations of HBase?

133

State some applications of HBase?

126

What is a Column family in hbase?

152

Discuss about the different tombstone markers used for deletion purposes in HBase.?

136

Explain the Scope operators used in hbase?

127

Is it possible to iterate through the rows of HBase table in reverse order?

155

What is the use of HColumnDescriptor class?

134


Un-Answered Questions { Big Data }

What is the role of alter keyspace?

43


What do you mean by meta information in hdfs? List the documents related to metadata.

18


What are problems with small files and hdfs?

20


How many InputSplits is made by a Hadoop Framework?

348


Please explain apache kafka?

319






What is the driver program in spark?

183


What are the various storages from which Spark can read data?

257


Why Hadoop performs replication, although it results in data redundancy?

1030


What is spark databricks?

200


What is the purpose of RecordReader in hadoop?

231


When to use Hive?

445


How is big data analysis helpful in increasing business revenue?

217


How can we scale apache mahout in cloud?

35


What kind of datawarehouse application is suitable for Hive?

436


Explain the sequence of execution of all the components of MapReduce like a map, reduce, recordReader, split, combiner, partitioner, sort, shuffle.

383