Big Data Interview Questions
Questions Answers Views Company eMail

Why is Data Block size set to 128 MB in Hadoop?

237

What is a Heartbeat in Hadoop?

303

Comparison between Secondary NameNode and Checkpoint Node in Hadoop?

272

What is a Backup node in Hadoop?

255

What do you mean by the NameNode High Availability in hadoop?

252

What is the default replication factor in Hadoop and how will you change it?

282

Why Hadoop performs replication, although it results in data redundancy?

1030

What is Balancer in Hadoop?

216

What is active and passive NameNode in Hadoop?

247

How can one check whether NameNode is working or not?

271

How would you restart NameNode?

215

How NameNode tackle Datanode failures in Hadoop?

236

Is Namenode machine same as DataNode machine as in terms of hardware in Hadoop?

245

If DataNode increases, then do we need to upgrade NameNode in Hadoop?

240

What is meant by streaming access?

282


Un-Answered Questions { Big Data }

Why do we need indexing?

425


Why the output of map tasks are stored (spilled ) into local disc and not in hdfs?

382


What does dag stand for?

199


Can we say cogroup is a group of more than 1 data set?

446


Differentiate Reducer and Combiner in Hadoop MapReduce?

410






What are the differences between hadoop 1 and hadoop 2?

237


Explain when to use explode in Hive?

424


Specify what the information segments utilized by hadoop are?

236


Why is flume used?

50


Explain about the partitioning, shuffle and sort phase

388


Does impala support generic jdbc?

35


Why should we use presto?

5


What is kafka Producer?

333


Can any impala query also be executed in hive?

76


Explain HCatInputFormat and HCatOutputFormat?

5