Big Data Interview Questions
Questions Answers Views Company eMail

How is the distance between two nodes defined in Hadoop?

1128

What is bucketing in Hive?

1 1529

What is speculative execution in Hadoop?

767

How can we change the replication factor when data is on the fly?

1026

What alternate way does HDFS provide to recover data in case a NameNode

659

What are the problems with Hadoop 1.0?

764

What are the nodes in a Hadoop cluster?

658

What are the differences between Hive and RDBMS?

1 1709

What does the Mapper do?

657

What is your favourite tool in the Hadoop ecosystem?

639

What is the function of NodeManager?

655

What is the difference between SQL and HiveQL?

1 2936

What is Pig?

518

What are the steps involved in the MapReduce framework?

603

Explain the metadata in the NameNode.

595


Unanswered Questions { Big Data }

If I create a folder in HDFS, will there be metadata created corresponding to the folder? If yes, what will be the size of metadata created for a directory?

20


What is an InputSplit in MapReduce?

368


How do you copy a file into HDFS with a block size different from the configured block size?

23


List a few benefits of Spark over MapReduce.

208


Explain the difference between the MapReduce engine and an HDFS cluster.

36






What are the complicated steps in Flume configuration?

103


What is the problem with small files in Apache Hadoop?

443


What is a TaskTracker in Hadoop?

237


Big Data Engineer: Can you explain what REST is?

5


What is the Hadoop MapReduce API's contract for a key and value class?

417


What is active and passive NameNode in Hadoop?

243


What type of tool is Sqoop in Hadoop?

5


Does Kafka use HDFS?

294


How does reduceByKey work in Spark?

177


What is a reliable and unreliable receiver in Spark?

227