Hadoop Interview Questions, Answers for Freshers and Experienced asked in Job Interviews

Apache Hadoop (387)
MapReduce (351)
Apache Hive (334)
Apache Pig (225)

Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (188)

Un-Answered Questions { Hadoop }

How is indexing done in HDFS?

What alternate way does HDFS provides to recover data in case a Namenode, without backup, fails and cannot be recovered?

Why HDFS performs replication, although it results in data redundancy?

132

Why ‘Reading‘ is done in parallel and ‘Writing‘ is not in HDFS?

What are the key features of HDFS?

How can one set space quota in Hadoop (HDFS) directory?

Explain how HDFS communicates with Linux native file system?

If I create a folder in HDFS, will there be metadata created corresponding to the folder? If yes, what will be the size of metadata created for a directory?

Replication causes data redundancy then why is is pursued in HDFS?

What is a block in Hadoop HDFS? What should be the block size to get optimum performance from the Hadoop cluster?

What are the main hdfs-site.xml properties?

What do you mean by the High Availability of a NameNode in Hadoop HDFS?

How does HDFS ensure Data Integrity of data blocks stored in HDFS?

What is the difference between RDBMS with Hadoop MapReduce?

794

Explain what does the conf.setMapper Class do in MapReduce?

646