What do you mean by metadata in Hadoop?
What are problems with small files and hdfs?
What is the problem in having lots of small files in hdfs?
List the various HDFS daemons in HDFS cluster?
What is secondary namenode? Is it a substitute or back up node for the namenode?
How to create directory in HDFS?
What is active and passive NameNode in HDFS?
How will you perform the inter cluster data copying work in hdfs?
Explain how indexing is done in hdfs?
What is a rack awareness algorithm and why is it used in hadoop?
How much Metadata will be created on NameNode in Hadoop?
Replication causes data redundancy then why is is pursued in HDFS?
Is namenode also a commodity?
What is hdfs block size?
What are the main properties of hdfs-site.xml file?