Explain the hdfs architecture and list the various hdfs daemons in hdfs cluster?
How to format the HDFS? How frequently it will be done?
What is secondary namenode? Is it a substitute or back up node for the namenode?
What is a task tracker?
What is throughput in HDFS?
Can you explain heartbeat in hdfs?
how does hdfs ensure data integrity of data blocks stored in hadoop hdfs?
How to Delete directory and files recursively from HDFS?
What is the difference between NAS and HDFS?
While processing data from hdfs, does it execute code near data?
Explain HDFS “Write once Read many” pattern?
Explain how indexing in hdfs is done?
How does hdfs ensure information integrity of data blocks squares kept in hdfs?
Hdfs stores data using commodity hardware which has higher chances of failures. So, how hdfs ensures the fault tolerance capability of the system?
How does HDFS Index Data blocks? Explain.