Un-Answered Questions { Apache HDFS Hadoop Distributed File System }

What is NameNode and DataNode in HDFS?

35


Why HDFS stores data using commodity hardware despite the higher chance of failures?

21


What are file permissions in HDFS and how HDFS check permissions for files or directory?

24


How data or file is written into HDFS?

37


What is throughput? How does HDFS get a good throughput?

34


What is the benifit of Distributed cache, why can we just have the file in HDFS and have the application read it?

31


List the various HDFS daemons in HDFS cluster?

20


What is the difference between NAS and HDFS?

55


If data is present in HDFS and RF is defined, then how can we change Replication Factor?

24


What do you mean by metadata in HDFS? Where is it stored in Hadoop?

71


What is the optimal block size in HDFS?

30


What do you mean by metadata in HDFS?

42


Data node block size in HDFS, why 64MB?

30


What happens if the block in HDFS is corrupted?

19


What is Fault Tolerance in HDFS?

29