Un-Answered Questions { Apache HDFS Hadoop Distributed File System }

Explain the hdfs architecture?

39


How does hdfs provides good throughput?

28


Can we have different replication factor of the existing files in hdfs?

38


How will you perform the inter cluster data copying work in hdfs?

22


Suppose there is file of size 514 mb stored in hdfs (hadoop 2.x) using default block size configuration and default replication factor. Then, how many blocks will be created in total and what will be the size of each block?

87


List the files associated with metadata in hdfs?

22


Define data integrity? How does hdfs ensure data integrity of data blocks stored in hdfs?

33


What is a rack awareness algorithm and why is it used in hadoop?

22


What is a block?

34


What is throughput? How does hdfs provides good throughput?

48


Replication causes data redundancy and consume a lot of space, then why is it pursued in hdfs?

29


Define hadoop archives? What is the command for archiving a group of files in hdfs.

21


What do you mean by the high availability of a namenode?

15


What is the difference between nas (network attached storage) and hdfs?

37


What is a rack awareness algorithm?

47