Un-Answered Questions { Apache HDFS Hadoop Distributed File System }

What is the procedure to create users in HDFS and how to allocate Quota to them?

23


How does a client read/write data in HDFS?

24


What is the difference between MapReduce engine and HDFS cluster?

77


Explain NameNode and DataNode in HDFS?

34


How data or file is read in HDFS?

24


How is indexing done in HDFS?

26


What alternate way does HDFS provides to recover data in case a Namenode, without backup, fails and cannot be recovered?

20


Why HDFS performs replication, although it results in data redundancy?

47


Why ‘Reading‘ is done in parallel and ‘Writing‘ is not in HDFS?

22


What are the key features of HDFS?

26


How can one set space quota in Hadoop (HDFS) directory?

17


Explain how HDFS communicates with Linux native file system?

26


If I create a folder in HDFS, will there be metadata created corresponding to the folder? If yes, what will be the size of metadata created for a directory?

20


Replication causes data redundancy then why is is pursued in HDFS?

25


What is a block in Hadoop HDFS? What should be the block size to get optimum performance from the Hadoop cluster?

23