Apache HDFS Hadoop Distributed File System Interview Questions

What is the procedure for creating users in HDFS, and how are quotas allocated to them?

23

How does a client read/write data in HDFS?

24

What is the difference between the MapReduce engine and an HDFS cluster?

77

What are the NameNode and DataNode in HDFS?

34

How is data or a file read in HDFS?

24

How is indexing done in HDFS?

26

What alternative way does HDFS provide to recover data if a NameNode without a backup fails and cannot be recovered?

20

Why does HDFS perform replication even though it results in data redundancy?

47

Why is reading done in parallel in HDFS while writing is not?

22

What are the key features of HDFS?

26

How can one set a space quota on a Hadoop (HDFS) directory?

17

How does HDFS communicate with the native Linux file system?

26

If I create a folder in HDFS, will there be metadata created corresponding to the folder? If yes, what will be the size of metadata created for a directory?

20

Replication causes data redundancy, so why is it pursued in HDFS?

25

What is a block in Hadoop HDFS? What should be the block size to get optimum performance from the Hadoop cluster?

23



Un-Answered Questions { Apache HDFS Hadoop Distributed File System }

What is the throughput?

31


What is the optimal block size in HDFS?

30


Can multiple clients write into a Hadoop HDFS file concurrently?

33


Which classes does Hive use to read and write HDFS files?

46


Why is HDFS only suitable for large data sets and not the correct tool to use for many small files?

39






How is data or a file written into HDFS?

37


What is the difference between HDFS and HBase?

28


What is throughput in HDFS?

47


How does HDFS ensure the integrity of the data blocks it stores?

30


How does the HDFS client divide a file into blocks when storing it in HDFS?

50



If data is already present in HDFS and a replication factor (RF) is defined, how can we change the replication factor?

24


Since data is replicated three times in HDFS, does it mean that any calculation done on one node will also be repeated on the other two?

28


What is the difference between an input split and an HDFS block?

20

