Apache HDFS Hadoop Distributed File System Interview Questions
Questions Answers Views Company eMail

Explain the hdfs architecture?

39

How does hdfs provides good throughput?

28

Can we have different replication factor of the existing files in hdfs?

38

How will you perform the inter cluster data copying work in hdfs?

23

Suppose there is file of size 514 mb stored in hdfs (hadoop 2.x) using default block size configuration and default replication factor. Then, how many blocks will be created in total and what will be the size of each block?

88

List the files associated with metadata in hdfs?

22

Define data integrity? How does hdfs ensure data integrity of data blocks stored in hdfs?

33

What is a rack awareness algorithm and why is it used in hadoop?

22

What is a block?

34

What is throughput? How does hdfs provides good throughput?

48

Replication causes data redundancy and consume a lot of space, then why is it pursued in hdfs?

29

Define hadoop archives? What is the command for archiving a group of files in hdfs.

21

What do you mean by the high availability of a namenode?

15

What is the difference between nas (network attached storage) and hdfs?

37

What is a rack awareness algorithm?

47


Post New Apache HDFS Hadoop Distributed File System Questions

Un-Answered Questions { Apache HDFS Hadoop Distributed File System }

How is hdfs block size different from traditional file system block size?

36


While processing data from hdfs, does it execute code near data?

26


How to split single hdfs block into partitions rdd?

29


What do you mean by the high availability of a namenode?

15


Why do we need hdfs?

44






If data is present in HDFS and RF is defined, then how can we change Replication Factor?

24


Explain the difference between an hdfs block and input split?

49


What is hdfs in big data?

31


What are problems with small files and hdfs?

20


Define data integrity?

20


Explain the difference between mapreduce engine and hdfs cluster?

36


What is a rack awareness algorithm and why is it used in hadoop?

22


How is indexing done in HDFS?

26


Can multiple clients write into a Hadoop HDFS file concurrently?

33


List the files associated with metadata in hdfs?

22