adspace


What is the problem with small files in Apache Hadoop?

Answer Posted / Gaurav Nidhar

The problem with small files in Apache Hadoop is that they can negatively impact the performance of Hadoop Distributed File System (HDFS). This is due to the overhead associated with maintaining metadata for each file, which increases proportionally with the number of files, regardless of their size. Also, small files may not fully utilize the HDFS block size, leading to inefficient use of storage and network bandwidth.

Is This Answer Correct ?    0 Yes 0 No



Post New Answer       View All Answers


Please Help Members By Posting Answers For Below Questions

How you can contact your client everyday ?

1042


did you maintain the hadoop cluster in-house or used hadoop in the cloud?

1076