What is the problem with small files in Apache Hadoop?
Answer Posted / Gaurav Nidhar
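The problem with small files in Apache Hadoop is that they degrade the performance of the Hadoop Distributed File System (HDFS). The NameNode keeps the metadata for every file, directory, and block in memory, so this overhead grows with the number of files rather than with the amount of data stored. A commonly cited rule of thumb is roughly 150 bytes of NameNode memory per object, which means millions of small files can exhaust the NameNode heap long before the DataNodes run out of disk. A small file does not occupy a full block on disk, so raw storage is not the main cost. However, each small file still needs at least one block of its own, and a job reading many small files usually launches one task per file, so time goes to task startup, disk seeks, and extra network round trips rather than to processing data.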
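To make the metadata overhead concrete, here is a rough back-of-the-envelope sketch in Python. It is only an estimate under stated assumptions: the 150 bytes per object figure is a rule of thumb rather than a measured value, the 128 MiB block size is a typical default that may differ on your cluster, directories are ignored, and the function names are hypothetical.

```python
# Rough estimate of NameNode memory needed to track a dataset.
# Assumptions (not from the original answer): about 150 bytes of heap
# per file or block object, and a 128 MiB block size.

BYTES_PER_OBJECT = 150           # rule-of-thumb value, not a measurement
BLOCK_SIZE = 128 * 1024 ** 2     # 128 MiB, a typical default block size


def ceil_div(a: int, b: int) -> int:
    """Integer ceiling division."""
    return -(-a // b)


def namenode_objects(total_bytes: int, file_size: int,
                     block_size: int = BLOCK_SIZE) -> int:
    """Count file and block objects for a dataset split into equal files."""
    files = ceil_div(total_bytes, file_size)
    # Every file needs at least one block, however small it is.
    blocks_per_file = max(1, ceil_div(file_size, block_size))
    return files * (1 + blocks_per_file)


def estimate_namenode_bytes(total_bytes: int, file_size: int,
                            block_size: int = BLOCK_SIZE) -> int:
    """Approximate NameNode heap used by the dataset's metadata."""
    return namenode_objects(total_bytes, file_size, block_size) * BYTES_PER_OBJECT


if __name__ == "__main__":
    one_tib = 1024 ** 4
    small = estimate_namenode_bytes(one_tib, 1024 ** 2)        # 1 MiB files
    large = estimate_namenode_bytes(one_tib, 128 * 1024 ** 2)  # 128 MiB files
    print(f"1 MiB files:   {small / 1024 ** 2:.1f} MiB of metadata")
    print(f"128 MiB files: {large / 1024 ** 2:.1f} MiB of metadata")
    print(f"ratio: {small // large}x")
```

Under these assumptions, the same 1 TiB of data needs about 128 times more NameNode memory when stored as 1 MiB files than when stored as 128 MiB files, even though the disk usage is identical.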