What is the problem in having lots of small files in hdfs?
Answer Posted / Awadhesh Kumar Singh
In HDFS (Hadoop Distributed File System), managing a large number of small files can lead to inefficient storage utilization and increased metadata overhead. This is because each file has associated metadata, such as filename, replication information, access permissions, etc., which takes up space and processing time. Moreover, small files may not be efficiently stored across datanodes due to block-level organization, leading to increased network traffic for data retrieval.
| Is This Answer Correct ? | 0 Yes | 0 No |
Post New Answer View All Answers
No New Questions to Answer in this Category !! You can
Post New Questions
Answer Questions in Different Category