What is the difference between an inputsplit and a block?



What is the difference between an inputsplit and a block?..

Answer / Akash Kumar Srivastava

An InputSplit in Hadoop represents a part of the data that a mapper or reducer will process. It doesn't necessarily correspond to a physical file block. A Block, on the other hand, is a contiguous set of data stored within a DataNode and has a fixed size (128 MB by default). Multiple InputSplits can be assigned to a single Block during data processing.

Is This Answer Correct ?    0 Yes 0 No

Post New Answer

More Hadoop General Interview Questions

what happens when namenode goes down during file read operation in hadoop?

1 Answers  


What does hadoop-metrics.properties file do?

1 Answers  


Explain is it possible to search for files using wildcards?

1 Answers  


How often DataNode send heartbeat to NameNode in Hadoop?

1 Answers  


What is a rack?

1 Answers  


What a task tracker is in hadoop?

1 Answers  


How analysis of Big Data is useful for organizations?

1 Answers  


Clarify how job tracker schedules an assignment?

1 Answers  


What is CAP Theorem? What aspects does Hadoop support from this theorem?

1 Answers  


Can you explain combiner?

1 Answers  


Explain the Single point of Failure in Hadoop?

1 Answers  


Mention what daemons run on a master node and slave nodes?

1 Answers  


Categories