What is difference between a MapReduce InputSplit and HDFS block



What is difference between a MapReduce InputSplit and HDFS block..

Answer / Vinita Chaudhary

An HDFS block is a physical segment of data stored in Hadoop Distributed File System. An InputSplit, on the other hand, is a logical partition of the data that MapReduce uses to divide the work among multiple mappers. One or more HDFS blocks can contribute to an InputSplit, and the size of an InputSplit doesn't necessarily match the size of an HDFS block.

Is This Answer Correct ?    0 Yes 0 No

Post New Answer

More MapReduce Interview Questions

What is the Hadoop MapReduce API contract for a key and value Class?

1 Answers  


mapper or reducer?

1 Answers  


What is a Speculative Execution in Hadoop MapReduce?

1 Answers  


Why MapReduce uses the key-value pair to process the data?

1 Answers  


What is shuffling and sorting in mapreduce?

1 Answers  


Explain how do ‘map’ and ‘reduce’ work?

1 Answers  


Explain the differences between a combiner and reducer

1 Answers  


What is the key- value pair in MapReduce?

1 Answers  


Why can aggregation not be done in Mapper in MapReduce?

1 Answers  


What are the main components of MapReduce Job?

1 Answers  


Explain what combiners are and when you should use a combiner in a mapreduce job?

1 Answers  


Why is output file name in Hadoop MapReduce part-r-00000?

1 Answers  


Categories