Can you explain how ‘map’ and ‘reduce’ work?




Answer / Amar

MapReduce is the programming model Apache Hadoop uses to process large datasets in parallel across a cluster. A job has two main phases: Map and Reduce.
1. Map: The map function reads input records as key-value pairs, transforms each record independently, and emits intermediate key-value pairs. Map tasks run in parallel on multiple nodes in the Hadoop cluster, each on its own split of the input.
2. Reduce: After the map phase, the framework shuffles the intermediate pairs: it partitions them by key, sorts them, and groups the values that share a key. The reduce function then receives each key together with all of its values and combines them into a (usually smaller) set of output key-value pairs. Reduce tasks are also distributed across multiple nodes, but unlike in the map phase, all data for a given key is processed by the same reduce task.

In simple terms, the map function transforms the input data, and the reduce function aggregates the results.
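
To make the two phases concrete, here is a minimal word-count sketch in plain Python. It only simulates the flow in a single process and does not use the Hadoop API. The names `map_fn`, `partition`, `shuffle`, `reduce_fn`, `run_job` and the constant `NUM_REDUCERS` are hypothetical, and the CRC32-based partitioner is an assumption chosen for illustration, not Hadoop's own partitioner.

```python
import zlib
from collections import defaultdict

NUM_REDUCERS = 2  # hypothetical number of reduce tasks


def map_fn(_offset, line):
    """Map: emit one (word, 1) pair for every word in a single input record."""
    for word in line.lower().split():
        yield word, 1


def partition(key, num_reducers):
    """Pick the reduce task for a key (illustrative CRC32 hash, an assumption)."""
    return zlib.crc32(key.encode("utf-8")) % num_reducers


def shuffle(pairs, num_reducers):
    """Shuffle: route each pair to one reducer, then group and sort by key."""
    partitions = [defaultdict(list) for _ in range(num_reducers)]
    for key, value in pairs:
        partitions[partition(key, num_reducers)][key].append(value)
    return [sorted(groups.items()) for groups in partitions]


def reduce_fn(key, values):
    """Reduce: combine all values for one key into a single output pair."""
    return key, sum(values)


def run_job(records, num_reducers=NUM_REDUCERS):
    # Map phase: every record is processed independently of the others.
    intermediate = [
        pair for offset, line in records for pair in map_fn(offset, line)
    ]
    # Reduce phase: each simulated reducer sees every value for its keys.
    output = []
    for reducer_input in shuffle(intermediate, num_reducers):
        output.extend(reduce_fn(key, values) for key, values in reducer_input)
    return output


records = [(0, "the quick brown fox"), (1, "the lazy dog"), (2, "The fox")]
result = run_job(records)
print(dict(result))
```

For these three records, "the" is counted three times and "fox" twice. Because a key always hashes to the same partition, every value for that key reaches the same simulated reducer, which is the property the Reduce step relies on.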

