what is JobTracker in Hadoop? What are the actions followed by Hadoop?

what is JobTracker in Hadoop? What are the actions followed by Hadoop?

Question Posted / santosh shakya

1 Answers
875 Views
I also Faced
E-Mail Answers

what is JobTracker in Hadoop? What are the actions followed by Hadoop?..

Answer / Purnendu Kumar Shukla

JobTracker is the central management service in Hadoop that coordinates all the tasks in a MapReduce job. It assigns tasks to nodes in the cluster, monitors their progress, and handles failures if they occur. The actions followed by Hadoop are:
1. Map: Each input record is processed by a mapper, which produces key-value pairs.
2. Shuffle: The output of each mapper is sorted and shuffled to the reducers based on their keys.
3. Reduce: Each reducer processes the data associated with a specific key and produces the final output.

Is This Answer Correct ?

0 Yes

0 No

Post New Answer

More MapReduce Interview Questions

What is an identity mapper and identity reducer?

What happen if the number of the reducer is 0 in MapReduce?

Why Mapper runs in heavy weight process and not in a thread in MapReduce?

what is JobTracker in Hadoop? What are the actions followed by Hadoop?

Explain the differences between a combiner and reducer

What is the relation between MapReduce and Hive?

In Map Reduce why map write output to Local Disk instead of HDFS?

What is the difference between a MapReduce InputSplit and HDFS block?

how is data partitioned before it is sent to the reducer if no custom partitioner is defined in Hadoop?

what is WebDAV in Hadoop?

What is the difference between Job and Task in MapReduce?

Where is the output of Mapper written in Hadoop?

For more MapReduce Interview Questions Click Here

Categories

Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)