What is a Partitioner in Hadoop? Where does it run, in the mapper or the reducer?
Answer / Babi Kumari
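In Hadoop, a Partitioner decides which Reducer receives each key-value pair emitted during the Map phase. It runs on the map side: inside each map task, after the map function emits a record and before the output is sorted and spilled to disk. It does not run in the Reducer. The default implementation, HashPartitioner, hashes the key and takes the result modulo the number of reduce tasks, so all records with the same key go to the same Reducer. The load is spread evenly only when the key hashes are evenly distributed; for skewed keys, a custom Partitioner can be set on the job. Its purpose is grouping records by key, not improving data locality.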
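As a rough illustration of how a key is mapped to a reducer, the sketch below reproduces the arithmetic used by Hadoop's default HashPartitioner in plain Java, with no Hadoop dependency. The class name PartitionSketch and the sample keys are made up for this example; in a real job you would extend org.apache.hadoop.mapreduce.Partitioner and override getPartition instead.

```java
// Hypothetical stand-alone sketch; not part of Hadoop itself.
class PartitionSketch {

    // Same arithmetic as Hadoop's default HashPartitioner:
    // clear the sign bit so the result is never negative,
    // then take the remainder by the number of reduce tasks.
    static int getPartition(Object key, int numReduceTasks) {
        return (key.hashCode() & Integer.MAX_VALUE) % numReduceTasks;
    }

    public static void main(String[] args) {
        int numReduceTasks = 3; // assumed reducer count for the demo
        String[] keys = {"apple", "banana", "apple", "cherry"};

        // Every occurrence of the same key gets the same reducer index.
        for (String key : keys) {
            System.out.println(key + " -> reducer " + getPartition(key, numReduceTasks));
        }
    }
}
```

Because the index depends only on the key and the reducer count, every map task computes the same reducer for a given key without any coordination between tasks.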