What is Partioner in hadoop? Where does it run

What is Partioner in hadoop? Where does it run

Question Posted / kunal

1 Answers
1071 Views
I also Faced
E-Mail Answers

What is Partioner in hadoop? Where does it run..

Answer / Varun Tomar

Partitioner in Hadoop is a component responsible for partitioning the data based on a specific key. The default partitioner in Hadoop is the HashPartitioner, which partitions data based on the hash value of the key. When a MapReduce job runs, the partitioner determines which parts of the data should be assigned to which reducer. The partitioner runs on the JobTracker/ResourceManager node in Hadoop 1 and on the YARN ResourceManager in Hadoop 2.

Is This Answer Correct ?

0 Yes

0 No

Post New Answer

More Apache Hadoop Interview Questions

How will you make changes to the default configuration files?

Is secondary namenode a substitute to the namenode?

What is a block and block scanner in HDFS?

What is the difference between HDFS and NAS ?

What are the modules that constitute the Apache Hadoop 2.0 framework?

How Mapper is instantiated in a running job?

What is Distributed Cache?

What are the different types of Znodes?

What is a namenode? How many instances of namenode run on a hadoop cluster?

How many maximum jvm can run on a slave node?

Does hadoop always require digital data to process?

Can Hadoop be compared to NOSQL database like Cassandra?

For more Apache Hadoop Interview Questions Click Here

Categories

Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)