What is Partioner in hadoop? Where does it run
Answer / Varun Tomar
Partitioner in Hadoop is a component responsible for partitioning the data based on a specific key. The default partitioner in Hadoop is the HashPartitioner, which partitions data based on the hash value of the key. When a MapReduce job runs, the partitioner determines which parts of the data should be assigned to which reducer. The partitioner runs on the JobTracker/ResourceManager node in Hadoop 1 and on the YARN ResourceManager in Hadoop 2.
| Is This Answer Correct ? | 0 Yes | 0 No |
How will you make changes to the default configuration files?
Is secondary namenode a substitute to the namenode?
What is a block and block scanner in HDFS?
What is the difference between HDFS and NAS ?
What are the modules that constitute the Apache Hadoop 2.0 framework?
How Mapper is instantiated in a running job?
What is Distributed Cache?
What are the different types of Znodes?
What is a namenode? How many instances of namenode run on a hadoop cluster?
How many maximum jvm can run on a slave node?
Does hadoop always require digital data to process?
Can Hadoop be compared to NOSQL database like Cassandra?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)