What is a partitioner and what is it used for?
Answer / Sidharth Gautam
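In MapReduce, the partitioner decides which reducer receives each intermediate key-value pair emitted by the mappers. It works on the map output, not on the input data (input is divided among mappers by input splits), and the number of partitions equals the number of reduce tasks. All records with the same key go to the same partition, so a single reducer sees every value for that key. The default is HashPartitioner, which picks the partition from the key's hash code modulo the number of reducers. A custom partitioner is used when you need to control how keys are distributed, for example to balance load across reducers or to send related keys to the same output file.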
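To make the idea concrete, here is a minimal sketch in plain Java with no Hadoop dependency, so it can be run on its own. The hashPartition method uses the same arithmetic as Hadoop's default HashPartitioner. The class name PartitionerSketch and the firstLetterPartition rule are made up for illustration only. In a real job you would instead extend org.apache.hadoop.mapreduce.Partitioner, override getPartition(key, value, numPartitions), and register the class with job.setPartitionerClass(...).

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

public class PartitionerSketch {

    // Same arithmetic as Hadoop's default HashPartitioner:
    // clear the sign bit so the result is never negative,
    // then take the remainder by the number of reduce tasks.
    static int hashPartition(Object key, int numReduceTasks) {
        return (key.hashCode() & Integer.MAX_VALUE) % numReduceTasks;
    }

    // Hypothetical custom rule (an assumption for illustration):
    // route keys by their first letter, so 'a' words go to partition 0,
    // 'b' words to partition 1, and so on, wrapping around.
    static int firstLetterPartition(String key, int numReduceTasks) {
        if (key.isEmpty()) {
            return 0;
        }
        char c = Character.toLowerCase(key.charAt(0));
        int bucket = (c >= 'a' && c <= 'z') ? (c - 'a') : 0;
        return bucket % numReduceTasks;
    }

    public static void main(String[] args) {
        int numReduceTasks = 3;
        String[] mapOutputKeys = {"apple", "banana", "apple", "cherry", "date", "banana"};

        // Group the map output keys by the partition (reducer) they are sent to.
        Map<Integer, List<String>> byPartition = new TreeMap<>();
        for (String key : mapOutputKeys) {
            int p = hashPartition(key, numReduceTasks);
            byPartition.computeIfAbsent(p, k -> new ArrayList<>()).add(key);
        }

        // Every occurrence of the same key lands in the same partition.
        for (Map.Entry<Integer, List<String>> e : byPartition.entrySet()) {
            System.out.println("reducer " + e.getKey() + " <- " + e.getValue());
        }
    }
}
```

The point to notice is that the partition depends only on the key and the number of reducers, which is why both "apple" records always end up at the same reducer no matter which mapper produced them.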
What are the main configuration parameters that a user needs to specify to run a MapReduce job?
What comes in Hadoop 2.0 and MapReduce V2 (YARN)?
What is a "reducer" in Hadoop?
What are the new and old MapReduce APIs used when writing a MapReduce program? Explain how they work.
Why Hadoop MapReduce?
What is reduce side join in mapreduce?
How does fault tolerance work in mapreduce?
What are the steps involved in MapReduce framework?
What is the difference between Hadoop and RDBMS?
What are the four basic parameters of a reducer?
What is a distributed cache in mapreduce framework?
How to submit extra files (jars, static files) for a Hadoop MapReduce job during runtime?