How to sort intermediate output based on values in MapReduce?
Answer / Roushan Kumar Tiwary
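In Hadoop MapReduce, the framework sorts intermediate output by key during the shuffle phase. The values that reach a reducer for a given key arrive in no guaranteed order. To get them sorted by value, the usual technique is a secondary sort: emit a composite key that holds both the natural key and the value, then set a sort comparator on the job (Job.setSortComparatorClass) that orders by natural key and then by value. Also set a partitioner (Job.setPartitionerClass) and a grouping comparator (Job.setGroupingComparatorClass) that look only at the natural key, so that all records for one key still reach the same reduce call. The input format and output format play no part in this ordering.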
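A common way to get values in order is the secondary-sort pattern, where the value is folded into a composite key so that the key-based shuffle sort does the work. The sketch below is a Hadoop-free simulation using only java.util, meant to show the ordering and grouping logic rather than a real job. The names SecondarySortSketch, CompositeKey, SORT_ORDER and shuffle are hypothetical. In an actual job, the comparator would typically be registered with Job.setSortComparatorClass, and grouping on the natural key alone would be handled by Job.setGroupingComparatorClass together with a partitioner set through Job.setPartitionerClass.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class SecondarySortSketch {

    // Hypothetical composite key: the natural key plus the value to sort by.
    public static final class CompositeKey {
        final String naturalKey;
        final int value;

        public CompositeKey(String naturalKey, int value) {
            this.naturalKey = naturalKey;
            this.value = value;
        }
    }

    // Plays the role of the sort comparator: natural key first, then value.
    public static final Comparator<CompositeKey> SORT_ORDER =
            Comparator.comparing((CompositeKey k) -> k.naturalKey)
                      .thenComparingInt(k -> k.value);

    // Simulates shuffle-and-sort, then groups on the natural key only,
    // which is what a grouping comparator would do in a real job.
    public static Map<String, List<Integer>> shuffle(List<CompositeKey> mapOutput) {
        List<CompositeKey> sorted = new ArrayList<>(mapOutput);
        sorted.sort(SORT_ORDER);

        Map<String, List<Integer>> groups = new LinkedHashMap<>();
        for (CompositeKey k : sorted) {
            groups.computeIfAbsent(k.naturalKey, x -> new ArrayList<>()).add(k.value);
        }
        return groups;
    }

    public static void main(String[] args) {
        List<CompositeKey> mapOutput = List.of(
                new CompositeKey("b", 7),
                new CompositeKey("a", 3),
                new CompositeKey("a", 1),
                new CompositeKey("b", 2),
                new CompositeKey("a", 2));

        // Each natural key is printed with its values in ascending order.
        System.out.println(shuffle(mapOutput));
    }
}
```

Folding the value into the key is what lets the framework's existing key sort order the values, so the reducer does not have to buffer and sort them in memory.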