Explain combiners.
Answer / Jwala Singh
Combiners in MapReduce are a technique used to optimize the sorting phase. They allow intermediate key-value pairs to be aggregated on each mapper node before sending them to the reducer nodes for final aggregation. This reduces the amount of data that needs to be shuffled between mappers and reducers, thereby improving the overall performance and reducing network traffic.
| Is This Answer Correct ? | 0 Yes | 0 No |
What is the next step after Mapper or MapTask?
how indexing in HDFS is done?
what are the main configuration parameters that user need to specify to run Mapreduce Job ?
What is the best way to copy files between HDFS clusters?
What is difference between a MapReduce InputSplit and HDFS block
How to create custom key and custom value in MapReduce Job?
How can we control particular key should go in a specific reducer?
MapReduce Types and Formats and Setting up a Hadoop Cluster?
Is it possible to split 100 lines of input as a single split in MapReduce?
What main configuration parameters are specified in mapreduce?
What is a IdentityMapper and IdentityReducer in MapReduce ?
Explain task granularity
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)