Explain the working of MapReduce.
Answer / Ankit
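MapReduce is a programming model and software framework for processing large data sets in parallel across a cluster. It breaks the input data into smaller pieces (splits) and applies a user-defined map function to the records of each split, producing intermediate key-value pairs. The framework then shuffles and sorts those pairs by key and delivers them to a set of reducers, each of which applies a user-defined reduce function to the values grouped under each key to produce the final output.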
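The flow described above can be sketched in a few lines of plain Python. This is only a single-process simulation of the model, not Hadoop's API: the names map_fn, reduce_fn and run_mapreduce, the word-count task, and the num_splits parameter are all assumptions made for illustration, and a real cluster would run the map and reduce steps on different machines.

```python
from collections import defaultdict


def map_fn(line):
    """Hypothetical user-defined map function: emit (word, 1) per word."""
    for word in line.lower().split():
        yield word, 1


def reduce_fn(key, values):
    """Hypothetical user-defined reduce function: sum the counts for a key."""
    return key, sum(values)


def run_mapreduce(records, num_splits=2):
    """Simulate split -> map -> shuffle/sort -> reduce in one process."""
    # 1. Split: divide the input records into num_splits pieces.
    splits = [records[i::num_splits] for i in range(num_splits)]

    # 2. Map: apply map_fn to every record of every split.
    intermediate = []
    for split in splits:
        for record in split:
            intermediate.extend(map_fn(record))

    # 3. Shuffle and sort: group intermediate values by key.
    groups = defaultdict(list)
    for key, value in intermediate:
        groups[key].append(value)

    # 4. Reduce: apply reduce_fn to each key, visiting keys in sorted order.
    return [reduce_fn(key, groups[key]) for key in sorted(groups)]


if __name__ == "__main__":
    lines = ["the quick brown fox", "the lazy dog", "the fox"]
    for word, count in run_mapreduce(lines):
        print(word, count)
```

In this sketch the grouping step plays the role of the shuffle: every value emitted for the same key ends up in the same list before reduce_fn sees it, which is the guarantee the framework gives each reducer.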
Why does a Mapper run in a heavyweight process and not in a thread in MapReduce?
Can we set the number of reducers to zero in MapReduce?
What is the best way to copy files between HDFS clusters?
What are the most common input formats defined in Hadoop?
What is the difference between an RDBMS and Hadoop MapReduce?
What are the basic parameters of a Mapper?
What is the default input type in MapReduce?
In MapReduce, how do you change the name of the output file from part-r-00000?
What is a scarce system resource?
In MapReduce, why does a map task write its output to local disk instead of HDFS?
What are the configuration parameters in a MapReduce program?
What is the input type/format in MapReduce by default?