Explain the partitioning, shuffle, and sort phases in MapReduce.
MapReduce jobs take too long. What can be done to improve the performance of the cluster?
What are the new and old MapReduce APIs used when writing a MapReduce program? Explain how they work.
What is the difference between a MapReduce InputSplit and HDFS block?
How to write a custom partitioner for a Hadoop MapReduce job?
What happens when a DataNode fails during the write process?
What is the difference between Reducer and Combiner in Hadoop MapReduce?
How to overwrite an existing output file during execution of MapReduce jobs?
What are the identity mapper and reducer in MapReduce?
Does the MapReduce programming model provide a way for reducers to communicate with each other? In a MapReduce job, can a reducer communicate with another reducer?
When is it not recommended to use MapReduce paradigm for large scale data processing?
Is it possible for a job to have 0 reducers?
Explain InputSplit in Hadoop MapReduce.