How to optimize MapReduce Job?



How to optimize MapReduce Job?..

Answer / Shamshul

Optimizing a MapReduce job involves various techniques such as: using Combiner functions, reducing the number of shuffles by combining map and reduce tasks (combineFile), setting appropriate record reader and writer for input and output formats, increasing the number of reducers based on data size, etc.

Is This Answer Correct ?    0 Yes 0 No

Post New Answer

More MapReduce Interview Questions

What are the benefits of Spark over MapReduce?

1 Answers  


How to write MapReduce Programs?

1 Answers  


How is reporting controlled in hadoop?

1 Answers  


Is it possible to split 100 lines of input as a single split in MapReduce?

1 Answers  


Can there be no Reducer?

1 Answers  


What is the key- value pair in MapReduce?

1 Answers  


Define the purpose of the partition function in mapreduce framework

1 Answers  


How to write a custom partitioner for a Hadoop MapReduce job?

1 Answers  


What is identity mapper and reducer? In which cases can we use them?

1 Answers  


Can we rename the output file?

1 Answers  


What do you mean by data locality?

1 Answers  


Explain what is the function of mapreduce partitioner?

1 Answers  


Categories