What is RecordReader in a Map Reduce?
What is the difference between Reducer and Combiner in Hadoop MapReduce?
What is a partitioner and how the user can control which key will go to which reducer?
What do you know about nlineinputformat?
What is SequenceFileInputFormat?
What is mapreduce algorithm?
How to create a custom key and custom value in MapReduce Job?
How do reducers communicate with each other?
Explain about the partitioning, shuffle and sort phase in MapReduce?
Map reduce jobs take too long. What can be done to improve the performance of the cluster?
How to configure the number of the Combiner in MapReduce?
What is the Hadoop MapReduce API contract for a key and value Class?
Why the output of map tasks are stored (spilled ) into local disc and not in hdfs?
Explain how do ‘map’ and ‘reduce’ work?
How to write a custom partitioner for a Hadoop MapReduce job?