What do you mean by data locality?
What are the main configuration parameters specified in MapReduce?
What do you understand by the term "straggler"?
What is a MapReduce Combiner?
What is partitioning in MapReduce?
What is TextInputFormat?
For a Hadoop job, how will you write a custom partitioner?
Explain the differences between a combiner and a reducer.
Explain what shuffling is in MapReduce.
What are the methods in the Mapper interface?
What is the need for MapReduce?
What is the difference between an RDBMS and Hadoop?
What is the purpose of TextInputFormat?
How do you stop a running job gracefully?
How does the JobTracker schedule a task?