What is the next step after the mapper or map task?
What is shuffling in MapReduce?
What is the distributed cache in the MapReduce framework?
What are the main configuration parameters that a user needs to specify to run a MapReduce job?
What is the function of the MapReduce partitioner?
What is a map/reduce job in Hadoop?
What is a MapReduce algorithm?
When are the reducers started in a MapReduce job?
Where is the mapper's intermediate data stored?
What are the methods in the Mapper interface?
What are an identity mapper and an identity reducer?
What are the advantages of using MapReduce with Hadoop?
What do you know about NLineInputFormat?
Why is the output of map tasks stored (spilled) to local disk and not in HDFS?