What is the JobTracker, and what does it do in a Hadoop cluster?
How many instances of TaskTracker run on a Hadoop cluster?
What is the role of the TaskTracker in a Hadoop cluster?
How does the NameNode handle the failure of DataNodes?
What do you know about SequenceFileInputFormat?
Why can't we do aggregation (addition) in a mapper? Why do we need a reducer for that?
Why the name ‘Hadoop’?
What do you know about NLineOutputFormat?
How can we change the split size if our commodity hardware has limited storage space?
How is Hadoop different from other data processing tools?
What happens in TextInputFormat?
Can you explain how ‘map’ and ‘reduce’ work?
Can we call VMs pseudos?
What do the master class and the output class do?
What does the JobConf class do?