How big is ‘big data’?
What do you know about SequenceFileInputFormat?
Why can we not do aggregation (addition) in a mapper? Why do we need a reducer for that?
Why the name ‘Hadoop’?
What do you know about NLineInputFormat?
How can we change the split size if our commodity hardware has less storage space?
How is Hadoop different from other data processing tools?
What happens in TextInputFormat?
Can you explain how ‘map’ and ‘reduce’ work?
Can we call VMs pseudos?
What do the master class and the output class do?
What does the JobConf class do?
What do you know about KeyValueTextInputFormat?
What is a NameNode?
Explain the hadoop-core configuration.