Mention how Hadoop is different from other data processing tools?
Mention what are the most common input formats defined in Hadoop?
Mention what is rack awareness?
Mention what is the difference between an RDBMS and Hadoop?
Mention what are the three modes in which Hadoop can be run?
Explain how the Hadoop classpath plays a vital role in starting or stopping the Hadoop daemons?
Explain what are storage and compute nodes?
Explain how the JobTracker schedules a task?
Explain what happens when Hadoop spawns 50 tasks for a job and one of the tasks fails?
Explain how data is partitioned before it is sent to the reducer if no custom partitioner is defined in Hadoop?
Explain what is a TaskTracker in Hadoop?
Explain what is speculative execution?
Explain what happens in text format?
Mention what is the distributed cache in Hadoop?