How Big is ‘Big Data’?
Can you explain rack awareness?
Explain the key benefits of using storm for real time processing?
Mention what are the most common input formats defined in hadoop?
What is the difference between namenode and datanode in hadoop?
What is KeyValueTextInputFormat in Hadoop?
What are 'slaves' and 'masters' in Hadoop?
Mention what is the difference between an rdbms and hadoop?
Ideally what should be the replication factor in hadoop?
List of some best tools that can be useful for data-analysis?
what if job tracker machine is down?
What are the limitations of Hadoop?
Is it possible to provide multiple input to Hadoop? If yes then how?
Clarify what a task tracker is in hadoop?
What happens when two clients try to access the same file in the hdfs?