What is the basic difference between traditional RDBMS and Hadoop?
How many daemon processes run on a hadoop cluster?
What is Fault Tolerance?
What type of data we should put in distributed cache? When to put the data in dc? How much volume we should put in?
Explain what is hadoop?
Explain Erasure Coding in Hadoop?
What does name-node mean in hadoop?
What is the number of default partitioner in hadoop?
How to do ‘map’ and ‘reduce’ works?
What do you understand by standalone (or local) mode?
Is Namenode machine same as DataNode machine as in terms of hardware?
What is throughput in Hadoop?
Why do we need Hadoop?
Can we change the file cached by distributed cache
According to IBM, what are the three characteristics of Big Data?