Define a datanode?
Whats is distributed cache in hadoop?
What is hadoop framework?
Define streaming?
Can hive run without hadoop?
What is a spill factor with respect to the ram?
Which are the two types of 'writes' in HDFS?
What are combiners and its purpose?
What are the core components of Apache Hadoop?
What is Apache Hadoop YARN?
Define a job tracker?
What is the problem with small files in Apache Hadoop?
Explain the core components of hadoop?
What exactly is hadoop?
What is the default replication factor?