What is difference between split and block in hadoop?
How many JVMs run on a slave node?
What is a block and block scanner in HDFS?
What is compute and Storage nodes?
how would you modify that solution to only count the number of unique words in all the documents?
How can you overwrite the replication factors in HDFS?
What are the modules that constitute the Apache Hadoop 2.0 framework?
Knox and Hadoop Development Tools?
In which location Name Node stores its Metadata and why?
Is a job split into maps?
What are different hdfs dfs shell commands to perform copy operation?
If a data Node is full how it's identified?
How many filters are available in HBase?
What are the site-specific configuration files in Hadoop?
On what basis data will be stored on a rack?