What happens to zk sessions while the cluster is down?
How Spark handles monitoring and logging in Standalone mode?
Mention what is the data storage component used by hadoop?
what does /*streamtable(table_name)*/ do?
What is the key- value pair in MapReduce?
What is NoSQL?
What are the different Primitive Data Types available in Hive?
It can be possible that a Job has 0 reducers?
Explain what happens if, during the PUT operation, HDFS block is assigned a replication factor 1 instead of the default value 3?
Can ambari manage multiple clusters and why?
What are the main hdfs-site.xml properties?
What happens when the node running the map task fails before the map output has been sent to the reducer?
Does spark use yarn?
Give some advantages of Cassandra?
What are the configuration parameters in the 'MapReduce' program?