How is mapreduce related to cloud computing?
Define the purpose of the partition function in mapreduce framework
What are the various input and output types supported by mapreduce?
Explain task granularity
Explain the general mapreduce algorithm
What is cassandra database used for?
List out some key features of apache cassandra?
What do you understand by bloom filter in cassandra?
What is the relationship between apache hadoop, hbase, hive and cassandra?
What do you understand by data center in cassandra?
What do you understand by column family?
What do you understand by composite type?
What are "coordinator nodes" in cassandra?
What do you understand by nosql cap theorem?
What do you understand by cql?