Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
What do you mean by data locality?
What is Block in HDFS?
Why do the nodes are removed and added frequently in a hadoop cluster?
Why ‘Reading‘ is done in parallel and ‘Writing‘ is not in HDFS?
Explain InputSplit in Hadoop MapReduce?
While reading data from hbase, from which three places data will be reconciled before returning the value?
Who uses Cassandra?
What is the difference between structured and unstructured big data?
What are the optimization techniques in spark?
Why is Cassandra popular? Clarify.
List out some key features of apache cassandra?
What is apache spark and what is it used for?
Why hive does not store metadata information in hdfs?
Explain the difference between a MapReduce InputSplit and HDFS block?
How does a client read/write data in HDFS?