Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
What is a checkpoint?
What is spark table?
Name the management tools in Cassandra?
Mention what are the most common input formats defined in hadoop?
Can you define rdd lineage?
What is jmx? And how is it useful in cassandra?
What is the replication factor?
What do you understand by Data Replication in Cassandra?
What is the use of Bloom Filter in Cassandra?
Explain data flow in Flume?
What is the relationship between hdfs, hbase, pig, hive and azkaban?
Hdfs stores data using commodity hardware which has higher chances of failures. So, how hdfs ensures the fault tolerance capability of the system?
Where does spark plug get power?
What is partitioning in MapReduce?
What is an identity mapper and identity reducer?