Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
Explain Spark countByKey() operation?
What is a spill factor with respect to the ram?
When to use secondary indexes?
Explain how does hbase actually delete a row?
What is zeromq?
What is combiner aggregator?
Can impala be used for complex event processing?
What is sink processors?
Ideally what should be the block size in hadoop?
Highlight the key differences between MapReduce and Apache Pig?
Mention what does the text input format do?
Can Hadoop be compared to NOSQL database like Cassandra?
What is a rack?
Can you explain hadoop streaming?
Explain Working of MapReduce?