Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
Shouldn't DFS be able to handle large volumes of data already?
Explain the filter transformation.
In which scenario is Pig a better fit than MapReduce?
What are the differences between the caching and persistence methods in Apache Spark?
What is the role of ZooKeeper in HBase?
What exactly does Kafka do?
What is off-heap memory in Spark?
What is a BookKeeper client?
What are combiners, and when should you use a combiner in a MapReduce job?
What do you understand by Filters in HBase?
What do you understand by standalone (or local) mode?
What is the name of the ZooKeeper daemon?
What is a tuple?
What is ZooKeeper in Kafka? Can we use Kafka without ZooKeeper?
Name some of the best features of Ambari.