Hadoop (4218)
Big Data General (104)
Big Data AllOther (3) What is a “Distributed Cache” in Apache Hadoop?
What does the high availability of a name-node means? How is it accomplished?
How businesses could be benefitted with Big Data?
What is difference between map and flatmap?
Explain the CLI In Zookeeper?
What is Chain Mapper?
How to set mappers and reducers for MapReduce jobs?
What is a IdentityMapper and IdentityReducer in MapReduce ?
Explain small file problem in hadoop
When does impala hold on to or return memory?
What is the utilization of hcatalog?
Suppose that your data is stored in collections, for instance, some binary data, message data or metadata is all keyed on the same value. Will you use HBase for this?
What is the use of cassandra and why to use cassandra?
Can you explain the core methods of a reducer?
What is the relation between MapReduce and Hive?