Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is a spill factor with respect to the ram?
Can you overwrite Hadoop MapReduce configuration in Hive?
What is the biggest shortcoming of Spark?
Explain the differences between a combiner and reducer
What do you mean by the high availability of a namenode? How is it achieved?
Name different types of primary keys in Cassandra?
In Hadoop what is InputSplit?
Define a daemon?
What is a nosql database?
On what basis name node distribute blocks across the data nodes?
What is identity mapper and identity reducer?
What is heartbeat in hadoop?
Is Pig script case sensitive?
Explain how indexing in hdfs is done?
What is apache hcatalog?