Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How does impala compare to hive and pig?
What is executor memory and driver memory in spark?
When running Spark applications, is it necessary to install Spark on all the nodes of YARN cluster?
Is rdd type safe?
What is the role of “ambari-qa” user?
Explain the difference between NameNode
Explain Apache Kafka Use Cases?
What are the different commands used to startup and shutdown Hadoop daemons?
Is hbase an os independent approach?
Explain some Disadvantages of Avro?
What are consumers in kafka?
How data is spilt in Hadoop?
What does rack awareness mean?
How is HCatalog different from Hive?
how you can get exactly once messaging from Kafka during data production?