Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What do you mean by Stream Processing in Kafka?
What are different modes of execution in Apache Pig?
What do masters consist of?
What are the downsides of Spark?
What is Zookeeper Cluster?
What does serdes mean in apache kafka?
What is Apache Spark Streaming?
What is the primary objective of NoSQL databases?
When to use secondary indexes?
Explain the Differences between Hive and Spark SQL?
What are the different components of Cassandra?
Explain the Features of HBase?
Can hadoop handle streaming data?
What is the difference between Hiveserver1 and Hiveserver2?
Does the hdfs client decide the input split or namenode?