Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is flatmap in apache spark?
What is the role of the offset.
Define the run-time architecture of Spark?
Do we need hadoop for spark?
How ordering in hdfs is finished?
What happens when the node running the map task fails before the map output has been sent to the reducer?
Why use hadoop?
How HDFS helps NameNode in scaling in Hadoop?
What is bookkeeper?
What is the difference between client mode and cluster mode in spark?
what are the steps involved in decommissioning removing
What is flatten in pig?
What is spark lineage?
What are different logging levels in cassandra?
What does rack awareness mean?