Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is spark reducebykey?
What is a tuple in pig?
What is the use of map transformation?
What are Guarantees provided by Kafka?
What is a commodity hardware? Does commodity hardware include RAM?
Is a distributed machine learning framework on top of spark?
explain the key features of Apache Spark?
What are consumers in kafka?
Explain various Apache Spark ecosystem components. In which scenarios can we use these components?
What are the differences between a node, a cluster, and datacenter in Cassandra?
What is the use of context object?
How do I download and install spark?
What is namenode?
Explain the process to trigger automatic clean-up in Spark to manage accumulated metadata.
Can you explain about the cluster manager of apache spark?