Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Which one is better, Hadoop or Spark?
Is it necessary to write a MapReduce job in Java?
In MapReduce, ideally how many mappers should be configured on a slave?
What is shuffling in MapReduce?
What languages are supported by Apache Spark?
Is it possible for a job to have 0 reducers?
When to use Hadoop, HBase, Hive, and Pig?
Explain the functionality of Ganglia in Ambari.
What causes sparks?
Are the NameNode and JobTracker on the same host?
Why do we need indexing?
What are the core APIs in Kafka?
Can you explain commodity hardware?
What does the Connector API do in Kafka?
What are “Seed Nodes” in Cassandra?