Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How does a client read/write data in HDFS?
Is there any benefit of learning MapReduce, then?
What is a Spark database?
What is a MapFile?
Why is HBase a schema-less database?
What is the meaning of replication factor?
What is the use of filters in Apache Pig?
What does it indicate if a replica stays out of the ISR for a long time?
What is a Secondary NameNode? Is it a substitute for the NameNode?
What is the difference between a leader and a follower in Kafka?
What is client mode in Spark?
How do you submit extra files (JARs, static files) for a MapReduce job at runtime?
How is Hadoop different from Spark?
What is the difference between an external table and an internal table in Hive?
Can an RDD be shared between SparkContexts?