Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What platform and Java version are required to run Hadoop?
Which object can be used to get the progress of a particular job?
Can HDFS fail? If so, how?
How can I speed up my Spark jobs?
How do you copy data between clusters in HDFS?
What are the different ZooKeeper client bindings?
How is indexing done in HDFS?
What is the difference between an input split and an HDFS block?
What are the main configuration parameters in a MapReduce program?
How can you use Akka with Spark?
What does the reduce action do?
What is a Combiner?
What is the difference between Pig Latin and the Pig Engine?
What are the components of the Apache Spark ecosystem?
What is SequenceFileInputFormat in Hadoop?