Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is an RDD?
Why do we need Spark?
How does the JobTracker schedule a job for the TaskTracker?
What are the main hdfs-site.xml properties?
How can you reduce churn in the ISR? When does a broker leave the ISR?
What is the Hive Data Definition Language?
What is the difference between Apache Mahout and PredictionIO?
Why does MapReduce use key-value pairs to process data?
How does speculative execution work in Hadoop?
Why is the block size large in Hadoop?
What is the difference between Hive and HDFS?
What are the uses and applications of Mahout?
Who is the intended audience for learning HCatalog?
Which operating system(s) are supported for production Hadoop deployments?
What are combiners, and what is their purpose?