Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Do I need to learn Scala for Spark?
How do you create a Hadoop archive?
How will you implement joins in HBase?
What is serialization in Spark?
What is anti-entropy, and how is it associated with a Merkle tree?
Explain the shuffle.
What is Sqoop in Hadoop?
Can you list the limitations of using Apache Spark?
In how many ways can we create an RDD in Spark?
How is RDD in Spark different from Distributed Storage Management?
Explain the processing speed difference between Hadoop and Apache Spark.
How does HDFS provide good throughput?
What is the importance of eval tool?
What is meant by RDD in Spark?
What is the main difference between Kafka and Flume?