Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is accumulator in spark?
How to iterate all rows in ColumnFamily?
Do you know the comparative differences between apache spark and hadoop?
Explain the difference between COUNT_STAR and COUNT functions in Apache Pig?
What is shuffle in spark?
Explain the features of pseudo mode?
Is spark sql faster than hive?
Why is Cassandra popular? Clarify.
Which Sorting algorithm is used in Hadoop MapReduce?
What operations RDD support?
Does spark store data?
What is a pipelinedrdd?
Is hive similar to sql?
Is scala required for spark?
What does producer api in kafka?