Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Explain the Reducer's Sort phase?
what is composite type in cassandra?
What is the difference between apache mahout and prediction.io ?
What are the main components of a Hadoop Application?
What is the advantage of cassandra?
What is Flume Client?
Explain the various Transformation on Apache Spark RDD like distinct(), union(), intersection(), and subtract()?
What is the difference between Apache Pig and Hive?
Mention what is the data storage component used by hadoop?
Is spark and hadoop same?
What are the languages supported by apache spark?
What is spark yarn executor memoryoverhead?
What are the benefits of lazy evaluation?
What is the purpose of sqoop-merge?
shouldn't DFS be able to handle large volumes of data already?