Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What are the advantages of pig language?
What is apache presto?
How does Hadoop Classpath plays a vital role in stopping or starting in Hadoop daemons?
What is zeromq?
Is hive an impala requirement?
What is combiner aggregator?
Why would nosql be better than using a sql database? And how much better is it?
What do you mean by schema on read?
What platform and Java version is required to run Hadoop?
How Apache Pig deals with the schema and schema-less data?
Does spark use yarn?
How can you add a new partition for the month December in the above partitioned table?
Define primary key in Apache Cassandra?
What are the 2 modes used to run pig scripts?
Can you explain the term, Cassandra?