What is a node in Cassandra?
What is clustering in Hive?
What is a worker node?
Is it possible for multiple users to use the same metastore in the case of embedded Hive?
What is distributed copy (distcp)?
What are the main properties of hdfs-site.xml file?
Can you explain what a recommendation engine is?
What job does the conf class do?
Can I do insert … select * into a partitioned table?
How does the JobTracker schedule a job for the TaskTracker?
How is an RDD fault-tolerant?
What is NG in Flume?
Which operating systems does Cassandra support?
How is Apache Spark better than Hadoop?
How would you drop a table in Hive?