Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What alternate way does HDFS provide to recover data in case a NameNode
Describe how HBase uses ZooKeeper.
How can you debug Hadoop code?
Explain the count() action on a Spark RDD.
Who invented Spark?
What is a metastore in Hive?
Does Pig support multi-line commands?
Which Java class handles encoding output records into the files that result from Hive queries?
What is spooldir in Flume?
What are Flume and Sqoop?
How do you write a custom key class?
What are brokers in Kafka?
What are the different life cycle commands in Ambari?
Is it possible to use Kafka without ZooKeeper?
Why is Cloudera used?