Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is the procedure for NameNode recovery?
What are the different Eval functions available in Pig?
What are the key features of a NoSQL database?
What is the full form of RDD?
What are the other components of Cassandra?
How do you iterate over all rows in a ColumnFamily?
What are the main methods of transferring data in Hadoop Sqoop?
What are the different execution modes available in Pig?
What is JMX, and how is it useful in Cassandra?
What does FLATTEN do in Pig?
What daemons run on master nodes?
What, in your view, is a common mistake Apache Spark developers make when using Spark?
Does Apache Spark provide checkpoints?
Why is HDFS used for applications with large data sets rather than many small files?
Can you mention some features of Spark?