Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What daemons run on master nodes?
Explain the HBase Meta Table.
Can multiple clients write to an HDFS file simultaneously?
What is cluster mode in Spark?
Differentiate between Reducer and Combiner in Hadoop MapReduce.
How can you trigger automatic clean-ups in Spark to handle accumulated metadata?
What is Distributed Cache in Hadoop?
What problems have you faced while working on Hadoop code?
Differentiate between GROUP and COGROUP operators?
What is the physical plan in Pig architecture?
How can you override the replication factor in HDFS?
What is Cassandra data modelling?
How can you connect an application
What is skewed data?
What is the difference between Spark and Hadoop?