Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Define streaming?
How does a namenode handle the failure of the data nodes?
Explain how cassandra writes changed data into commitlog?
What is RDD lineage graph? How does it enable fault-tolerance in Spark?
Clarify what is sqoop in hadoop?
With the help of two examples name the map and reduce function purpose
What is CONCATENATE command in Hive?
What is metadata storage service in bookkeeper?
How can you use producer api code?
Give examples of the SerDe classes whihc hive uses to Serializa and Deserilize data?
What is ZooKeeper Client?
Is spark good for machine learning?
What does conf.setmapper class do?
How does spark program work?
What is spark written?