Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What is difference between flume and kafka?
Discuss the role of Spark driver in Spark application?
What is a difference between an input split and hdfs block?
Does Hive support record level Insert, delete or update?
Can you explain the common input formats in hadoop?
Doesn’t google have its very own version of dfs?
What is Sqoop Job?
What is a row in cassandra?
Explain first() operation in Apache Spark RDD?
What are channel selectors?
How multi-hop agent can be setup in Flume?
What is bookkeeper?
explain apache hbase?
Should I install spark on all nodes of yarn cluster?
What is spark checkpointing?