Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What are accumulators in Apache Spark?
What are the Benefits Of Distributed Applications?
Define speculative execution?
What are the majorly used commands in sqoop?
What is the use of context object?
How the HDFS Blocks are replicated?
How does gossip protocol work?
What is column families? What happens if you alter the block size of ColumnFamily on an already populated database?
What does job conf class do?
What is flume and sqoop?
Do we need hadoop for spark?
What are different hdfs dfs shell commands to perform copy operation?
What is spark context spark session?
What is the meaning of the term "non-DFS used" in Hadoop web-console?
What is hadoop, hbase, hive and cassandra? Specify similarities and differences among them.