Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Define compaction in HBase?
What is the optimal block size in HDFS?
Which one will you decide for an undertaking – Hadoop MapReduce or Apache Spark?
What are the functions of "Spark Core"?
what are the different modes of Hive?
What is a block in HDFS, why block size 64MB?
What is spark vs hadoop?
Can multiple clients write into a Hadoop HDFS file concurrently?
What is aws spark?
What is faster than apache spark?
What is dataframe api?
How can one check whether NameNode is working or not?
What was the design goal of Cassandra?
explain Metadata in Namenode?
What is broadcast variable?