Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What is a rack awareness algorithm?
Which code do we use to open the connection in Hbase?
Explain the process for starting a kafka server?
Which is the best spark certification?
What are the three layers where the hadoop components are actually supported by ambari?
What is shuffle read and shuffle write in spark?
What is Output Format in MapReduce?
How HCatalog helps to capture processing states to enable sharing?
Which is better hadoop or spark?
What are benefits of Spark over MapReduce?
How to copy file from HDFS to local?
Is apache spark an etl tool?
What is Hive query processor?
Which are the elements of kafka?
What is Fault Tolerance in HDFS?