What are the types of data flow in Flume?
What is a ZooKeeper cluster?
What are Flume and Sqoop?
What main configuration parameters are specified in MapReduce?
What is the best practice for deploying the Secondary NameNode?
If a DataNode is full, how is it identified?
Explain the write-ahead log (journaling) in Spark.
What is the benefit of Kafka?
What is the Spark shuffle service?
What are column families in HBase?
State the difference between Spark SQL and HQL.
Can you list the limitations of using Apache Spark?
What are the drawbacks of Apache Spark?
How do you delete directories and files recursively from HDFS?
Explain the HBaseStorage function.