Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What does shuffling do?
What is ZooKeeper Atomic Broadcast (ZAB) protocol?
What is a bag in Apache Pig?
What are InputSplit and RecordReader, and what do they do?
Is Spark better than Hadoop?
How does Cassandra perform a read operation? Explain.
What is the importance of Java in Apache Kafka?
What are the ways to create RDDs in Apache Spark? Explain.
What is a combiner?
What is a TaskTracker in Hadoop?
Which Hadoop features are extended to its ecosystem components?
How do you use the Apache ZooKeeper command-line interface?
What is throughput in Hadoop?
What are the options and process for upgrading ZooKeeper?
How is Spark different from Hadoop?