Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What are main APIs of Kafka?
Why spark is used?
Tell me some major benefits of Hadoop?
According to IBM, what are the three characteristics of Big Data?
Mention what needs to be taken care while adding a column?
What is spark yarn executor memoryoverhead?
What is apache hcatalog?
What are producer-consumer queues?
Does Hadoop requires RAID?
Can you explain worker node?
How can I install Cloudera VM in my system?
What are accumulators in spark?
What is rdd lineage graph? How is it useful in achieving fault tolerance?
What are the operational commands of HBase?
How is data represented in Spark?