Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What does Apache Spark stand for?
What is shuffling in MapReduce?
What are some alternatives to Apache Kafka?
What is a broker?
What is the difference between an input split and an HDFS block?
What is ObjectInspector functionality?
How does Spark run Hadoop?
Can you explain the benefits of Spark over MapReduce?
What is the difference between Internal Table and External Table in Hive?
What is a key-value store database? Explain with an example.
What is the difference between Input Split and an HDFS Block?
What do you understand by SchemaRDD?
Define data integrity.
What are the components of a Hive query processor?
What is "GraphX" in Spark?