If MapReduce is inferior to Spark, is there any benefit to learning it?
What are broadcast variables in Apache Spark? Why do we need them?
What are Hive's default read and write classes?
What is a TaskTracker in Hadoop?
What is Apache Mahout?
What are the benefits of using Spark with Apache Mesos?
Can you explain Hadoop Streaming?
What is Apache Spark and what are the benefits of Spark over MapReduce?
How does Spark achieve fault tolerance?
What is the ObjectInspector functionality in Hive?
What common mistakes do developers make when using Apache Spark?
Is the following approach correct? Is sqrtOfSumOfSq a valid reducer?
How do transformations and actions compare in Apache Spark?
What are the major differences between Hadoop 2 and Hadoop 3?
What is a table-generating function in Hive?