Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Say when to pick “inward table” and “outside table” in hive?
What happens if there is an error in impala?
How will you submit extra files or data ( like jars, static files, etc. ) For a mapreduce job during runtime?
Can you explain bloommapfile.
What is accumulator in spark?
How Cassandra provide High availability feature?
Explain why do we need hadoop?
What is the difference between python and spark?
Does the archiving of hive tables give any space saving in hdfs?
Why HDFS performs replication, although it results in data redundancy in Hadoop?
How the write operation is performed on Cassandra node ?
Do I need to install hadoop for spark?
What is apache spark good for?
What kind of applications is supported by Apache Hive?
How much is flume worth?