Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Does spark require hdfs?
What are barriers?
What are the different Eval functions available in Pig?
Double type in Hive - Important points?
What is the role of “ambari-qa” user?
What are different modes of metastore deployment in Hive?
Enlist the several components in Kafka?
What platform and java version are required to run hadoop?
Explain how you can improve the throughput of a remote consumer?
What are all stats classes in the org.apache.pig.tools.pigstats package?
What is difference between spark and mapreduce?
What is the use of mysql connector?
Why do people use spark?
what is the traditional method of message transfer?
Why should we use presto?