Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Explain WAL in HBase?
What is the primary objective of NoSQL databases?
What is CAP Theorem? What aspects does Hadoop support from this theorem?
How is streaming implemented in Spark?
How will you write a custom partitioner for a Hadoop job?
Why HDFS?
Name some companies that are already using Spark Streaming?
What are the data components used by Hadoop?
Define Thrift?
Is Hadoop mandatory for Spark?
List the various components in Kafka?
How do you run Pig scripts on a Kerberos-secured cluster?
How do you handle compression in Pig?
What are combiners, and when should you use a combiner in a MapReduce job?
What is Apache Flume used for?