Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How do ‘map’ and ‘reduce’ work?
Why is spark used?
What square measure the options of apache mahout?
When does queuefullexception occur?
Can spark be used without hadoop?
Explain some Disadvantages of Avro?
Define composite key?
What is difference between a MapReduce InputSplit and HDFS block
What are some of the apache pig use cases you can think of?
What advantages does Spark offer over Hadoop MapReduce?
What are use cases of Apache Flume?
How to configure the number of the Combiner in MapReduce?
Did you ever ran into a lop sided job that resulted in out of memory error
How do you process big data with spark?
Explain about transformations and actions in the context of RDDs.