Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Where sorting is done in Hadoop MapReduce Job?
What do you understand by the super column in cassandra?
What are Flume events?
What advantages does Spark offer over Hadoop MapReduce?
Explain countByValue() operation in Apache Spark RDD?
What are the Basics of Hadoop?
What does secondary name-node means?
What is the ZooKeeper ensemble?
Compare MapReduce and Spark?
What is Writable & WritableComparable interface?
What are the file formats supported by spark?
Did you ever ran into a lop sided job that resulted in out of memory error, if yes then how did you handled it ?
State benefits of Hadoop users by using Apache Ambari?
What are different hdfs dfs shell commands to perform copy operation?
What is the current version of Hive?