Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What is difference between secondary namenode, checkpoint namenode & backupnod secondary namenode, a poorly named component of hadoop?
763What mechanism does hadoop framework provides to synchronize changes made in distribution cache during runtime of the application?
714Did you ever ran into a lop sided job that resulted in out of memory error, if yes then how did you handled it ?
474
Explain how mapreduce works.
What is the difference between External and Internal Table in Hive?
What are the problems with Hadoop 1.0?
What is throughput in HDFS?
State benefits of Hadoop users by using Apache Ambari?
State the difference between Spark SQL and Hql
What is the InputFormat ?
Say what the views are in hive?
Give me examples of unstructured data?
How is impala metadata managed?
List out the different stream grouping in apache storm?
When would you use hbase?
What are the types of traditional method of message transfer?
Why is BlinkDB used?
What is rack-aware replica placement policy?