Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) Clarify what combiners are and when you should utilize a combiner in a map reduce job?
What is flume instagram?
What does job conf class do?
what is distributed cache in mapreduce framework?
How blocks are distributed among all data nodes for a particular chunk of data?
How much Metadata will be created on NameNode in Hadoop?
What are the exception handling operators in Pig script?
Where is spark rdd?
What purpose would an engineer use spark?
What is the utility of using Writable Comparable Custom Class in Map Reduce code?
How does speculative execution work in Hadoop?
Mention what is data cleansing?
What is salary of hadoop developer?
What are the important differences between apache and hadoop?
Explain the role of the offset?