Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is Apache Flume?
What is the process of creating an Ambari client?
Can you list a few commonly used Hive services?
Can you define a Parquet file?
Name some independent extensions that contribute to the Ambari codebase.
Is the Secondary NameNode a substitute for the NameNode?
How are sparks created?
What is the use of flatMap in Spark?
What is a reduce-side join in MapReduce?
Is Apache Spark a tool?
Have you ever used Counters in Hadoop? Give us an example scenario.
What is a "SerDe" in Hive?
What are the various input and output types supported by MapReduce?
What is the use of HCatalog?
What are the traditional methods of message transfer?