Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Can you define a checkpoint?
What is isr?
What is spark configuration?
Explain the maximum size of a message that can be received by the Kafka?
How to transfer data from Hive to HDFS?
What is spark technology?
Specify Cassandra’s importance on Facebook?
Name the languages which are supported by apache spark and which one is most popular?
Does Partitioner run in its own JVM or shares with another process?
How is Spark not quite the same as MapReduce? Is Spark quicker than MapReduce?
What are the different modes in which PIG can run and explain those?
Mention what are the main components of cassandra data model?
What is rack awareness in hadoop?
What database are supported by Hive?
Explain about catalog configuration?