Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What does connector api in kafka?
Why is flume used?
What is hadoop, hbase, hive and cassandra? Specify similarities and differences among them.
how will you implement SQL in Spark?
How many types of tunable consistency are supported in Cassandra?
What are the data manipulation commands of hbase?
What are benefits of Spark over MapReduce?
Describe Partition and Partitioner in Apache Spark?
Can we change the document present in hdfs?
How many Reducers run for a MapReduce job in Hadoop?
What is map in spark?
Replication causes data redundancy then why is is pursued in HDFS?
How is the splitting of file invoked in Hadoop framework?
What is the role of a zookeeper in a kafka cluster?
Why should we use presto?