Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is the role of ZooKeeper in a Kafka cluster?
How does one create RDDs in Spark?
What is the importance of the --split-by clause in running parallel import tasks in Sqoop?
What is the prepare() method in Cassandra?
What is a Mapper in Hadoop?
What are the components of the Impala architecture?
What are pair RDDs?
What is the difference between Flume and Sqoop?
Explain Apache HBase.
Why do we need Hadoop for big data analytics?
What is the history of Apache Mahout? When did it start?
What are the various input and output types supported by MapReduce?
Can you define a block and a block scanner in HDFS?
What are the different composite keys in Cassandra?
What are the prerequisites for contributing to Apache Mahout?