Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is Ganglia used for in Ambari?
Name a few companies that use Apache Spark.
What does a 'MapReduce Partitioner' do?
Explain the level of parallelism in Spark Streaming.
What is meant by speculative execution in Apache Spark?
Is it possible to add 100 more nodes when we already have 100 nodes in Hive?
Explain the difference between NAS and HDFS.
Can Spark work without Hadoop?
When you point a partition of a Hive table to a new directory, what happens to the data?
What are the features of Kafka Streams?
What is hotspotting in HBase?
What is a combiner, and when should you use one in a MapReduce job?
What is Spark MLlib?
What is a JMX connector?
What are the MapReduce types and formats, and how do you set up a Hadoop cluster?