Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is the MapReduce algorithm?
What is the Hive installation path?
How many types of RDD are there in Spark?
What is the spill factor with respect to RAM?
What does Hadoop do in safe mode?
What is a DStream in Apache Spark?
Can you change the block size of HDFS files?
How many partitions are created by default in an Apache Spark RDD?
What is meant by a columnar storage format?
How can Spark be connected to Apache Mesos?
What is Hector?
What are the core components of Apache Hadoop?
Why do I have to use REFRESH and INVALIDATE METADATA, and what do they do?
What problem does Apache Pig solve?
What does the term "non-DFS used" mean in the Hadoop web console?