Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) Do we need to install spark in all nodes?
What is an "Accumulator"?
What is a rack awareness algorithm?
What is Hive Data Definition language?
What is mahout hadoop?
What are the different Complex Data Types available in Hive?
What is the use of ycsb?
Explain Hadoop Archives?
What is the reason for creating a new metastore_db whenever Hive query is run from a different directory?
Have you ever used counters in hadoop?
How do you process big data with spark?
Does cassandra support acid tractions?
Why does my insert statement fail?
Which data storage components are used by hadoop?
What is gossip protocol in Cassandra?