Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is the purpose of the RawComparator interface?
When should you choose an "External Table" in Hive?
What is rack awareness?
How can Hive improve performance with ORC-format tables?
What is the role of ZooKeeper in a Kafka cluster?
What are the limitations of Spark?
When should you use Hadoop, HBase, Hive, and Pig?
What happens if you alter the block size of a column family on an already populated database?
What is the significance of the "IF EXISTS" clause when dropping a table?
Explain the keys() operation in Apache Spark.
What is the difference between Hive and HBase?
What is a "map" in Hadoop?
If we already have SQL, why do we need NoSQL?
Can you explain the term "Cassandra"?
What is Sqoop?