Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
If you run a select * query in hive, why does it not run mapreduce?
What is the History of Cassandra Database ?
Why HDFS stores data using commodity hardware despite the higher chance of failures in hadoop?
What is Ambari shell?
What is Distributed Cache?
What are components of ambari tjat are important for automation and integration?
What is the role zookeeper plays in a cluster of kafka?
What is a shuffle block in spark?
How to open a connection in hbase?
What do you understand about yarn?
Compare apache pig and sql?
What are different tombstone markers in hbase?
Explain values() operation in apache spark?
What is the difference between nas (network attached storage) and hdfs?
Whats the default port that jobtrackers listens ?